简体   繁体   English

谁能解释 Object 检测论文的特征金字塔网络中的“非滑动窗口”声明?

[英]Can anyone explain about the "non-sliding window" statement in Feature Pyramid Networks for Object Detection paper?

Feature Pyramid Networks for Object Detection adopt RPN technique to create the detector, and it use sliding window technique to classify. Feature Pyramid Networks for Object Detection采用RPN技术创建检测器,并使用滑动window技术进行分类。 How come there is a statement for "non-sliding window" in 5.2 section? 5.2小节怎么会有“非滑动窗口”的说法?

The extended statement in the paper: 5.2.论文中的扩展语句: 5.2。 Object Detection with Fast/Faster R-CNN Next we investigate FPN for region-based (non-sliding window) detectors. Object 使用 Fast/Faster R-CNN 进行检测接下来我们研究基于区域(非滑动窗口)检测器的 FPN。

In my understanding, FPN using sliding window in detection task.在我的理解中,FPN在检测任务中使用滑动window。 This is also mentioned in https://medium.com/@jonathan_hui/understanding-feature-pyramid.networks-for-object-detection-fpn-45b227b9106c the statement is这个在https://medium.com/@jonathan_hui/understanding-feature-pyramid.networks-for-object-detection-fpn-45b227b9106c中也提到了声明是

"FPN extracts feature maps and later feeds into a detector, says RPN, for object detection. RPN applies a sliding window over the feature maps to make predictions on the objectness (has an object or not) and the object boundary box at each location." “FPN 提取特征图,然后输入检测器,RPN 说,用于 object 检测。RPN 在特征图上应用滑动 window 以预测对象性(是否具有 object)和每个位置的 object 边界框。 “

Thank you in advanced.提前谢谢你。

Feature Pyramid Networks(FPN) for Object Detection is not an RPN.用于 Object 检测的特征金字塔网络 (FPN) 不是 RPN。

FPN is just a better way to do feature extraction. FPN 只是一种更好的特征提取方法。 It incorporates features from several stages together which gives better features for the rest of the object detection pipeline (specifically because it incorporates features from the first stages which gives better features for detection of small/medium size objects).它结合了几个阶段的特征,为 object 检测管道的 rest 提供了更好的特征(特别是因为它结合了第一阶段的特征,为检测中小型物体提供了更好的特征)。

As the original paper states: "Our goal is to leverage a ConvNet's pyramidal feature hierarchy, which has semantics from low to high levels, and build a feature pyramid with high-level semantics throughout. The resulting Feature Pyramid Network is general purpose and in this paper we focus on sliding window proposers (Region Proposal Network, RPN for short) [29] and region-based detectors (Fast R-CNN) "正如原始论文所述:“我们的目标是利用 ConvNet 的金字塔特征层次结构,它具有从低到高级别的语义,并构建一个始终具有高级语义的特征金字塔。由此产生的特征金字塔网络是通用的,在这个paper we focus on sliding window proposers (Region Proposal Network, 简称RPN) [29] and region-based detectors (Fast R-CNN)

So they use it to check "Two stage" object detection pipeline.所以他们用它来检查“两阶段”object 检测管道。 The first stage is the RPN and this is what they check in section 5.1 and then they check it for the classification stage in section 5.2.第一阶段是 RPN,这是他们在 5.1 节中检查的内容,然后他们在 5.2 节中检查分类阶段。

Fast R-CNN Faster R-CNN etc.. are region based object detectors and not sliding window detectors. Fast R-CNN Faster R-CNN 等是基于区域的 object 检测器而不是滑动 window 检测器。 They get a fixed set of regions from the RPN to classify and thats it.他们从 RPN 中获得一组固定的区域来进行分类,仅此而已。

A good explanation on the differences you can see at https://medium.com/@jonathan_hui/what-do-we-learn-from-region-based-object-detectors-faster-r-cnn-r-fcn-fpn-7e354377a7c9 .您可以在https://medium.com/@jonathan_hui/what-do-we-learn-from-region-based-object-detectors-faster-r-cnn-r-fcn-fpn看到关于差异的很好解释-7e354377a7c9

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM