简体   繁体   中英

Computer vision: how to create proper bounding boxes

I'm creating a dataset to train a Yolov5 model to recognize tabular data. I want to clarify bounding box best practices. Would differences in bounding box tightness in the following two images affect the model's mAP?

Somewhat tight bounding boxes:

在此处输入图像描述

Tight bounding boxes:

在此处输入图像描述

Original image:

在此处输入图像描述

Since it takes more time to create the tighter boxes, I want to check whether it's worth the extra effort.

A precise and tight bounding box always helps in focus and learning more relevant features specific to object of interest. In this scenario, tight bounding box annotation would help in following ways:

  • learning text edge based object bounding box and ignore the background features which will also help in reducing bounding box overlap with nearby text lines.
  • Annotated bounding box will also make the model learn the text line spatial shape features(width, height)

Check this link to understand best practices for object detection annotation https://nanonets.github.io/tutorials-page/docs/annotate

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM