Tensorflow object 检测 api 得到按边界框坐标排序的预测

Question

I trained a neural network to solve simple captcha using tensorflow object-detection API, but when I output the predictions with the following code:我使用 tensorflow 对象检测 API 训练了一个神经网络来解决简单的验证码，但是当我使用以下代码进行 output 预测时：

for index, value in enumerate(classes[0]):
object_dict = {}
if scores[0, index] > threshold:
    object_dict[(category_index.get(value)).get('name').encode('utf8')] = scores[0, index]
    objects.append(object_dict)

I get predictions in random order with every function run.每次运行 function 时，我都会以随机顺序获得预测。 I asked a question earlier, and I was advised to try using the coordinates, but I could not find a way to connect the classes and the coordinates of the box that is associated with this class.我之前问了一个问题，有人建议我尝试使用坐标，但我找不到一种方法来连接与此 class 关联的框的类和坐标。 Example of solved captcha is attached, so I need a way to output predictions in order from left to right.附上已解决的验证码示例，因此我需要一种方法来按从左到右的顺序进行 output 预测。

Image图片

Answer 1

Given that boxes[0] is an array of shape num_boxes * 4, where the first value in the box is xmin, this can get you the indicies of the boxes, sorted by the one with the lowest xmin (the one that starts further left).鉴于boxes[0]是一个形状为 num_boxes * 4 的数组，其中框中的第一个值是 xmin，这可以为您提供框的索引，按 xmin 最低的那个（从更左边开始的那个）排序）。

indices = np.argsort(boxes[0][:,0])

Then you can use these indices to sort the boxes, scores, and classed, as follows:然后您可以使用这些索引对框、分数和分类进行排序，如下所示：

sorted_scores = scores[0][indices]
sorted_boxes = boxes[0][indices]
sorted_classes = classes[0] indicies

If you wanted to, for example, sort by xmax instead, you'd use np.argsort(boxes[0][:,2]) .例如，如果您想按 xmax 排序，则可以使用np.argsort(boxes[0][:,2]) 。 You can play around with using 0-3 to sort by xmin, ymin, xmax, and ymax.您可以使用 0-3 按 xmin、ymin、xmax 和 ymax 进行排序。

Tensorflow object 检测 api 得到按边界框坐标排序的预测

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-05-20 09:01:21

Tensorflow object 检测 api 得到按边界框坐标排序的预测

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-05-20 09:01:21

解决方案1
1 已采纳 2020-05-20 09:01:21