带有VNRecognizedObjectObservation的boundingBox框架不正确

Question

我在使用Core ML＆Vision显示识别对象周围的边界框时遇到问题。

水平检测似乎工作正常，但是，垂直框太高，越过视频的顶部边缘，不会一直到视频的底部，并且它不跟随视频的运动相机正确。 在这里您可以看到问题： https ： //imgur.com/Sppww8T

这是视频数据输出的初始化方式：

let videoDataOutput = AVCaptureVideoDataOutput()
videoDataOutput.alwaysDiscardsLateVideoFrames = true
videoDataOutput.videoSettings = [kCVPixelBufferPixelFormatTypeKey as String: Int(kCVPixelFormatType_420YpCbCr8BiPlanarFullRange)]
videoDataOutput.setSampleBufferDelegate(self, queue: dataOutputQueue!)
self.videoDataOutput = videoDataOutput
session.addOutput(videoDataOutput)
let c = videoDataOutput.connection(with: .video)
c?.videoOrientation = .portrait

我也尝试了其他视频方向，但没有取得多大成功。

执行愿景要求：

let handler = VNImageRequestHandler(cvPixelBuffer: image, options: [:])
try? handler.perform(vnRequests)

最后一旦处理请求。 viewRect设置为视频视图的大小：812x375（我知道，视频层本身有点短，但这不是问题）：

let observationRect = VNImageRectForNormalizedRect(observation.boundingBox, Int(viewRect.width), Int(viewRect.height))

我也尝试过做（有更多问题）的事情：

var observationRect = observation.boundingBox
observationRect.origin.y = 1.0 - observationRect.origin.y
observationRect = videoPreviewLayer.layerRectConverted(fromMetadataOutputRect: observationRect)

我试图尽可能多地削减我认为不相关的代码。

我实际上遇到过使用Apple示例代码的类似问题，当边界框不会按预期垂直移动对象时： https ： //developer.apple.com/documentation/vision/recognizing_objects_in_live_capture也许这意味着有一些问题与API？

Answer 1

我使用这样的东西：

let width = view.bounds.width
let height = width * 16 / 9
let offsetY = (view.bounds.height - height) / 2
let scale = CGAffineTransform.identity.scaledBy(x: width, y: height)
let transform = CGAffineTransform(scaleX: 1, y: -1).translatedBy(x: 0, y: -height - offsetY)
let rect = prediction.boundingBox.applying(scale).applying(transform)

这假设纵向和16：9纵横比。 它假设.imageCropAndScaleOption = .scaleFill 。

致谢：转换代码取自此repo： https ： //github.com/Willjay90/AppleFaceDetection

带有VNRecognizedObjectObservation的boundingBox框架不正确

问题描述

1 个解决方案

解决方案1
3 已采纳 2019-03-13 10:20:05

带有VNRecognizedObjectObservation的boundingBox框架不正确

问题描述

1 个解决方案

解决方案1 3 已采纳 2019-03-13 10:20:05

解决方案1
3 已采纳 2019-03-13 10:20:05