简体繁体 English

使用深度学习识别场景

[英]Recognize scene with deep learning

原文 2019-05-10 10:54:38 1 1 tensorflow/ machine-learning/ image-processing/ keras/ deep-learning

What is the approach to recognize a scene with deep learning (preferably Keras).使用深度学习（最好是 Keras）识别场景的方法是什么。 There are many examples showing how to classify images of limited size eg dogs/cats hand-written letters etc. There are also some examples for the detection of a searched object within a big image.有许多示例展示了如何对有限大小的图像进行分类，例如狗/猫、手写字母等。还有一些示例用于检测大图像中的搜索对象。

But, what is the best approach to recognize eg is it a class-room, bed-room or a dinning room?但是，识别例如它是教室、卧室还是餐厅的最佳方法是什么？ Create a data-set with that images?使用该图像创建数据集？ I think no.我觉得不行。 I think one should train a model with many things, which may appear in the scene, create a vector of the found things in the analysed image and using the second classifier (SVM or simple NN) classify the scene.我认为应该训练一个包含许多事物的模型，这些事物可能会出现在场景中，在分析的图像中创建一个向量，并使用第二个分类器（SVM 或简单的 NN）对场景进行分类。 Is it a right approach?这是一个正确的方法吗？

PS: Actually, I'm facing another problem, which IHMO the same. PS：实际上，我面临另一个问题，这与 IHMO 相同。 My "scene" is a microscope image.我的“场景”是显微镜图像。 The images contain different sets of cells and artifacts.图像包含不同组的细胞和伪影。 Depending on a set, a doctor makes a diagnosis.根据一组，医生做出诊断。 So I aim to train a CNN with the artifacts, which I extract with a simple morphologicyl methods.所以我的目标是用我用简单的形态学方法提取的工件训练一个 CNN。 These artifacts (eg biological cells) will be my features.这些人工制品（例如生物细胞）将成为我的特征。 So the first level of the recognition - feature extraction is done by CNN, the later classification by SVM.所以识别的第一层——特征提取由CNN完成，后面的分类由SVM完成。 Just wanted be sure, that I'm not reinventing a wheel.只是想确定一下，我不是在重新发明轮子。

1 个解决方案

In my opinion the comparison between your room-scenes and the biological scenes differ.在我看来，你的房间场景和生物场景之间的比较是不同的。 Especially since your scene is a microscope image (probably of a limited predefined domain).特别是因为您的场景是显微镜图像（可能是有限的预定义域）。

In this case, pure classification should work (without seeing the data).在这种情况下，纯分类应该有效（无需查看数据）。 In other words the neural network should be able to figure out what it is seeing, without having you to hand-craft features (in case you need interpretability that's a whole new discussion).换句话说，神经网络应该能够弄清楚它看到的是什么，而无需您手工制作特征（以防您需要可解释性，这是一个全新的讨论）。

Also there are lots approaches for scene understanding in this paper .这篇论文中也有很多场景理解的方法。