简体   繁体   English

CRF ++或CRFSuite

[英]CRF++ or CRFSuite

I'm starting to work with crf++ and crfsuite (both use a very similar file format). 我开始使用crf ++和crfsuite(两者都使用非常相似的文件格式)。 I want to do things related to images (segmentation, activiy recognition, etc). 我想做与图像有关的事情(分割,活动识别等)。 My main problem is how to build the training file. 我的主要问题是如何构建培训文件。 Has anybody work with crf and images? 有没有人使用crf和图像? Has anybody explain me or give some file to learn. 有没有人解释我或给一些文件来学习。 Thanks in advance. 提前致谢。

I used CRF++ before and it worked very well. 我之前使用过CRF ++,效果非常好。 But my field is natural language processing, and I use CRF++ for named entity recognition or POS tagging. 但我的领域是自然语言处理,我使用CRF ++进行命名实体识别或POS标记。 CRF++ is easy to install on Linux but has some minor issue when compiling on windows. CRF ++很容易在Linux上安装,但在Windows上编译时有一些小问题。 You can just follow its document for training data format: each row represents a data sample and each column represents a feature type. 您可以按照其文档来训练数据格式:每行代表一个数据样本,每列代表一种要素类型。

Or, you can also consider Mallet which has a CRF component. 或者,您也可以考虑具有CRF组件的Mallet。

CRFsuite is faster than CRF++ and it can deal with a huge training data. CRFsuite比CRF ++更快,它可以处理大量的训练数据。 I tried both of them. 我试过了他们两个。 They perfectly work on a reasonable amount of data, but when my dataset increased to be more than 100,000 sentences, CRF++ did not manage to deal with it and suddenly stopped working. 它们可以完美地处理合理数量的数据,但是当我的数据集增加到超过100,000个句子时,CRF ++无法处理它并突然停止工作。 Look at the following link 请看以下链接
CRFsuite - CRF Benchmark test there is a comparison between many CRF software in some criteria CRFsuite - CRF基准测试在一些标准中对许多CRF软件进行了比较

我刚刚看到这个用于Windows的那个http//crfsharp.codeplex.com/

也许你也想在Mallet包中尝试CRF组件。

Probably you should start with the DGM library ( https://github.com/Project-10/DGM ), which is the best choice for those, who never worked with CRFs before. 可能你应该从DGM库( https://github.com/Project-10/DGM )开始,这是那些从未使用过CRF的人的最佳选择。 It includes a number of ready-to-go demo projects, which will classify/ segment your images just out-of-the-box. 它包括许多随时可用的演示项目,它们可以开箱即用地对您的图像进行分类/分割。 It is also well documented . 它也有很好的记录

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM