简体   繁体   English

NLP字典

[英]Dictionary for NLP

I am in need of an offline C# (or ANY language for that matter, I can port it over) library that will be able to take a part of speech (that I will detect) and tell me the plural form of the word or the different tenses of the word that I can then turn around and use. 我需要一个脱机的C#(或其他任何一种语言,我可以将其移植)库,该库将能够进行部分语音处理(我将检测到)并告诉我单词或“这个词的不同时态,然后我就可以转过来使用。

Does such a library exist for the English language? 是否存在此类英语库?

Late to jump in, but .NET 4.0 includes a pluralization service 迟到了,但是.NET 4.0包含一个多元化服务

http://msdn.microsoft.com/en-us/library/dd486930(v=vs.100).aspx http://msdn.microsoft.com/en-us/library/dd486930(v=vs.100).aspx

希望以后如果您熟悉其他与NLP相关的任务, 那么Sharpnlp将对您有所帮助。

The tool you are looking for is a "morphological generator". 您正在寻找的工具是“形态生成器”。 It takes a base word and an inflection code, and produces an inflected word form. 它采用一个基本单词和一个变位代码,并产生一个变位单词形式。

However, for English, a plural inflector is just a few lines of code, plus a dictionary of exceptions (base form and inflection for words with an irregular plural form). 但是,对于英语来说,复数形式的转折符只是几行代码,外加例外字典(基本形式和不规则复数形式的单词的变体形式)。 Just remember to handle the case of words ending on a sibilant. 只需记住处理以sibilant结尾的单词。

If you are interested with just some rules and a list like what @tripleee mentioned, then you can try/port from MorphAdorner's code. 如果您只对一些规则和@tripleee提及的列表感兴趣,则可以尝试/移植MorphAdorner的代码。 Demo: MorphAdorner Pluralizer 演示: MorphAdorner制浆机

But if you are interested with an exhaustive listing you can check Lexical Tools' Inflection Variants . 但是,如果您对详尽的清单感兴趣,可以查看Lexical Tools的Inflection Variants After downloading Lexical Tools , you will be importing the data to your database server. 下载Lexical Tools之后 ,您将把数据导入数据库服务器。 Then you can just query the database using their library (Java). 然后,您可以使用其库(Java)来查询数据库。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM