Extracting Deceased Persons' Names from Obituaries - NLP
I have a continuous string of ads extracted from a newspaper. My task is to extract the deceased persons' names. The ads may appear in the format shown below:
John, the small son of Mr. and Mrs.<br>
Elmer Cleppfer, died at their home in<br>
Lewistown on Wednesday. The funeral<br>
will He held on Saturday afternoon<br>
from the home of the grandparents<br>
on the child, Mr. and Mrs. John<br>
Kiopper, 224 Locust street, tortiorrow<br>
afternoon at 2 o'clock. Interment witt<br>
take place at Oberlin.<br>
Mrs. Lydia Mintch, aged 6S years <br>
died yesterday afternoon at the home<br>
of Fred Flowerfleld at Enhaut. Mrs.<br>
Mlnlch contracted a severe attack of<br>
pneumonia aggravated by other illness<br>
Several days ago which resulted in her<br>
death. Funeral arrangements have not<br>
yet been completed.<br>
The whole paragraph above is made up of 2 ads. Can anyone tell me how to split this kind of text into separate paragraphs when it contains more than one ad?
Well, Stanford Parser is your option here.
I am intentionally not giving away the pattern here, as you should put in some effort as well.
Here is how I would approach the problem.
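As a rough starting point (not the pattern withheld above, and not Stanford Parser), here is a minimal regex-only sketch in Python. It assumes that each ad opens with the deceased's name followed by a comma and a cue such as "aged" or "the ... son of", and that a new ad can only start at the beginning of the text or right after a period. The names `AD_START`, `normalize`, `split_ads`, and `extract_names` are made up for this illustration, and the cue list is a guess based only on the two sample ads.

```python
import re

SAMPLE = """John, the small son of Mr. and Mrs.<br>
Elmer Cleppfer, died at their home in<br>
Lewistown on Wednesday. The funeral<br>
will He held on Saturday afternoon<br>
from the home of the grandparents<br>
on the child, Mr. and Mrs. John<br>
Kiopper, 224 Locust street, tortiorrow<br>
afternoon at 2 o'clock. Interment witt<br>
take place at Oberlin.<br>
Mrs. Lydia Mintch, aged 6S years <br>
died yesterday afternoon at the home<br>
of Fred Flowerfleld at Enhaut. Mrs.<br>
Mlnlch contracted a severe attack of<br>
pneumonia aggravated by other illness<br>
Several days ago which resulted in her<br>
death. Funeral arrangements have not<br>
yet been completed.<br>"""

# Assumed heuristic: an ad starts at the beginning of the text or after
# ". ", with an optional title, one or more capitalized words, a comma,
# and then a cue word. The cue list is hypothetical and will need tuning.
AD_START = re.compile(
    r"(?:^|(?<=\.\s))"
    r"(?P<name>(?:(?:Mrs?|Miss|Dr|Rev)\.?\s)?[A-Z][a-z]+(?:\s[A-Z][a-z]+)*)"
    r",\s(?:aged\b|the\s(?:\w+\s)?(?:son|daughter|wife|widow)\b)"
)


def normalize(raw):
    """Replace <br> markers with spaces and collapse whitespace runs."""
    return " ".join(re.sub(r"<br\s*/?>", " ", raw).split())


def split_ads(raw):
    """Cut the normalized text at every position where an ad seems to start."""
    text = normalize(raw)
    starts = [m.start() for m in AD_START.finditer(text)]
    if not starts or starts[0] != 0:
        starts.insert(0, 0)  # keep any leading text that matched no cue
    bounds = starts + [len(text)]
    pieces = (text[a:b].strip() for a, b in zip(bounds, bounds[1:]))
    return [p for p in pieces if p]


def extract_names(raw):
    """Return the name phrase that opens each detected ad."""
    return [m.group("name") for m in AD_START.finditer(normalize(raw))]


if __name__ == "__main__":
    for ad in split_ads(SAMPLE):
        print(ad, end="\n\n")
    print(extract_names(SAMPLE))
```

On the sample this yields two ads and the name phrases "John" and "Mrs. Lydia Mintch". The child's surname is not recovered because it only appears with the parents, and OCR variants such as "Mlnlch" are not reconciled with "Mintch"; a named-entity tagger or fuzzy matching would be needed for those cases.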