简体繁体中英

How to perform Paragraph boundary detection in NLP frameworks?

原文 2013-11-19 11:04:18 7 1 nlp/ text-processing/ stanford-nlp/ opennlp/ apache-stanbol

I am working on extracting names of people from various ads appearing in English newspapers .

However , i have noticed that I need to identify the boundary of an Ad , before extracting the names present in it ,since I need only the first occurring name to be extracted .I started with Stanford NLP . I was successful in extracting names . But I got stuck in identifying the paragraph boundary.

Is there any way of identifying the paragraph boundary . ?

1 answers

This is a difficult problem, we are facing the same problem in one of our projects. There are some theory papers out there which help define the scope of the problem and potential solutions in detail. I'll include them below.

We're still in the process of R&D so we haven't many answers just yet, but we are willing to share what we have and find as time moves forward.

Here is one such paper:

Automatic Paragraph Identification: A Study across Languages and Domains

Here is the github link for the ISCIBoost Code they use:

Open-source implementation of Boostexter (Adaboost based classifier)

Lightweight NLP frameworks for Python

answer type detection in NLP

Sentence detection using NLP

NLP classify sentences/paragraph as funny

Sentence boundary detection in HTML

How to perform information extraction from news paper articles with a NLP library?

NLP Negation detection and Stop Words

NLP Frameworks for converting natural language to database queries

Perform Named Entity Recognition - NLP

Custom sentence boundary detection in SpaCy

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Lightweight NLP frameworks for Python answer type detection in NLP Sentence detection using NLP NLP classify sentences/paragraph as funny Sentence boundary detection in HTML How to perform information extraction from news paper articles with a NLP library? NLP Negation detection and Stop Words NLP Frameworks for converting natural language to database queries Perform Named Entity Recognition - NLP Custom sentence boundary detection in SpaCy

Related Tags

How to perform Paragraph boundary detection in NLP frameworks?

Question

1 answers

solution1 1 2015-12-10 15:25:22

solution1
1 2015-12-10 15:25:22