简体繁体中英

How does keyword extraction works?

原文 2018-11-29 06:03:53 6 1 ibm-watson/ watson-nlu

I tested the keyword extraction from the Natural Language Understanding service from IBM with the following text:

Desarrollo PDA. Ajustes PDA. Nuevo modulo PDA. Ajustes modulo PDA. No sincroniza PDA. Error modulo PDA.

And i got the following response:

modulo pda with 98.31% relevance
ajustes modulo pda with 64.44% relevance
nuevo modulo pda with 64.34 relevance

Now my question is why is "modulo pda" keyword relevance 98.31% and not just "PDA" with a higher relevance?. I've been searching everywhere about how does IBM works with no avail.

1 answers

The actual algorithm used to extract and score keywords would be a corporate proprietary recipe, I won't expect them to make it public. But you can find lot of research papers on that topic but usually the final commercial products would contain mix of different techniques to get the best results.

You can compare the different NLU services from different provides, like IBM, Google, Amazon and compare the results.

Specifically for your query, you are trying to extract keywords or topics from a single document. PDA occurs in every sentence in your document. If we apply a simple technique like TF-IDF where each sentence is a document, the the TF-IDF=0 for the word PDA since it occurs in every sentence and becomes irrelevant since its not adding an information to overall topic or document importance.

How to use Relationship Extraction using IBM watson?

NLU Analyze: does the -1 to 1 Sentiment Score for Keyword/Entity represent Magnitude or Confidence?

Does IBM Watson App works outside of Bluemix?

Entity extraction on large documents

How to extract the keyword properties of a PDF URL using IBM Watson Explorer?

How to use keyword spotting feature for IBM Waston Speech to text API?

Feedback Mechanism / Learning for Relationship Extraction

Watson keyword spotting unity

IBM Watson Relationship Extraction “Forwarding error” (status_code 500)

POST request to IBM Watson Relationship extraction returns error

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to use Relationship Extraction using IBM watson? NLU Analyze: does the -1 to 1 Sentiment Score for Keyword/Entity represent Magnitude or Confidence? Does IBM Watson App works outside of Bluemix? Entity extraction on large documents How to extract the keyword properties of a PDF URL using IBM Watson Explorer? How to use keyword spotting feature for IBM Waston Speech to text API? Feedback Mechanism / Learning for Relationship Extraction Watson keyword spotting unity IBM Watson Relationship Extraction “Forwarding error” (status_code 500) POST request to IBM Watson Relationship extraction returns error

Related Tags

How does keyword extraction works?

Question

1 answers

solution1 0 ACCPTED 2018-11-30 15:22:24

solution1
0 ACCPTED 2018-11-30 15:22:24