
Word and phrase counting with XSLT

We would like to build a dictionary from the documentation of our company's products in order to establish a fixed terminology, so we need to count the frequency of specific words and phrases.

This could be solved in a couple of different ways, but what we would really like is an XSLT algorithm that can recognize phrases as specific words that often occur together (so we don't have to specify beforehand every phrase and all its variants with different conjugations, affixations, etc.).

What do you think: can this task be done with XSLT, or should we look for other solutions?

If anyone has useful advice on how we should start, I would be more than happy to hear your ideas and discuss this!

You're looking for collocations, which in algorithmic terms are typically scored with pointwise mutual information (PMI).

In XSLT there is no framework for natural language processing (NLP), so you would have to invent one. However, NLP frameworks exist for general-purpose languages, such as Python's NLTK. Check out this example of finding collocations using Python.

It might be easiest to use an external app written in a popular data-mining language like Python or R. (You could even plug it into your DITA OT processing.) You might also look at vendors with existing solutions. I haven't done an in-depth search, but I've seen systems like Watson, Semaphore, or even XDocs return results from language analysis.
