一行中的所有匹配项：Spacy matcher

Question

I am looking for a solution to print all the matching in a line using Spacy matcher我正在寻找一种使用 Spacy 匹配器在一行中打印所有匹配项的解决方案

The example goes like this, Here I am trying to extract experience.这个例子是这样的，在这里我试图提取经验。

doc = nlp("1+ years of experience in XX, 2 years of experiance in YY")
pattern = [{'POS': 'NUM'}, {'ORTH': '+', "OP": "?"}, {"LOWER": {"REGEX": "years?|months?"}}]
matcher = Matcher(nlp.vocab)
matcher.add("Skills", None, pattern)
matches = matcher(doc)
pirnt(doc[matches[0][1]:matches[0][2]]

Here I am getting output 1+ years .在这里，我得到了1+ years输出。

But I am looking for a solution having output ['1+ years','2 years']但我正在寻找具有输出['1+ years','2 years']的解决方案

Answer 1

You should specify the first item as 'LIKE_NUM': True :您应该将第一项指定为'LIKE_NUM': True ：

pattern = [{'LIKE_NUM': True}, {'ORTH': '+', "OP": "?"}, {"LOWER": {"REGEX": "(?:year|month)s?"}}]

Code:代码：

import spacy
from spacy.matcher import Matcher

nlp = spacy.load("en_core_web_sm")
matcher = Matcher(nlp.vocab)
pattern = [{'LIKE_NUM': True}, {'ORTH': '+', "OP": "?"}, {"LOWER": {"REGEX": "(?:year|month)s?"}}]
matcher.add("Skills", None, pattern)

doc = nlp("1+ years of experience in XX, 2 years of experiance in YY")

matches = matcher(doc)
for _, start, end in matches:
  print(doc[start:end].text)

Output:输出：

1+ years
2 years

一行中的所有匹配项：Spacy matcher

问题描述

1 个解决方案

解决方案1
2 已采纳 2020-01-06 15:07:38

一行中的所有匹配项：Spacy matcher

问题描述

1 个解决方案

解决方案1 2 已采纳 2020-01-06 15:07:38

解决方案1
2 已采纳 2020-01-06 15:07:38