简体繁体中英

String fuzzy-matching From R to Python

原文 2021-07-06 12:46:56 8 1 python/ r

I am trying to use string fuzzy-matching with both R and Python. I am actually using two packages:

stringdist from R
fuzzywuzzy from Python

When I try amatch("PARI", c("HELLO", "WORLD"), maxDist = 2) on R, I get NA as a result, which is intuitive. But when I try the same thing with Python : process.extract("PARI", ["HELLO", "WORLD"], limit = 2) , I get [('world', 22), ('HELLO', 0)]

Could anyone tell me why I have a 22 as a ratio matching between "PARI" and "WORLD" ? How could I get the same result as in R ? Thanks in advance

1 answers

The problem here is limit = 2 specifically says you want 2 results regardless of the score, whereas in R you are specifying that you only want a result if the strings are very close to one another. The score here is a measure from 0 to 100 of how similar the words are. You can see PARI and world both have R as their third letter, which is why you get a non-zero score, but it still isn't a very good one.

Fuzzy string matching in Python

Fuzzy String Matching Python - dataframe

Fuzzy matching two columns in R or Python

What is a simple fuzzy string matching algorithm in Python?

Python find all fuzzy matching sequences in a string

Fuzzy matching from string candidate list

Matching 2 large csv files by Fuzzy string matching in Python

Python selenium and fuzzy matching

Fuzzy URL matching in Python

Fuzzy matching with pyspark or python

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Fuzzy string matching in Python Fuzzy String Matching Python - dataframe Fuzzy matching two columns in R or Python What is a simple fuzzy string matching algorithm in Python? Python find all fuzzy matching sequences in a string Fuzzy matching from string candidate list Matching 2 large csv files by Fuzzy string matching in Python Python selenium and fuzzy matching Fuzzy URL matching in Python Fuzzy matching with pyspark or python

Related Tags

String fuzzy-matching From R to Python

Question

1 answers

solution1 1 2021-07-06 12:55:14

solution1
1 2021-07-06 12:55:14