简体   繁体   English

oracle sql 通过打断字符串并比较每个单词来匹配两列中的单词

[英]Match the words in two columns oracle sql by breaking the string and comparing each word

I have two columns in my table for instance column_A and column_B我的表中有两列,例如column_Acolumn_B

Suppose column_A has data "best buy card credit" and column_B has "credit no take buy order", here i need to match the words in both the columns and return the number of matching words .假设column_A有数据“best buy card credit”, column_B有“credit no take buy order”,这里我需要匹配两列中的单词并返回匹配单词的数量。 In this case it should return 2 as "buy" and "credit" match.在这种情况下,它应该返回 2 作为“购买”和“信用”匹配。 Can anyone please suggest sql code to do the same.任何人都可以建议 sql 代码来做同样的事情。

Please NOTE : size of column_a and column_b is not fixed ie the number of words in both might change.请注意column_acolumn_b大小不是固定的,即两者中的字数可能会改变。

with t as (
SELECT 1 AS ID, 'best buy card credit' column_1,
       'credit no take buy order' column_2
  FROM DUAL
UNION ALL
SELECT 2 AS ID, 'gaurav is  fool' column_1, 'saurabh is fool' column_2
  FROM DUAL
           )
,t1_column1 as (
SELECT     ID, LEVEL AS n, REGEXP_SUBSTR (column_1, '[^ ]+', 1, LEVEL) AS val
      FROM t
CONNECT BY REGEXP_SUBSTR (column_1, '[^ ]+', 1, LEVEL) IS NOT NULL
        )
,t1_column2 as (
SELECT     ID, LEVEL AS n, REGEXP_SUBSTR (column_2, '[^ ]+', 1, LEVEL) AS val
      FROM t
CONNECT BY REGEXP_SUBSTR (column_2, '[^ ]+', 1, LEVEL) IS NOT NULL
        )

select id, LISTAGG(VAL,',') WITHIN GROUP(ORDER BY VAL ) words ,COUNT(*) "total matched words"
from
(
SELECT DISTINCT t1_column1.ID ID, t1_column1.val val
           FROM t1_column1, t1_column2
          WHERE t1_column1.ID = t1_column2.ID
            AND t1_column1.val = t1_column2.val
)
group by id

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM