简体   繁体   English

如何使用SQL识别数据中的重复项

[英]How to identify duplicates in data using SQL

Here's my scenario, I am querying a salesforce database and I have certain cases which have different case_id's but the content remains the same. 这是我的情况,我正在查询Salesforce数据库,并且某些案例具有不同的case_id,但内容保持不变。 ie the case description, case owner, case assigned team and all other related information are the same. 也就是说,案例描述,案例所有者,案例分配团队以及所有其他相关信息都是相同的。

I can do a self join to identify such cases but my question to all you SQL gurus is - can this be done without using joins? 我可以进行自我联接来识别这种情况,但是我对所有SQL专家的问题是-是否可以不使用联接来完成? The reason being, all these information is housed in different tables and makes things harder to do a self join? 原因是,所有这些信息都存储在不同的表中,这使得进行自我连接变得更加困难吗?

Try: 尝试:

SELECT COUNT(CASE_ID), CaseDescription, CaseOwner, CaseAssignedTeam, ... SELECT COUNT(CASE_ID),CaseDescription,CaseOwner,CaseAssignedTeam,...

FROM Table 从表

GROUP BY CaseDescription, CaseOwner, CaseAssignedTeam, ... GROUP BY CaseDescription,CaseOwner,CaseAssignedTeam,...

HAVING COUNT(CASE_ID)>1 拥有COUNT(CASE_ID)> 1

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM