简体   繁体   English

如何在引擎上使用“casa”一词高级搜索推文但不包含“câsa”结果?

[英]How can I advanced search for tweets on engine with the word 'casa' but not having 'câsa' results included?

I try this method:我试试这个方法:

casa -câsa

But that way it excludes the casa without accents too, then the search returns blank.但是这样它也排除了没有重音符号的casa ,然后搜索返回空白。

To the best of my knowledge, Twitter flattens-out all accented latin letters and treats them the same, so...据我所知,Twitter 将所有带重音的拉丁字母变平并一视同仁,所以......
a = á = â = à = ä = ā = ã = å . a = á = â = à = ä = ā = ã = å

One possible way to clean a little bit your search results is to use Twitter's advanced search language operator lang:[xx] in negation -lang:[xx] , where [xx] represents the 2 letter ISO language code of the languages which might be using that particular letter (assuming you wish to filter-out from the results).稍微清理搜索结果的一种可能方法是使用 Twitter 的高级搜索语言运算符lang:[xx]否定-lang:[xx] ,其中[xx]代表可能是的语言的2 个字母 ISO 语言代码使用该特定字母(假设您希望从结果中过滤掉)。

In your example, the letter Ââ (circumflex) is used by the following languages : Sami, Romanian, Vietnamese, French, Frisian, Portuguese, Turkish, Walloon and Welsh.在您的示例中,以下语言使用字母Ââ (circumflex):萨米语、罗马尼亚语、越南语、法语、弗里斯兰语、葡萄牙语、土耳其语、瓦隆语和威尔士语。 Assuming you wish to filter-out results from these specific languages, your Twitter search query would look like this:假设您希望过滤掉这些特定语言的结果,您的 Twitter 搜索查询将如下所示:

"casa" -lang:se -lang:ro -lang:vi -lang:fr -lang:fy -lang:pt -lang:tr -lang:wa -lang:cy

try it... 试试吧...

Alternatively, you can use the same lang:[xx] operator to limit Twitter's search results to one specific language (for example - English):或者,您可以使用相同的lang:[xx]运算符将 Twitter 的搜索结果限制为一种特定语言(例如 - 英语):

"casa" lang:en

try it... 试试吧...

This might not be a water-tight solution but it can reduce a lot of false positives.这可能不是一个无懈可击的解决方案,但它可以减少很多误报。

Finally, you should keep in mind that Twitter is not guaranteeing accuracy in their machine-identification of languages.最后,您应该记住Twitter 不保证其机器语言识别的准确性

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM