[英]How can I advanced search for tweets on engine with the word 'casa' but not having 'câsa' results included?
I try this method:我试试这个方法:
casa -câsa
But that way it excludes the casa
without accents too, then the search returns blank.但是这样它也排除了没有重音符号的
casa
,然后搜索返回空白。
To the best of my knowledge, Twitter flattens-out all accented latin letters and treats them the same, so...据我所知,Twitter 将所有带重音的拉丁字母变平并一视同仁,所以......
a
= á
= â
= à
= ä
= ā
= ã
= å
. a
= á
= â
= à
= ä
= ā
= ã
= å
。
One possible way to clean a little bit your search results is to use Twitter's advanced search language operator lang:[xx]
in negation -lang:[xx]
, where [xx]
represents the 2 letter ISO language code of the languages which might be using that particular letter (assuming you wish to filter-out from the results).稍微清理搜索结果的一种可能方法是使用 Twitter 的高级搜索语言运算符
lang:[xx]
否定-lang:[xx]
,其中[xx]
代表可能是的语言的2 个字母 ISO 语言代码使用该特定字母(假设您希望从结果中过滤掉)。
In your example, the letter Ââ
(circumflex) is used by the following languages : Sami, Romanian, Vietnamese, French, Frisian, Portuguese, Turkish, Walloon and Welsh.在您的示例中,以下语言使用字母
Ââ
(circumflex):萨米语、罗马尼亚语、越南语、法语、弗里斯兰语、葡萄牙语、土耳其语、瓦隆语和威尔士语。 Assuming you wish to filter-out results from these specific languages, your Twitter search query would look like this:假设您希望过滤掉这些特定语言的结果,您的 Twitter 搜索查询将如下所示:
"casa" -lang:se -lang:ro -lang:vi -lang:fr -lang:fy -lang:pt -lang:tr -lang:wa -lang:cy
Alternatively, you can use the same lang:[xx]
operator to limit Twitter's search results to one specific language (for example - English):或者,您可以使用相同的
lang:[xx]
运算符将 Twitter 的搜索结果限制为一种特定语言(例如 - 英语):
"casa" lang:en
This might not be a water-tight solution but it can reduce a lot of false positives.这可能不是一个无懈可击的解决方案,但它可以减少很多误报。
Finally, you should keep in mind that Twitter is not guaranteeing accuracy in their machine-identification of languages.最后,您应该记住Twitter 不保证其机器语言识别的准确性。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.