简体   繁体   English

Google BigQuery 是否支持全文搜索

[英]Does Google BigQuery support Full-Text search

I'm thinking about copying my text searchable content to Google's BigQuery and then perform full-text search using BigQuery API.我正在考虑将我的文本可搜索内容复制到 Google 的 BigQuery,然后使用 BigQuery API 执行全文搜索。

Does Google BigQuery support that scenario? Google BigQuery 是否支持这种情况?

I could not find "search" command in Google BigQuery API: https://developers.google.com/bigquery/docs/reference/v2/我在 Google BigQuery API 中找不到“搜索”命令: https://developers.google.com/bigquery/docs/reference/v2/

BigQuery support a collection of RegEx and String query functions, making it suitable for text search queries across STRING fields. BigQuery支持RegEx和String查询功能的集合,使其适用于跨STRING字段的文本搜索查询。 However, there is a 64k per row (and field) limit for each BigQuery record, so it may not possible to support a totally unstructured, unlimited size, document text search case. 但是,每个BigQuery记录的每行(每个字段)限制为64k,因此可能无法支持完全非结构化,大小不受限制的文档文本搜索用例。

For a full text search capabilities in an App Engine application, I would suggest looking at the new Search API: 对于App Engine应用程序中的全文搜索功能,我建议您查看新的Search API:

10 years late and here we are.晚了 10 年,我们到了。 Today (07/04/22) BigQuery launched It equivalent of Full Text Search.今天(22 年 7 月 4 日)BigQuery 推出了相当于全文搜索的功能。 Here is the doc:这是文档:

https://cloud.google.com/blog/products/data-analytics/pinpoint-unique-elements-with-bigquery-search-features/ https://cloud.google.com/blog/products/data-analytics/pinpoint-unique-elements-with-bigquery-search-features/

The litecene library provides full-text search support for BigQuery using a "lucene light" syntax. litecene库使用“lucene light”语法为 BigQuery 提供全文搜索支持。

(smartphone OR "smart phone"~8 OR iphone OR "i phone" OR "apple phone" OR android OR "google phone" OR "windows phone") AND app*

It compiles the boolean query language down to regular expression matches.它将 boolean 查询语言编译为正则表达式匹配。 It also makes use of new BigQuery search features when possible, namely the SEARCH function and search indexes when possible, although at the time of this writing the searches supported by those features are fairly limited.它还尽可能使用新的 BigQuery 搜索功能,即SEARCH function搜索索引,尽管在撰写本文时,这些功能支持的搜索相当有限。 Using litecene, full-text search can also be deployed against existing production datasets without any ETL changes or re-indexing using non-aggregate materialized views .使用 litecene,全文搜索也可以针对现有的生产数据集部署,而无需任何 ETL 更改或使用非聚合物化视图重新索引。 The searches can target one or multiple columns.搜索可以针对一列或多列。

Disclaimer: I am the author of the library.免责声明:我是图书馆的作者。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM