[英]How can you harvest CKAN datsets using filter
I'm using CKAN 2.9 version with the Remote harvesting extension.我将 CKAN 2.9 版本与远程收获扩展一起使用。
I don't want to harvest all datasets, I want to harvest select datasets of name or of the ID within packages on a website.我不想收集所有数据集,我想收集网站上包内名称或 ID 的选定数据集。
Can I use a filter eg with CQL filter in the config or another way possibility (another ext?)?我可以在配置中使用过滤器,例如 CQL 过滤器或其他方式的可能性(另一个分机?)? Do you have an idea of how I can harvest the datasets with a filter and a simple example if I have to use the config?如果我必须使用配置,您是否知道如何使用过滤器和简单示例收集数据集?
By default the CKAN harvester extension allows you to filter by organization or group datasets to be included or excluded from harvesting.默认情况下,CKAN 收割机扩展允许您按组织或组数据集进行过滤,以包含或排除在收割中。 For eg you can harvest the dataset only for a org_1 by adding the "organizations_filter_include": ['org_1'] key.例如,您可以通过添加 "organizations_filter_include": ['org_1'] 键来仅为 org_1 收集数据集。 For more info check the harvester docs in Github .有关更多信息,请查看Github 中的收割机文档。 If that doesn't work for you, you can use the IHarvester interface and add your own logic, which could be more complicated than using the one above provided by the harvester.如果这对您不起作用,您可以使用IHarvester 接口并添加您自己的逻辑,这可能比使用收割机提供的上述逻辑更复杂。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.