Import Google API JSON file to Elasticsearch
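I'm completely new to the ELK stack, and to ES in particular. I have a JSON file that I obtained with the Google Admin SDK API, and I'd like to import it into Elasticsearch.
So far, this is the JSON structure of my data: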
{
  "kind": "reports#activities",
  "nextPageToken": string,
  "items": [
    {
      "kind": "audit#activity",
      "id": {
        "time": datetime,
        "uniqueQualifier": long,
        "applicationName": string,
        "customerId": string
      },
      "actor": {
        "callerType": string,
        "email": string,
        "profileId": long,
        "key": string
      },
      "ownerDomain": string,
      "ipAddress": string,
      "events": [
        {
          "type": string,
          "name": string,
          "parameters": [
            {
              "name": string,
              "value": string,
              "intValue": long,
              "boolValue": boolean
            }
          ]
        }
      ]
    }
  ]
}
So I decided to first upload the JSON file into ES with this command:
curl -s -XPOST 'localhost:9200/_bulk' --data-binary @documents.json
But I get an error:
{"error":{"root_cause":[{"type":"illegal_argument_exception","reason":"Malformed action/metadata line [1], expected START_OBJECT or END_OBJECT but found [START_ARRAY]"}],"type":"illegal_argument_exception","reason":"Malformed action/metadata line [1], expected START_OBJECT or END_OBJECT but found [START_ARRAY]"},"status":400}
What should I do?
Thanks for your help!
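That JSON seems to describe the structure of your documents, so you first need to create an index with a mapping that matches that structure. This uses the `string` field type, which newer Elasticsearch versions (5.0 and later) replace with `text` and `keyword`. In your case, you could do it like this: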
curl -XPUT localhost:9200/reports -d '{
  "mappings": {
    "activity": {
      "properties": {
        "kind": {
          "type": "string"
        },
        "nextPageToken": {
          "type": "string"
        },
        "items": {
          "properties": {
            "kind": {
              "type": "string"
            },
            "id": {
              "properties": {
                "time": {
                  "type": "date",
                  "format": "date_time"
                },
                "uniqueQualifier": {
                  "type": "long"
                },
                "applicationName": {
                  "type": "string"
                },
                "customerId": {
                  "type": "string"
                }
              }
            },
            "actor": {
              "properties": {
                "callerType": {
                  "type": "string"
                },
                "email": {
                  "type": "string"
                },
                "profileId": {
                  "type": "long"
                },
                "key": {
                  "type": "string"
                }
              }
            },
            "ownerDomain": {
              "type": "string"
            },
            "ipAddress": {
              "type": "string"
            },
            "events": {
              "properties": {
                "type": {
                  "type": "string"
                },
                "name": {
                  "type": "string"
                },
                "parameters": {
                  "properties": {
                    "name": {
                      "type": "string"
                    },
                    "value": {
                      "type": "string"
                    },
                    "intValue": {
                      "type": "long"
                    },
                    "boolValue": {
                      "type": "boolean"
                    }
                  }
                }
              }
            }
          }
        }
      }
    }
  }
}'
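Once this is done, you can use a bulk call to index your reports#activities documents that follow the structure above. The syntax of a bulk call is precisely defined in the Elasticsearch bulk API documentation: you need an action line (what to do), followed on the next line by the document source (what to index), and the source must not contain any newlines!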
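The two rules above (alternating action and source lines, each on a single line) can be checked locally before anything is sent to Elasticsearch. The following is a minimal Python sketch, not part of the original answer: the function name `check_bulk_body` is made up for illustration, and it assumes the body contains only `index` actions, each followed by exactly one source line.

```python
import json


def check_bulk_body(body):
    """Return a list of problems found in a bulk request body (empty if none).

    Assumption: every action is an "index" action, so lines come in
    action/source pairs.
    """
    problems = []
    lines = body.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # drop the empty piece after the final newline
    else:
        problems.append("body must end with a newline")
    if len(lines) % 2 != 0:
        problems.append("expected action/source pairs, got %d lines" % len(lines))
    for number, line in enumerate(lines, start=1):
        try:
            parsed = json.loads(line)
        except ValueError:
            # Typical for pretty-printed JSON spread over several lines.
            problems.append("line %d is not a complete JSON value" % number)
            continue
        if not isinstance(parsed, dict):
            problems.append("line %d is not a JSON object" % number)
    return problems
```

A pretty-printed file, or one whose first line is a JSON array, will produce at least one problem here, which is the same kind of input the bulk endpoint rejects.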
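So you need to reformat your documents.json file like this (make sure to add a newline after the second line). Also note that I've added some dummy data to illustrate the process: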
{"index": {"_index": "reports", "_type": "activity"}}
{"kind":"reports#activities","nextPageToken":"string","items":[{"kind":"audit#activity","id":{"time":"2016-05-31T00:00:00.000Z","uniqueQualifier":1,"applicationName":"string","customerId":"string"},"actor":{"callerType":"string","email":"string","profileId":1,"key":"string"},"ownerDomain":"string","ipAddress":"string","events":[{"type":"string","name":"string","parameters":[{"name":"string","value":"string","intValue":1,"boolValue":true}]}]}]}
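Reformatting the file by hand gets tedious, so here is a rough Python sketch of how the conversion could be automated. It is not from the original answer: the function names `to_bulk_body` and `convert_file` are hypothetical, the index and type names are taken from the example above, and it assumes documents.json holds either one API response or a JSON array of responses.

```python
import json


def to_bulk_body(documents, index="reports", doc_type="activity"):
    """Build a bulk body: one action line plus one source line per document."""
    lines = []
    for document in documents:
        action = {"index": {"_index": index, "_type": doc_type}}
        # json.dumps without an indent argument emits a single line.
        lines.append(json.dumps(action))
        lines.append(json.dumps(document))
    # Every line, including the last one, ends with a newline.
    return "".join(line + "\n" for line in lines)


def convert_file(source_path, target_path, index="reports", doc_type="activity"):
    """Read a (possibly pretty-printed) JSON file and write a bulk file."""
    with open(source_path, encoding="utf-8") as source:
        loaded = json.load(source)
    # Assumption: a top-level array means "several responses in one file".
    documents = loaded if isinstance(loaded, list) else [loaded]
    with open(target_path, "w", encoding="utf-8") as target:
        target.write(to_bulk_body(documents, index, doc_type))
```

For example, `convert_file("documents.json", "bulk.json")` would write a file that can be passed to the same curl command with `--data-binary @bulk.json`.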