简体   繁体   English

Hadoop / Hive:从.csv加载数据到远程计算机上

[英]Hadoop/Hive : Loading data from .csv on a remote machine

I am having a csv file that can from a http url. 我有一个可以从http网址获取的csv文件。 Is there any way I can load it from there :- 有什么办法可以从那里加载它吗:

This is what I am trying 这就是我正在尝试的

LOAD DATA INPATH 'http://192.168.56.101:8081/TeamHalf.csv' OVERWRITE INTO TABLE csvdata;

Hive Load command is a follows : Hive Load命令如下:

LOAD DATA [LOCAL] INPATH 'filepath' [OVERWRITE] INTO TABLE tablename [PARTITION (partcol1=val1, partcol2=val2 ...)]

1) if LOCAL specified - Loads from local FS filepath 1)如果指定LOCAL-从本地FS文件路径加载

2) if no LOCAL - Loads from HDFS filepath only ie,: filepath must refer to files within the same filesystem as the table's (or partition's) location 2)如果没有本地-仅从HDFS文件路径加载,即:文件路径必须引用与表(或分区)位置相同的文件系统中的文件

So Load from remote http:path won't work. 因此,无法从远程http:path加载。 refer HIVE DML . 参考HIVE DML The possible way is (Staging) to load the data from remote http:path to LocalFS or HDFS , then to Hive Warehouse. 可能的方法是(分段)将数据从远程http:path加载到LocalFS或HDFS,然后再加载到Hive Warehouse。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM