Is there a way to remove a sub-attribute from a JSON column through Spark SQL?

I have a table with a column that holds nested JSON data. I need to remove one attribute from that JSON through Spark SQL.

I checked the basic Spark JSON functions but could not find a way to do it.

Assuming you read in a JSON file and printed the schema you are showing us like this:

// In Scala, `read` takes no parentheses, and json() already returns a DataFrame.
val df = sqlContext.read.json("/path/to/file")
df.registerTempTable("df")
df.printSchema()

Then you can select nested objects inside a struct type like so...

val app = df.select("app")
app.registerTempTable("app")
app.printSchema()
app.show()

// "element" is only the label printSchema() gives to array items; address the field through its column.
val appName = app.select("app.appName")
appName.registerTempTable("appName")
appName.printSchema()
appName.show()

// drop() removes top-level columns only; it is a no-op if no column has this name.
val trimmedDF = appName.drop("firstname")
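
If the attribute to remove sits inside a struct rather than at the top level, a plain drop will not reach it. A minimal sketch of two possible ways around that, assuming Spark 3.1 or later (for Column.dropFields) and a struct column named app with fields appName and firstname. The sample record, the id field and the object name are made up for illustration and are not taken from the question. If the column actually holds JSON as a string, it would first have to be parsed with from_json.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

object DropNestedField {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("drop-nested-field")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data: one JSON document per string.
    val json = Seq("""{"id": 1, "app": {"appName": "demo", "firstname": "Ada"}}""")
    val df = spark.read.json(json.toDS())

    // DataFrame API: replace the struct with a copy that lacks "firstname".
    val trimmedDF = df.withColumn("app", col("app").dropFields("firstname"))
    trimmedDF.printSchema()

    // Spark SQL: rebuild the struct, listing only the fields to keep.
    df.createOrReplaceTempView("df")
    val trimmedSql = spark.sql(
      "SELECT id, named_struct('appName', app.appName) AS app FROM df")
    trimmedSql.printSchema()

    spark.stop()
  }
}
```

The SQL variant needs every kept field to be listed by hand, so it fits small structs best; dropFields names only the field to remove.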
