I need to UNION two datasets in a Code Workbook of Palantir Foundry and I'm not sure how to do that. I want to use Pyspark to do this. I'm new to Foundry, please help!
It's the same way you would use PySpark in any other environment!
If you want to union two datasets with the same schema, say datasetA
and datasetB
, you could do something like
dataset_unioned = datasetA.unionByName(datasetB)
Again, this assumes (and you should probably make sure your data structure abides by this) that your two datasets have the same schema. Checkout the PySpark docs here !
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.