
AWS Glue PySpark can't count the records

I'm using AWS Glue to extract data from EC2 (PostgreSQL), transform it, and load it into S3. When I tried to extract one table, I got an error that looks like this:

[screenshot of the error]

Is there anything I can do? I tried to drop null fields or fillna, but none of those works.

UPDATE: I even selected a string-type column, but I still got the same error: [screenshot of the error]

Can you try df.isnull().any() or df.isnull().sum()? This should help us see which columns contain invalid NaN data. Also, please try to fetch the record count with df.count(), or drop the null rows first with df.na.drop(). (Note that df.isnull() is the pandas API; df.na.drop() is the Spark DataFrame equivalent, so convert your Glue DynamicFrame to the appropriate type first.) Please refer here, where handling null column data is explained in more detail.
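To make the pandas-side checks concrete, here is a minimal sketch with a made-up two-column DataFrame (the column names and data are illustrative, not from your table); the Spark equivalents are noted in comments:

```python
import pandas as pd

# Hypothetical sample data with one null value in the "name" column
df = pd.DataFrame({"id": [1, 2, 3], "name": ["a", None, "c"]})

# Which columns contain at least one null? (pandas)
print(df.isnull().any())

# How many nulls per column? (pandas)
print(df.isnull().sum())

# Record count after dropping rows containing nulls (pandas).
# In PySpark the rough equivalent would be: df.na.drop().count()
print(len(df.dropna()))  # 2
```

If the count succeeds after dropping nulls, that points at the null handling rather than the Glue connection itself.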

Hope this helps.
