How can I read a text file to dataframe in Pandas with columns with different lenght and missing data?

Question

I have a text file like this:

As you can notice some values are missing data and the file contains some fields with a string with spaces.

I need an output like the following:

When missing data just leave the field blank. Also do not put a comma between the words "No" and "Presento". Is there a way to delimit and separate with comma each field according to a certain length? Here each field has a certain length but I don't know how to convert it to a dataframe.

I remember do something like this in bash with the function substr() .

Any idea?

Sorry about my english. Thank you in advance!

Answer 1

This can be done by a classic pandas.read_csv :

df = pd.read_csv(r'path_to_your_textfile.txt', sep='\t', header=None)

# Output:

print(df)

       0      1            2    3     4   5            6    7
0  Test1   90.0  No presento   67  99.0  67     Aprobado   89
1  Test2  100.0           96   76   NaN  76  No aprobado  100
2  Test3    NaN  No presento   89  80.0  99     Aprobado   78
3  Test4   78.0          100  100  83.0  88          NaN   96

If needed, you can save the dataframe to a new text file with a , separator with pandas.DataFrame.to_csv :

df.to_csv(r'path_to_your_new_texfile.txt', header=None, index=False)

How can I read a text file to dataframe in Pandas with columns with different lenght and missing data?

Question

1 answers

solution1
0 2022-09-25 02:47:38

# Output:

How can I read a text file to dataframe in Pandas with columns with different lenght and missing data?

Question

1 answers

solution1 0 2022-09-25 02:47:38

# Output:

solution1
0 2022-09-25 02:47:38