Python Pandas Dataframe interpolation with strings

Question

I was wondering whether a pandas DataFrame allows interpolation for strings as well. (Interpolation works for my numeric values, but not for strings.)

import pandas as pd
import numpy as np

s = pd.Series(["Blue", "Blue", np.nan, "Blue", "Blue", "Red"])
s = s.interpolate()
print(s)

Output: Blue, Blue, NaN, Blue, Blue, Red

Desired Output: Blue, Blue, Blue, Blue, Blue, Red

Answer 1

Just use a forward fill.

s = s.ffill()
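For reference, here is a minimal, self-contained sketch of that forward fill applied to the series from the question. The variable name `filled` is only illustrative.

```python
import numpy as np
import pandas as pd

s = pd.Series(["Blue", "Blue", np.nan, "Blue", "Blue", "Red"])

# Forward fill: each missing entry takes the last non-missing value before it.
filled = s.ffill()
print(filled.tolist())
```

Note that a forward fill has nothing to copy into missing values at the very start of a series, so those stay missing.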

Answer 2

No, you can't interpolate strings directly, but you can convert the strings to categories and then interpolate on the category codes.

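# arr holds integer codes (-1 marks a missing value); cat holds the unique strings
arr, cat = s.factorize()
# Note: a gap between two different categories produces fractional codes,
# and the renaming step below then fails.
s2 = pd.Series(arr).replace(-1, np.nan).interpolate()\
         .astype('category').cat.rename_categories(cat)\
         .astype('str')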
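The snippet above works when each gap sits between two equal values. As a rough sketch of one way to handle a gap between two different strings, the interpolated codes can be rounded back to whole numbers before mapping them to labels. This is my own variation, not part of the answer above, and the names `codes`, `uniques`, `numeric` and `result` are illustrative. It is only meaningful when the two neighbouring codes are adjacent integers; otherwise the interpolation passes through unrelated categories.

```python
import numpy as np
import pandas as pd

s = pd.Series(["Blue", np.nan, np.nan, "Red"])

# codes: integer label per entry, -1 for missing; uniques: the distinct strings
codes, uniques = s.factorize()

# Turn the -1 sentinel into NaN so that interpolate() treats it as a gap.
numeric = pd.Series(codes, index=s.index, dtype="float64").where(codes != -1)

# Linear interpolation gives 0, 1/3, 2/3, 1; rounding snaps each value
# to the closest existing code.
rounded = numeric.interpolate().round().astype(int)

# Map the integer codes back to the original strings.
result = pd.Series(uniques.take(rounded.to_numpy()), index=s.index)
print(result.tolist())
```

In effect, each missing entry takes the label of whichever neighbour is closer in position.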

Answer 3

In your case, s.interpolate(method='pad') or s.ffill() will do just fine, but you can compare the outputs of the different techniques below:

import pandas as pd

s = pd.Series([None, None, 'red', 'red', None, 'blue', None, None])

print(s.to_list())
print(s.bfill().tolist())
print(s.ffill().tolist())
print(s.bfill().ffill().tolist())
print(s.ffill().bfill().tolist())
print(s.interpolate(method='pad').tolist())

Output:

[None, None, 'red', 'red', None, 'blue', None, None]
['red', 'red', 'red', 'red', 'blue', 'blue', None, None]
[None, None, 'red', 'red', 'red', 'blue', 'blue', 'blue']
['red', 'red', 'red', 'red', 'blue', 'blue', 'blue', 'blue']
['red', 'red', 'red', 'red', 'red', 'blue', 'blue', 'blue']
[None, None, 'red', 'red', 'red', 'blue', 'blue', 'blue']
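Since the question is about a DataFrame with both numeric and string data, here is a small sketch of treating the two kinds of columns differently: linear interpolation for numeric columns and a forward fill for everything else. The column names `value` and `color` are made up for illustration. It uses ffill() rather than interpolate(method='pad') because, as far as I know, recent pandas releases deprecate the fill-style methods of interpolate.

```python
import numpy as np
import pandas as pd

# Hypothetical data: one numeric column and one string column, both with gaps.
df = pd.DataFrame({
    "value": [1.0, np.nan, 3.0, np.nan, 5.0],
    "color": ["Blue", np.nan, "Blue", np.nan, "Red"],
})

numeric_cols = df.select_dtypes(include="number").columns
other_cols = df.columns.difference(numeric_cols)

out = df.copy()
out[numeric_cols] = df[numeric_cols].interpolate()  # linear interpolation
out[other_cols] = df[other_cols].ffill()            # carry last label forward
print(out)
```

Swapping ffill() for bfill(), or chaining the two, changes how leading and trailing gaps are handled, as the outputs above illustrate.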

Answer 4

I believe that the following will also work for strings:

s = s.interpolate(method='pad')

See the documentation at https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.interpolate.html.

4 answers

solution1
5 ACCEPTED 2019-10-08 07:55:25

solution2
1 2017-02-14 06:48:01

solution3
1 2019-11-15 15:55:46

solution4
0 2019-04-18 22:45:14
