How can I create a new column in a pandas data frame by extracting words from sentences in another column?

Question

I have a pandas dataframe like this.

import pandas as pd
student_id = ['001', '002', '003', '004']
names = ['Jane', 'Mary', 'Andrew', 
'Paul']
address = ['7 karumu st Ikeja Lagos', '8 
logo street Umuahia Abia', 
       '10 jege close PH Rivers', '9 
Lekki gate Lagos']

test_1 = {'Student_ID': student_id, 
      'Name': names, 
      'Address': address}
df = pd.DataFrame(test_1)
df`

Output

and a list like this:

List = [Imo, Lagos, Abia, Ebonyi, Rivers]

So i am trying to iterate through the Address column and estract the states in the address which is also in the list. If a state in the list is spotted I would like to extract it and append to a new column called state.

I tried to use the iterrows() method but I am a bit lost

Answer 1

You can filter like this:

df = df[df['Address'].str.contains('|'.join(List))]

Answer 2

get the 'Adress' Column
convert to 'List' to DataFrame
After I think 'MERGE' you should use
Storage to last dafaFrame and add the as a another column

I think this will solve your problem

Answer 3

Assuming that the state is always the last word in the address.

import numpy as np

states = ["Imo", "Lagos", "Abia", "Ebonyi", "Rivers"]
df["State"] = df["Address"].map(lambda x: state if (state:=x.split()[-1]) in states else np.nan)

How can I create a new column in a pandas data frame by extracting words from sentences in another column?

Question

3 answers

solution1
1 2022-12-12 13:16:08

solution2
0 2022-12-12 13:20:15

solution3
0 ACCPTED 2022-12-12 13:32:15

How can I create a new column in a pandas data frame by extracting words from sentences in another column?

Question

3 answers

solution1 1 2022-12-12 13:16:08

solution2 0 2022-12-12 13:20:15

solution3 0 ACCPTED 2022-12-12 13:32:15

solution1
1 2022-12-12 13:16:08

solution2
0 2022-12-12 13:20:15

solution3
0 ACCPTED 2022-12-12 13:32:15