Extracting string between 2 characters from Dataframe column

Question

I have a column with entries like: Hello [World]. I am trying to extract 'World' from every row and put it in a new column.

I'm not sure how to go about this, as I am not familiar with regex.

Thanks.

Answer 1

It would look something like this:

import pandas as pd

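df = pd.DataFrame([['hello [world]'], ['something [else]']], columns=['words'])
# Strip everything up to and including '[' and the trailing ']'.
# regex=True is needed because pandas 2.0+ treats the pattern as a literal string by default.
df['words'] = df['words'].str.replace(r'^.*\[|\]$', '', regex=True)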

print(df)

The only complicated part there is the regex: '^.*\[|\]$'. It says to match everything from the start of the string (^) up to and including a [ character (.*\[), OR (|) a ] character that sits at the end of the string (\]$), and to replace whatever matched with nothing (''). Because .* is greedy, the first alternative runs through the last [ in the string, not the first.
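To see the two alternatives at work outside pandas, here is a small sketch using Python's standard re module. The sample strings are made up for illustration; the pattern is the one from the answer.

```python
import re

pattern = r'^.*\[|\]$'

# Each match corresponds to one of the two alternatives in the pattern.
matches = re.findall(pattern, 'hello [world]')
print(matches)

# Replacing both matches with '' leaves only the bracketed text.
simple = re.sub(pattern, '', 'hello [world]')
print(simple)

# Greedy .* runs through the LAST '[', so earlier brackets are removed too.
greedy = re.sub(pattern, '', 'a [b] [c]')
print(greedy)
```

With more than one bracket pair in a string, only the contents of the last pair survive, which may or may not be what you want.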

If you are going to be doing this kind of thing often, I would highly encourage you to learn regex.
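Since the question asks for a new column rather than overwriting the existing one, an alternative worth considering is Series.str.extract with a capture group. This is a minimal sketch, not the approach used above: the column name 'extracted' and the 'no brackets' row are assumptions added for illustration.

```python
import pandas as pd

df = pd.DataFrame({'words': ['hello [world]', 'something [else]', 'no brackets']})

# Capture the text between the first '[' and the next ']'.
# expand=False returns a Series, since there is only one capture group.
df['extracted'] = df['words'].str.extract(r'\[([^\]]*)\]', expand=False)

print(df)
```

Rows that contain no brackets get NaN in the new column, and the original column is left untouched.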
