Using replace and str.startswith() in a pandas dataframe to rename values

Question

I have a column called source which contains couple hundred rows of text. The thing is that some of these can be grouped together and I'm struggling to do that in the Pandas dataframe. Here's my code:

df.source.replace({
                   df.source.str.startswith('share', na=False): 'sharePet',
                   df.source.str.startswith('2012-01-08', na=False): 'shareDate'

                 })

Additionally, Will this work for the second line which starts with dates? if not I can keep it for the first line and other groupings that are text.

Would love some advice.

jpp · Accepted Answer

You can use a dictionary and iterate:

d = {'share': 'sharePet', '2012-01-08': 'shareDate'}

for k, v in d.items():
    df.loc[df['source'].str.startswith(k, na=False), 'source'] = v

Pandas str.startswith works only for strings. You can check easily what types exist in your series via set(map(type, df['source'])).

Using replace and str.startswith() in a pandas dataframe to rename values

Tags:

python

string

pandas

series

user8322222

1 Answers

jpp

Recent Activity

Donate For Us

Using replace and str.startswith() in a pandas dataframe to rename values

Tags:

python

string

pandas

series

user8322222

1 Answers

jpp

Related questions

Recent Activity

Donate For Us