Assigning index column to empty pandas dataframe

Question

I am creating an empty dataframe that i then want to add data to one row at a time. I want to index on the first column, 'customer_ID'

I have this:

In[1]: df = pd.DataFrame(columns = ['customer_ID','a','b','c'],index=['customer_ID'])
In[2]: df
Out[3]: 
            customer_ID    a    b    c
customer_ID         NaN  NaN  NaN  NaN

So there is already a row of NaN that I don't want. Can I point the index to the first column without adding a row of data?

doctorer · Accepted Answer

The answer, I think, as hinted at by @JD Long is to set the index in a seprate instruction:

In[1]: df = pd.DataFrame(columns = ['customer_ID','a','b','c'])
In[2]: df.set_index('customer_ID',inplace = True)
In[3]: df
Out[3]: 
Empty DataFrame
Columns: [customer_ID, a, b, c]
Index: []

I can then add rows:

In[4]: id='x123'
In[5]: df.loc[id]=[id,4,5,6]
In[6]: df
Out[7]: 
 customer_ID    a    b    c
x123        x123  4.0  5.0  6.0

JD Long · Answer

yes... and you can dropna at any time if you are so inclined:

df = df.set_index('customer_ID').dropna()
df

Assigning index column to empty pandas dataframe

Tags:

pandas

dataframe

doctorer

2 Answers

doctorer

JD Long

Recent Activity

Donate For Us

Assigning index column to empty pandas dataframe

Tags:

pandas

dataframe

doctorer

2 Answers

doctorer

JD Long

Related questions

Recent Activity

Donate For Us