Python & Pandas: Pendant for R/dplyr .rds format -


what's closest pendant / alternative .rds format known r python respectively pandas dataframe?

i tried to_pickle() listed in dataframe documentation lacks property smaller corresponding .csv file holding same contents.

df = pd.get_dummies(pd.series(list('abca'))) df.to_csv("tmp.csv")       # 44 bytes df.to_pickle("tmp.pickle") # 682 bytes 

i'm looking fast (save time writing , reading in comparision .csv) , compressed (save disk space in comparision .csv) way (de)serialize pandas dataframes.

thanks.

pandas can export hdf5, faster , more compact csv. has problems mixed types, problems exist csv well.

you have careful comparing filesizes small datasets. overhead compression might make file larger smaller datasets


Comments

Popular posts from this blog

'hasOwnProperty' in javascript -

How to put a lock and transaction on table using spring 4 or above using jdbcTemplate and annotations like @Transactional? -

How to understand 2 main() functions after using uftrace to profile the C++ program? -