pandas-multi

Read multiple csvs or Excel files as a single pandas DataFrame.


Keywords
pandas, readcsv, readcsvs, csv, xls, xlsx, excel, dataframe, multiple, multi
License
MIT
Install
pip install pandas-multi==2019.3.12

Documentation

pandas_multi

Simple loop for reading multiple csv files (matching a certain pattern) as a pandas.DataFrame.

I'm aware this need can be solved in even one line of Python, but loading multiple similar csv's is just something that should be as easy as loading one csv. If you don't want to add a new dependency to your project, google what os.listdir and glob can do for you.

Installation can be done by typing:

pip install pandas_multi

Usage of pandas_multi.read_csvs has been kept as similar as possible to pandas.read_csv:

import pandas_multi

# <sarcasm>
#   Note that dataframes only work if you give them the non-descriptive name df
# </sarcasm>
df = pandas_multi.read_csvs('./20180728*.csv')
# if you provide it with a path to a folder and nothing else, it will assume
# everything in the folder is a comma-separated file
df = pandas_multi.read_csvs('./data/')
# if this is not the case, do this:
df = pandas_multi.read_csvs('./data/*.csv')

All options that are available to pandas.read_csv or pandas.concat can be passed into pandas_multi.read_csvs and will be redirected to the appropriate underlying functions.

If you wish to maintain a trace back to the original data, you can run the function with the keyword argument filenames_as_keys=True. Note that you should no longer use the keyword argument keys. This will be ignored.

Please note that the API for concatenation of Excel sheets is not yet stable.