Need to create DOI in Zenodo and submit to pyOpenSci
catnet is a Python package for transforming tabular data into a network structure. It identifies co-occurrences of variables and categories in literature reviews and other tables, and builds a network dataframe that can be exported to a format accepted by other packages, such as networkx, and applications such as Gephi.
To install this package, run:
python -m pip install git+https://github.com/CamiBetancur/catnet/
To use catnet, you need to format your dataframe in one of the following ways:
"Long" format refers to data that has a column describing a categorical variable (`var_col`) and an identifier column (`id_col`) that identifies the entity each variable value belongs to. For example, in a literature review, a long dataframe that catnet can use could look like this (note that the columns do not need to be named `id_col` and `var_col`):
id_col | var_col | other_data_cols |
---|---|---|
doc_01 | Health | ... |
doc_01 | Water access | ... |
doc_01 | Water quality | ... |
doc_02 | Health | ... |
doc_02 | Energy generation | ... |
... | ... | ... |
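
As a rough illustration of what the long format encodes, the sketch below builds the table above in pandas and derives a co-occurrence edge list from it. It uses plain pandas only and is not catnet's API; the variable names (`long_df`, `pairs`, `edges`) and the `_source`/`_target` suffixes are assumptions made for this example.

```python
import pandas as pd

# Illustrative "long" dataframe mirroring the table above.
long_df = pd.DataFrame(
    {
        "id_col": ["doc_01", "doc_01", "doc_01", "doc_02", "doc_02"],
        "var_col": [
            "Health",
            "Water access",
            "Water quality",
            "Health",
            "Energy generation",
        ],
    }
)

# Drop repeated (id, category) rows so each co-occurrence is counted once per id.
long_df = long_df.drop_duplicates()

# Pair every category with every other category that shares the same id.
pairs = long_df.merge(long_df, on="id_col", suffixes=("_source", "_target"))

# Keep each unordered pair once and discard self-pairs.
pairs = pairs[pairs["var_col_source"] < pairs["var_col_target"]]

# Count how many ids each pair shares; this count acts as the edge weight.
edges = (
    pairs.groupby(["var_col_source", "var_col_target"])
    .size()
    .reset_index(name="weight")
)

print(edges)
```

An edge list of this shape can be written out with `edges.to_csv(...)` or, if networkx is installed, loaded with `networkx.from_pandas_edgelist`. How catnet itself structures its network dataframe may differ.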
For "long" dataframes, you can use the function [...] complement later.
Add information here about contributing to your package. Be sure to add links to your `CODE_OF_CONDUCT.md` file and your development guide. For now this section might be empty. You can go back and fill it in later.
Betancur Jaramillo, J. C. (2024). catnet source code (Version 0.1.0) [source code]. https://github.com/CamiBetancur/catnet/.
@misc{Betancur_2024,
title={catnet v0.1.0},
url={https://github.com/CamiBetancur/catnet},
publisher={Stockholm Environment Institute},
author={Betancur Jaramillo, Juan Camilo},
year={2024}}