A library to format datasets so that you don't have to.


License
GPL-3.0-only
Install
pip install skit-fixdf==0.1.12

Documentation

skit-fixdf

Changing database schemas, object formats, and other such pains that come along with a fast changing environment is a familiar problem? We have gone through the same. This library intends to be a mitigation plan (not a solution) for these situations where you download a dataset from our platform and are unable to use it along with other standard offerings.

Installation

Install the library via pip

pip install skit-fixdf
skit-fixdf -h

Usage

❯ skit-fixdf -h
usage: Formatting/Patching dataframes obtained via skit-calls and skit-df.
       [-h] [-v] {calls,labels} ...

positional arguments:
  {calls,labels}  Source of dataframe.
    calls         Untagged Dataframe obtained via skit-calls.
    labels        Tagged Dataframe obtained via skit-df.

optional arguments:
  -h, --help      show this help message and exit
  -v, --verbose   Increase verbosity.
❯ skit-fixdf calls -h
usage: Formatting/Patching dataframes obtained via skit-calls and skit-df. calls
       [-h] [--input INPUT] [--output OUTPUT]

optional arguments:
  -h, --help            show this help message and exit
  --input INPUT, -i INPUT
                        Input csv file.
  --output OUTPUT, -o OUTPUT
                        Output csv file.
❯ skit-fixdf labels -h
usage: Formatting/Patching dataframes obtained via skit-calls and skit-df. labels
       [-h] --input INPUT [--output OUTPUT] [--token TOKEN] [--task-type DATASET_TYPE]

optional arguments:
  -h, --help            show this help message and exit
  --input INPUT, -i INPUT
                        Input csv file.
  --output OUTPUT, -o OUTPUT
                        Output csv file.
  --token TOKEN, -t TOKEN
                        Token make requests to duckling service.
  --task-type DATASET_TYPE, -tt DATASET_TYPE
                        Task type.

Examples

WIP