ldc-doc

Python3 library that adds MS Word .doc support to the llm-dataset-converter library.


License
MIT
Install
pip install ldc-doc==0.0.1

Documentation

ldc-doc

Adds MS Word .doc support to the llm-dataset-converter library.

Requirements

  • antiword available on PATH

    • Debian/Ubuntu: sudo apt install antiword
    • Windows: Softpedia

Installation

pip install git+https://github.com/waikato-llm/llm-dataset-converter.git
pip install git+https://github.com/waikato-llm/ldc-doc.git

Plugins

See here for an overview of all plugins.