PDF4Cat: a simple and powerful tool for processing PDF documents using PyMuPDF
- CLI
- Async work & optimizations
- Merge
- Split
- Rotate
- Edit Pages
- Delete pages and save the result as a new PDF
- Extract pages and save them as a new PDF
- Protect (Encrypt)
- Unlock (Decrypt)
- Compress (Flate)
- OCR PDF
- PDF to images
- Images to PDF
- DOCX
- PowerPoint
- OpenOffice documents
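OCR requires the Tesseract language data (tessdata). Typical tessdata locations:
Windows: C:\Program Files\Tesseract-OCR\tessdata
Unix systems: /usr/share/tesseract-ocr/4.00/tessdata
Set the TESSDATA_PREFIX environment variable to that directory:
Windows: set TESSDATA_PREFIX=C:\Program Files\Tesseract-OCR\tessdata
Unix systems: export TESSDATA_PREFIX=/usr/share/tesseract-ocr/4.00/tessdata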
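
If setting the variable in the shell is inconvenient, it can also be set from Python before any OCR call is made. The following is a rough sketch using only the standard library; the function names `resolve_tessdata` and `ensure_tessdata_prefix` are hypothetical, and the fallback paths are simply the ones listed above, which may not match every installation.

```python
import os
import sys

# Fallback locations copied from the paths listed above (assumed, not detected).
DEFAULT_TESSDATA = {
    "windows": r"C:\Program Files\Tesseract-OCR\tessdata",
    "unix": "/usr/share/tesseract-ocr/4.00/tessdata",
}


def resolve_tessdata(env=None, platform=None):
    """Return TESSDATA_PREFIX if it is set, else the fallback for the platform."""
    env = os.environ if env is None else env
    platform = sys.platform if platform is None else platform
    explicit = env.get("TESSDATA_PREFIX")
    if explicit:
        return explicit
    key = "windows" if platform.startswith("win") else "unix"
    return DEFAULT_TESSDATA[key]


def ensure_tessdata_prefix(env=None, platform=None):
    """Store the resolved path in env and return it."""
    env = os.environ if env is None else env
    path = resolve_tessdata(env, platform)
    env["TESSDATA_PREFIX"] = path
    return path


if __name__ == "__main__":
    path = ensure_tessdata_prefix()
    if not os.path.isdir(path):
        print(f"Warning: tessdata directory not found: {path}")
```

Calling `ensure_tessdata_prefix()` early in a script leaves an existing TESSDATA_PREFIX untouched and only falls back to the listed paths when the variable is missing or empty.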