YaLafi: Yet another LaTeX filter

Important: Maintainer change. This program was developed by matze-dd until version 1.3.1 and is now maintained by torik42.

If you have a local copy of this repository, GitHub will automatically redirect your git fetch and git pull commands and all links to the GitHub repository. Updating the URL is nevertheless recommended. If you run git remote -v in your local repository and see something like

name-of-remote  git@github.com:matze-dd/YaLafi.git (fetch)
name-of-remote  git@github.com:matze-dd/YaLafi.git (push)

where name-of-remote is the name of the remote, you should update the URL with

git remote set-url name-of-remote git@github.com:torik42/YaLafi.git

if you use ssh, or

git remote set-url name-of-remote https://github.com/torik42/YaLafi.git

if you use https. You may also update existing links to https://github.com/matze-dd/YaLafi.

Notice. The library of LaTeX macros, environments, document classes, and packages is still rather restricted, compare the list of macros. Please don’t hesitate to raise an Issue, if you would like to have added something.

If you want to add something yourself, have a look at Inclusion of own macros and CONTRIBUTING.md.

Summary. This Python package extracts plain text from LaTeX documents. The software may be integrated with a proofreading tool and an editor. It provides

mapping of character positions between LaTeX and plain text,
simple inclusion of own LaTeX macros and environments with tailored treatment,
careful conservation of text flows,
some parsing of displayed equations for detection of included “normal” text and of interpunction problems,
support of multi-language documents (experimental).

The sample Python application yalafi.shell from section Example application integrates the LaTeX filter with the proofreading software LanguageTool. It sends the extracted plain text to the proofreader, maps position information in returned messages back to the LaTeX text, and generates results in different formats. You may easily

create a proofreading report in text or HTML format for a complete document tree,
check LaTeX texts in the editors Vim, Emacs and Atom via several plugins,
run the script as emulation of a LanguageTool server with integrated LaTeX filtering.

For instance, the LaTeX input

Only few people\footnote{We use
\textcolor{red}{redx colour.}}
is lazy.

will lead to the text report

1.) Line 2, column 17, Rule ID: MORFOLOGIK_RULE_EN_GB
Message: Possible spelling mistake found
Suggestion: red; Rex; reds; redo; Red; Rede; redox; red x
Only few people is lazy.    We use redx colour. 
                                   ^^^^
2.) Line 3, column 1, Rule ID: PEOPLE_VBZ[1]
Message: If 'people' is plural here, don't use the third-person singular verb.
Suggestion: am; are; aren
Only few people is lazy.    We use redx colour. 
                ^^

This is the corresponding HTML report (for an example with a Vim plugin, see here):

The tool builds on results from project Tex2txt, but differs in the internal processing method. Instead of using recursive regular expressions, a simple tokeniser and a small machinery for macro expansion are implemented; see sections Differences to Tex2txt and Remarks on implementation.

Beside the interface from section Python package interface, application Python scripts like yalafi/shell/shell.py from section Example application can access an interface emulating tex2txt.py from repository Tex2txt by from yalafi import tex2txt. The pure LaTeX filter can be directly used in scripts via a command-line interface, it is described in section Command-line of pure filter.

If you use this software and encounter a bug or have other suggestions for improvement, please leave a note under category Issues, or initiate a pull request. Many thanks in advance.

Happy TeXing!

Installation
Authors and Maintainers
Example application
Interfaces to Vim
Interface to Emacs
Interface to Atom
Usage under Windows
Related projects

Filter actions
Fundamental limitations
Adaptation of LaTeX and plain text
Extension modules for LaTeX packages
Inclusion of own macros

Multi-file projects
Handling of displayed equations
Multi-language documents
Python package interface
Command-line of pure filter
Differences to Tex2txt
Remarks on implementation

Installation

YaLafi (at least with Python version 3.6). Choose one of the following possibilities.

Use python -m pip install [--user] yalafi. This installs the last version uploaded to PyPI. Module pip itself can be installed with python -m ensurepip.
Say python -m pip install [--user] git+https://github.com/torik42/YaLafi.git@master. This installs the current snapshot from here.
Download the archive from here and unpack it. Place yalafi/ in the working directory, or in a standard directory like /usr/lib/python3.8/ or ~/.local/lib/python3.8/site-packages/. You can also locate it somewhere else and set environment variable PYTHONPATH accordingly.
For developing YaLafi, editable installs are recommended. See Contributing.md for details.

LanguageTool. On most systems, you have to install the software “manually” (1). At least under Arch Linux, you can also use a package manager (2). Please note that, for example under Ubuntu, sudo snap install languagetool will not install the components required here.

The LanguageTool zip archive, for example LanguageTool-5.0.zip, can be obtained from the LanguageTool download page. Option --lt-directory of application yalafi.shell from section Example application has to point to the directory created after uncompressing the archive at a suitable place. For instance, the directory has to contain file languagetool-server.jar.
Under Arch Linux, you can simply say sudo pacman -S languagetool. In this case, it is not necessary to set option --lt-directory from variant 1. Instead, you have to specify --lt-command languagetool.

yalafi Release 1.1.1

Release 1.1.1 Toggle Dropdown 1.5.0 1.4.0 1.3.1 1.3.0 1.2.0 1.1.7 1.1.6 1.1.5 1.1.4 1.1.3

Documentation

YaLafi: Yet another LaTeX filter

Contents

Installation

Authors and Maintainers

Example application

Interfaces to Vim

Plugin vimtex

“Plain Vim”

Plugin vim-grammarous

Plugin vim-LanguageTool

Plugin ALE

Interface to Emacs

Interface to Atom

Usage under Windows

Related projects

Filter actions

Fundamental limitations

Adaptation of LaTeX and plain text

Modification of LaTeX text

Phrase replacement in the plain text

Extension modules for LaTeX packages

Inclusion of own macros

Definition of macros

Definition of environments

Definition of equation environments

Macro handler functions

Multi-file projects

Handling of displayed equations

Trivial version

Simple version

Full version

Inclusion of “normal” text

Equation replacements in English documents

Multi-language documents

Python package interface

Command-line of pure filter

Differences to Tex2txt

Remarks on implementation

Scanner / tokeniser

Parser

Parser for maths material

Removal of unnecessary blank lines

Stats

Development practices

Releases

Contributors

yalafi
Release 1.1.1

Release 1.1.1

1.5.0

1.4.0

1.3.1

1.3.0

1.2.0

1.1.7

1.1.6

1.1.5

1.1.4

1.1.3