the words of meaning toolkit library
Words of Meaning is a project designed with the aim of creating, processing, and analysing corpora of speech and text.
In order to facilitate this, the
wom toolkit library provides tools and interfaces for the aforementioned.
Currently we have one sub-project, corpgen, a tool for downloading youtube video subtitles to our common corpus format.
More documentation is to come regarding our common format and future plans.