lex
An elegant armor-plated JavaScript lexer modelled after flex. Easily extensible to tailor to your...
Latest release 1.7.9 - Updated - 288 stars
php-ai/php-ml
PHP-ML - Machine Learning library for PHP
Latest release 0.8.0 - Updated - 7.57K stars
yooper/php-text-analysis
PHP Text Analysis is a library for performing Information Retrieval (IR) and Natural Language Pro...
Latest release 1.5.5 - Updated - 366 stars
youtokentome
Unsupervised text tokenizer focused on computational efficiency
Latest release 1.0.6 - Updated - 501 stars
pyonmttok
OpenNMT tokenization library
Latest release 1.18.4 - Updated - 91 stars
wink-tokenizer
Multilingual tokenizer that automatically tags each token with its type
Latest release 5.2.1 - Updated - 15 stars
Stanford.NLP.Segmenter
Tokenization of raw text is a standard pre-processing step for many NLP tasks. For English, token...
Latest release 3.9.2 - Updated - 454 stars
php-text-analysis/php-text-analysis
PHP Text Analysis is a library for performing Information Retrieval (IR) and Natural Language Pro...
Latest release 1.5.5 - Updated - 366 stars
udpipe
Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NL...
Latest release 0.8.3 - Updated - 138 stars
html-tokenizer
Small, fast, event-driven, fault-tolerant html tokenizer. Works in node or browsers.
Latest release 3.0.0 - Updated - 10 stars
lexer
An elegant armor-plated JavaScript lexer modelled after flex. Easily extensible to tailor to your...
Latest release v1.7.8 - Published - 288 stars
salient
Salient is a natural language processing and sentiment analysis library
Latest release 0.3.0 - Updated - 216 stars
pymystem3
Python wrapper for the Yandex MyStem 3.1 morphological analyzer of the Russian language.
Latest release 0.2.0 - Updated - 135 stars
Orange3-Text
Orange3 TextMining add-on.
Latest release 0.9.1 - Updated - 75 stars
GrammarEngineApi
An extended version of the original Russian Grammatical Dictionary and Thesaurus C# API from sola...
Latest release 1.1.61 - Updated - 37 stars
tokenizr
String Tokenization Library for JavaScript
Latest release 1.5.7 - Published - 36 stars
textoken
Textoken is a Ruby library for text tokenization. This gem extracts words from text with many cus...
Latest release 1.2.1 - Updated - 29 stars
rosette-api
Rosette API Node.js client SDK
Latest release 1.14.3 - Updated - 6 stars
nlpcube
Natural Language Processing Toolkit with support for tokenization, sentence splitting, lemmatizat...
Latest release 0.1.0.8 - Updated - 273 stars
az
An NLP library for the Russian language
Latest release 0.2.3 - Updated - 220 stars
razdel
Splits Russian text into tokens, sentences, and sections. Rule-based.
Latest release 0.5.0 - Updated - 73 stars
germalemma
A lemmatizer for German language text.
Latest release 0.1.3 - Updated - 52 stars
wink-lemmatizer
English lemmatizer
Latest release 3.0.1 - Updated - 30 stars
attacut
Fast and Reasonably Accurate Word Tokenizer for Thai
Latest release 1.1.0.dev0 - Updated - 23 stars
phpmorphy
Original package is located at http://phpmorphy.sourceforge.net/
Latest release 2.3.2 - Updated - 19 stars
@cybersource/flex-sdk-web
Easily create payment tokens using Flex API
Latest release 0.3.1 - Updated - 16 stars
epub-conversion
Python package for converting xml and epubs to text files
Latest release 1.0.12 - Updated - 13 stars
rftokenizer
A character-wise tokenizer for morphologically rich languages
Latest release 1.1.0 - Updated - 11 stars
ciseau
Word and sentence tokenization.
Latest release 1.0.1 - Updated - 9 stars
github.com/Factom-Asset-Tokens/fatd/factom
FAT Golang Reference Implementation & Daemon
Latest release v1.0.0 - Published - 8 stars