The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also included maximum entropy and perceptron based machine learning.


Keywords
NLP, OpenNLP, Tokenizer, Categorizer, Chunker, Parser, POS, NER, IKVM, dotnet, fsharp, recompiled-packages
License
Apache-2.0
Install
Install-Package OpenNLP.NET -Version 1.9.4.1

Documentation

Build Status

OpenNLP for .NET NuGet Badge

OpenNLP for .NET is a port of OpenNLP to .NET.

This project contains build scripts that recompile OpenNLP .jar packages to .NET assemblies using IKVM.NET and tests that help to be sure that recompiled packages are workable. Recompiled assemblies are available on NuGet.

.NET samples are available in tests. Manual is available on official site.

Versioning

Versioning model used for NuGet packages is aligned to versioning used by OpenNLP Team. For example, if you get OpenNLP package from OpenNLP site with version 1.5.3, then the NuGet version of this package has a version 1.5.3.x, where x is the greatest that is available on NuGet. Last number is used for internal versioning of .NET assemblies.

Licensing

Project is licensed under the terms of the Apache 2.0 license