kr-sentence 0.0.3 on PyPI

A light-weight sentence tokenizer for Korean.

Half-width punctuation is generally used in Korean, but this tokenizer also supports full-width punctuation. (For details about full-width punctuation in Korean, please see https://www.w3.org/TR/klreq/).

Installation

pip install kr-sentence

Sample Code:

from kr_sentence.tokenizer import tokenize

paragraph_str = "저는 미국인이에요. 만나서 반갑습니다."

sentence_list = tokenize(paragraph_str)

for sentence in sentence_list:
	print(sentence)

Other languages

JavaScript -> https://github.com/Rairye/js-sentence-tokenizers

kr-sentence
Release 0.0.3

Release 0.0.3

0.0.3

0.0.2

0.0.1

Documentation

Installation

Sample Code:

Other languages

Stats

Development practices

Releases

Contributors

kr-sentence Release 0.0.3

Release 0.0.3 Toggle Dropdown 0.0.3 0.0.2 0.0.1

Documentation

Installation

Sample Code:

Other languages

Stats

Development practices

Releases

Contributors

kr-sentence
Release 0.0.3

Release 0.0.3

0.0.3

0.0.2

0.0.1