python-mecab

A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)


Keywords
mecab, python-c-extension, text-preprocessing, text-processing, tokenizer
License
BSD-3-Clause
Install
pip install python-mecab==1.0.1

Documentation

python-mecab

Run Test Status codecov Codacy Badge

Py Versions PyPi Versions License

A repository to bind mecab for Python 3.5+. Not using swig nor pybind.

Support only Linux, macOS

Original source codes: taku910/mecab

Installation

pip install python-mecab

Usage

Tagger

with eunjeon/mecab-ko-dic.

>>> from mecab import Tagger
>>> tagger = Tagger() # or Tagger('path/to/dic')
>>> tagger.parse("μ•ˆλ…•ν•˜μ„Έμš”. 이 ν”„λ‘œμ νŠΈλŠ” python-mecabμž…λ‹ˆλ‹€.")
(('μ•ˆλ…•', 'NNG,ν–‰μœ„,T,μ•ˆλ…•,*,*,*,*'), ('ν•˜', 'XSV,*,F,ν•˜,*,*,*,*'), ('μ„Έμš”', 'EP+EF,*,F,μ„Έμš”,Inflect,EP,EF,μ‹œ/EP/*+μ–΄μš”/EF/*'), ('.', 'SF,*,*,*,*,*,*,*'), ('이', 'MM,~λͺ…사,F,이,*,*,*,*'), ('ν”„λ‘œμ νŠΈ', 'NNG,*,F,ν”„λ‘œμ νŠΈ,*,*,*,*'), ('λŠ”', 'JX,*,T,λŠ”,*,*,*,*'), ('python', 'SL,*,*,*,*,*,*,*'), ('-', 'SY,*,*,*,*,*,*,*'), ('mecab', 'SL,*,*,*,*,*,*,*'), ('μž…λ‹ˆλ‹€', 'VCP+EF,*,F,μž…λ‹ˆλ‹€,Inflect,VCP,EF,이/VCP/*+α„‡λ‹ˆλ‹€/EF/*'), ('.', 'SF,*,*,*,*,*,*,*'))
>>> parsed = tagger.parse("μ•ˆλ…•ν•˜μ„Έμš”. 이 ν”„λ‘œμ νŠΈλŠ” python-mecabμž…λ‹ˆλ‹€.")
>>> print(*parsed, sep='\n')
('μ•ˆλ…•', 'NNG,ν–‰μœ„,T,μ•ˆλ…•,*,*,*,*')
('ν•˜', 'XSV,*,F,ν•˜,*,*,*,*')
('μ„Έμš”', 'EP+EF,*,F,μ„Έμš”,Inflect,EP,EF,μ‹œ/EP/*+μ–΄μš”/EF/*')
('.', 'SF,*,*,*,*,*,*,*')
('이', 'MM,~λͺ…사,F,이,*,*,*,*')
('ν”„λ‘œμ νŠΈ', 'NNG,*,F,ν”„λ‘œμ νŠΈ,*,*,*,*')
('λŠ”', 'JX,*,T,λŠ”,*,*,*,*')
('python', 'SL,*,*,*,*,*,*,*')
('-', 'SY,*,*,*,*,*,*,*')
('mecab', 'SL,*,*,*,*,*,*,*')
('μž…λ‹ˆλ‹€', 'VCP+EF,*,F,μž…λ‹ˆλ‹€,Inflect,VCP,EF,이/VCP/*+α„‡λ‹ˆλ‹€/EF/*')
('.', 'SF,*,*,*,*,*,*,*')

binded cli commands

  • mecab
  • mecab-dict-index
  • mecab-dict-gen
  • mecab-test-gen
  • mecab-cost-train
  • mecab-system-eval