mecab-ko


Licenses
GPL-2.0-only/LGPL-2.1-only/BSD-3-Clause
Install
conda install -c conda-forge mecab-ko

Documentation

Introduction to mecab-ko

mecab-ko is a fork of MeCab for use in the Eunjeon Hannip (은전한닢) project.

The goal is to add features suited to the characteristics of Korean while changing MeCab as little as possible.

Features added in mecab-ko

Increased cost for specific parts of speech preceded by white space

Unlike Japanese, which is written without spaces between words, Korean uses spacing. To reflect this, mecab-ko can increase the cost of specific parts of speech when they are preceded by white space. The penalty values are set in the dictionary configuration file (dicrc).

Analysis with mecab

:::text
화학 이외의 것
화학    NN,T,화학,*,*,*,*
이      JKS,F,이,*,*,*,*
외      NN,F,외,*,*,*,*
의      JKG,F,의,*,*,*,*
것      NNB,T,것,*,*,*,*
EOS

Analysis with mecab-ko

:::text
화학 이외의 것
화학    NN,T,화학,*,*,*,*
이외    NN,F,이외,*,*,*,*
의      JKG,F,의,*,*,*,*
것      NNB,T,것,*,*,*,*
EOS
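
The two listings differ in one place: stock MeCab splits 이외 into 이 (JKS) and 외 (NN), while mecab-ko keeps 이외 as a single NN token. The following is a minimal sketch for comparing such results programmatically. It assumes MeCab's default text output (one token per line, surface form and comma-separated features separated by whitespace, terminated by `EOS`). The function name `parse_mecab_output` and the two sample strings are illustrative and not part of mecab-ko.

```python
from typing import List, Tuple


def parse_mecab_output(output: str) -> List[Tuple[str, str]]:
    """Return (surface, POS tag) pairs from MeCab-style text output."""
    tokens = []
    for line in output.splitlines():
        line = line.strip()
        if not line or line == "EOS":
            continue
        # Surface and features are separated by a tab (or spaces in listings).
        surface, features = line.split(None, 1)
        # The first feature field is the part-of-speech tag.
        tokens.append((surface, features.split(",")[0]))
    return tokens


# Token lines copied from the two listings above (hypothetical variable names).
MECAB_RESULT = """화학\tNN,T,화학,*,*,*,*
이\tJKS,F,이,*,*,*,*
외\tNN,F,외,*,*,*,*
의\tJKG,F,의,*,*,*,*
것\tNNB,T,것,*,*,*,*
EOS"""

MECAB_KO_RESULT = """화학\tNN,T,화학,*,*,*,*
이외\tNN,F,이외,*,*,*,*
의\tJKG,F,의,*,*,*,*
것\tNNB,T,것,*,*,*,*
EOS"""

plain = parse_mecab_output(MECAB_RESULT)
korean = parse_mecab_output(MECAB_KO_RESULT)

# Tokens that appear in only one of the two analyses.
print([t for t in plain if t not in korean])   # [('이', 'JKS'), ('외', 'NN')]
print([t for t in korean if t not in plain])   # [('이외', 'NN')]
```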

Configuration

Set the following in the MeCab dictionary configuration file (dicrc).

:::text
# Increases the connection cost of parts of speech that have white space on their left.
# This setting is used only by mecab-ko and has the following format:
# <posid 1>,<posid 1 penalty cost>,<posid 2>,<posid 2 penalty cost> ...
# 
# Example: 120,6000 => when the part of speech with posid 120 (a particle) has
# white space on its left, its connection cost is increased by 6000
left-space-penalty-factor = 120,6000,184,6000,100,500
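
To make the value format concrete, here is a minimal sketch that reads a `left-space-penalty-factor` value into a posid-to-penalty mapping and adds the penalty to a cost. It only illustrates the format described in the comments above and is not mecab-ko's actual implementation. The function names and the base cost of 1000 are assumptions made for the example.

```python
from typing import Dict


def parse_left_space_penalty(value: str) -> Dict[int, int]:
    """Parse '<posid>,<penalty>,<posid>,<penalty>,...' into {posid: penalty}."""
    fields = [f.strip() for f in value.split(",") if f.strip()]
    if len(fields) % 2 != 0:
        raise ValueError("expected an even number of fields (posid,penalty pairs)")
    return {int(fields[i]): int(fields[i + 1]) for i in range(0, len(fields), 2)}


def penalized_cost(base_cost: int, posid: int, has_left_space: bool,
                   penalties: Dict[int, int]) -> int:
    """Add the configured penalty when the token is preceded by white space."""
    if has_left_space:
        return base_cost + penalties.get(posid, 0)
    return base_cost


penalties = parse_left_space_penalty("120,6000,184,6000,100,500")
print(penalties)                                    # {120: 6000, 184: 6000, 100: 500}
print(penalized_cost(1000, 120, True, penalties))   # 7000 (hypothetical base cost 1000)
print(penalized_cost(1000, 120, False, penalties))  # 1000
```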

Installing and using mecab-ko

Installing mecab-ko

Download the latest source release from the mecab-ko download page and install it. Extract the tar.gz archive and build it with the usual sequence for free software packages.

:::text
$ tar zxfv mecab-ko-XX.tar.gz
$ cd mecab-ko-XX
$ ./configure 
$ make
$ make check
$ su
# make install
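
After `make install`, one way to confirm that the build is reachable is a small check like the sketch below. It assumes the install places an executable named `mecab` on `PATH` and that it accepts the `--version` option, as upstream MeCab does. The helper name `mecab_version` is hypothetical.

```python
import shutil
import subprocess
from typing import Optional


def mecab_version(executable: str = "mecab") -> Optional[str]:
    """Return the text printed by `<executable> --version`, or None if not on PATH."""
    path = shutil.which(executable)
    if path is None:
        return None
    result = subprocess.run([path, "--version"],
                            capture_output=True, text=True, check=False)
    # Some builds may print to stderr, so fall back to it when stdout is empty.
    return (result.stdout or result.stderr).strip()


if __name__ == "__main__":
    print(mecab_version() or "mecab not found on PATH")
```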

The installation procedure is the same as for MeCab; see the MeCab homepage for details.

See also

Installing and using the Korean dictionary (mecab-ko-dic)

See the mecab-ko-dic documentation.

License

mecab-ko is distributed under the same license terms as MeCab.

MeCab is free software. You may use and redistribute it under the terms of the GPL (the GNU General Public License), the LGPL (Lesser GNU General Public License), or the BSD license. For details, see the COPYING, GPL, LGPL, and BSD files.