Borscht
The Python Module of The Japanese sentence generator by Markov chain.
Description
The Python Module of The Japanese sentence generator by Markov chain.
Do you want to generate text by Markov chain? It's quite easy! Just copy the example below.
Requirement
If you want to run this module, please install these:
- Python 3.x
- MeCab - http://taku910.github.io/mecab/
- mecab-python3 (Python Module)
Recommendation:
We recommend that you use this MeCab dictionary, mecab-ipadic-NEologd
If you want detailed information about how to install MeCab and
mecab-ipadic-NEologd
, please see this post. (written in Japanese): https://qiita.com/taroc/items/b9afd914432da08dafc8
Example Usage
# Import borscht.
import borscht as bor
# Text used for chain generation of Markov chain
text = """
ãčĻĒč˛ããŽįĄéį ˛ã§å°äžãŽæããæã°ããããĻãããå°åĻæ ĄãĢåą
ãæååĻæ ĄãŽäēéããéŖãŗéããĻä¸éąéãģãŠč
°ãæãããäēãããã
ãĒããããĒįĄéãããã¨čãäēēãããããįĨããŦãåĨæŽĩæˇąãįįąã§ããĒãã
æ°į¯ãŽäēéããéĻãåēããĻããããåį´įãŽä¸äēēãåčĢãĢããããå¨åŧĩãŖãĻããããããéŖãŗéããäēã¯åēæĨãžããåŧąčĢããŧããã¨åããããã§ããã
å°äŊŋãĢč˛ ãļããŖãĻ帰ãŖãĻæĨãæããããã大ããĒįŧãããĻäēéãããããéŖãŗéããĻč
°ãæããåĨ´ããããã¨äēãŖããããããŽæŦĄã¯æããããĢéŖãã§čĻããžãã¨įããã
ãčĻĒéĄãŽããŽããčĨŋæ´čŖŊãŽãã¤ãã貰ãŖãĻåĨéēãĒåãæĨãĢįŋŗããĻãåéãĢčĻããĻããããä¸äēēãå
ãäēã¯å
ããåãããããĒãã¨äēãŖãã
åããŦäēãããããäŊã§ãåãŖãĻãŋããã¨åãåãŖãããããĒãåãŽæãåãŖãĻãŋãã¨æŗ¨æãããããäŊã æãããããŽéãã ã¨åŗãŽæãŽčĻĒæãŽį˛ãã¯ããĢåãčžŧãã ã
嚸ãã¤ããå°ãããŽã¨ãčĻĒæãŽéĒ¨ãå
ããŖããŽã§ãäģã ãĢčĻĒæã¯æãĢäģããĻããããããåĩįã¯æģãŦãžã§æļããŦã
ãåēãæąã¸äēåæŠãĢčĄãå°Ŋãã¨ãåä¸ãããĢããããã°ãããŽčåãããŖãĻãįä¸ãĢæ ãŽæ¨ãä¸æŦįĢãŖãĻããã
ããã¯åŊãã大äēãĒæ ã ãåŽãŽįããæåã¯čĩˇãæããĢčæ¸ãåēãĻčŊãĄãåĨ´ãæžãŖãĻããĻãåĻæ Ąã§éŖãã
čåãŽčĨŋå´ãåąąååąã¨ããčŗĒåąãŽåēįļãã§ãããŽčŗĒåąãĢåå¤Ēéã¨ããåä¸åãŽå
ãåą
ãã
åå¤Ēéã¯įĄčĢåŧąčĢã§ãããåŧąčĢãŽįãĢåã¤įŽåŖãäšããããĻãæ ãįãŋãĢãããããæĨãŽå¤æšææ¸ãŽčãĢé ããĻãã¨ãã¨ãåå¤ĒéãæãžããĻããŖãã
ããŽæåå¤Ēéã¯éãčˇ¯ãå¤ąãŖãĻãä¸įæ¸åŊãĢéŖãŗãããŖãĻãããåãã¯äēã¤ã°ããåš´ä¸ã§ããã
åŧąčĢã ãåã¯åŧˇããéĸãŽéããé ããããŖãĄãŽč¸ã¸åŽãĻãĻããããæŧããæåãĢãåå¤ĒéãŽé ãããšãŖãĻããããŽčĸˇãŽčĸãŽä¸ãĢã¯ããŖãã
éĒéãĢãĒãŖãĻæãäŊŋããŦãããįĄæãĢæãæ¯ãŖãããčĸãŽä¸ãĢããåå¤ĒéãŽé ããåŗåˇĻã¸ããããéĄããã
ããžããĢčĻãããŖãĻčĸãŽä¸ããããããŽäēãŽč
ã¸éŖãäģãããįããŖãããåå¤ĒéãåŖæ šã¸æŧãã¤ããĻãããĻãčļŗæĻããããĻåãã¸åããĻããŖãã
åąąååąãŽå°éĸã¯čåããå
å°ēããäŊããåå¤Ēéã¯åã¤įŽåŖãååå´ŠããĻãčĒåãŽé åã¸įéæ§ãĢčŊãĄãĻãããã¨äēãŖãã
åå¤ĒéãčŊãĄãã¨ããĢããããŽčĸˇãŽįčĸããããĻãæĨãĢæãčĒįąãĢãĒãŖããããŽæŠæ¯ãåąąååąãĢčŠĢãŗãĢčĄãŖãã¤ãã§ãĢčĸˇãŽįčĸãåãčŋããĻæĨãã
ãããŽå¤ããããã¯å¤§åããŖãã大åˇĨãŽå
ŧå
Ŧã¨č´åąãŽč§ãã¤ããĻãčäŊãŽäēēåį ãããããäēããããäēēåãŽčŊãåēæããŦåĻã¸čãä¸éĸãĢæˇããĻããŖãããã
ããŽä¸ã§ä¸äēēãåæĨį¸æ˛ãã¨ãã¤ãĨããĢåãŖãããäēēåããŋããĒč¸ãŋã¤ãļãããĻããžãŖããå¤åˇãŽæãŖãĻããį°åãŽäēæ¸ãåããĻå°ģãæãĄčžŧãžããäēãããã
å¤ĒãååŽãŽį¯ãæããĻãæˇąãåããä¸ããæ°´ã暧ãåēãĻããããããŽį¨˛ãĢãŋãããããäģæã§ããŖãã
ããŽæåã¯ãŠããĒäģæãįĨããŦãããįŗãæŖãĄããããã
ããã
ãäēæ¸ãŽä¸ã¸æŋãčžŧãã§ãæ°´ãåēãĒããĒãŖããŽãčĻåąããĻãããĄã¸å¸°ãŖãĻéŖ¯ãéŖãŖãĻãããã
å¤åˇãįčĩ¤ãĢãĒãŖãĻæéŗ´ãčžŧãã§æĨãããããįŊ°éãåēããĻæ¸ãã ããã§ããã
"""
# å¤įŽæŧąįŗ ãåãŖãĄããããã
cg = bor.ChainGenerator(text) # Generate instance of class that generates chains
cg.dump("./chain.json") # Dump json file written about chains' info.
tg = bor.TextGenerator("./chain.json")
# Generate instance of class that generates sentence.
for i in range(10):
sentence = tg.generate() # Generate sentence
print(tg.generate())
Installation
It's quite simple using pip command:
$ pip install borscht
Contribution
- Fork it (https://github.com/yu-san-19/Borscht)
- Create your feature branch (
git checkout -b feature/<your-new-feature>
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin feature/<your-new-feature>
) - Create new Pull Request
License
This repository licensed under the MIT.
Please see the LICENSE file.
Author
YuSan19 and other Contributors.