A MeCab wrapper that provides a janome-like interface.
Detailed description of the wakame Python project
wakame
A MeCab wrapper that provides a janome-like interface.
Usage
import MeCab
from wakame.tokenizer import Tokenizer
from wakame.analyzer import Analyzer
from wakame.charfilter import *
from wakame.tokenfilter import *

text = '和布ちゃんこんにちは'

# Basic usage
tokenizer = Tokenizer()
tokens = tokenizer.tokenize(text)
for token in tokens:
    print(token)

# Word segmentation (wakati-gaki)
tokens = tokenizer.tokenize(text, wakati=True)
print(tokens)

# Using NEologd as the dictionary
tokenizer = Tokenizer(use_neologd=True)
tokens = tokenizer.tokenize(text)
for token in tokens:
    print(token)

# Using filters
char_filters = [RegexReplaceCharFilter('和布', 'wakame')]
token_filters = [POSKeepFilter('名詞'), POSStopFilter(['名詞,接尾'])]
analyzer = Analyzer(tokenizer, char_filters=char_filters, token_filters=token_filters)
tokens = analyzer.analyze(text)
for token in tokens:
    print(token)

# Getting token information as a DataFrame
tokenizer = Tokenizer()
analyzer = Analyzer(tokenizer)
df = analyzer.analyze_with_dataframe(text)
print(df)
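The filter example above chains character filters, a tokenizer, and token filters. The following is a minimal sketch of that ordering in plain Python, assuming wakame's Analyzer follows the same order as janome's (character filters first, then tokenization, then token filters). Everything in it is a hypothetical stand-in: `toy_tokenize`, `LEXICON`, `regex_replace`, `pos_keep`, `pos_stop`, and `analyze` are not part of wakame, and the part-of-speech tags are hand-written rather than real MeCab output.

```python
import re
from typing import Callable, List, NamedTuple


class Token(NamedTuple):
    surface: str
    part_of_speech: str  # comma-separated POS tag, e.g. '名詞,接尾'


# Hypothetical mini-lexicon standing in for MeCab; tags are hand-written.
LEXICON = {
    "wakame": "名詞,一般",
    "ちゃん": "名詞,接尾",
    "こんにちは": "感動詞",
}


def toy_tokenize(text: str) -> List[Token]:
    """Split text into the known lexicon words, skipping anything else."""
    pattern = "|".join(re.escape(word) for word in LEXICON)
    return [Token(word, LEXICON[word]) for word in re.findall(pattern, text)]


def regex_replace(pattern: str, repl: str) -> Callable[[str], str]:
    """Character filter: rewrite the raw text before tokenization."""
    return lambda text: re.sub(pattern, repl, text)


def pos_keep(prefixes: List[str]) -> Callable[[List[Token]], List[Token]]:
    """Token filter: keep tokens whose POS tag starts with any prefix."""
    def apply(tokens: List[Token]) -> List[Token]:
        return [t for t in tokens
                if any(t.part_of_speech.startswith(p) for p in prefixes)]
    return apply


def pos_stop(prefixes: List[str]) -> Callable[[List[Token]], List[Token]]:
    """Token filter: drop tokens whose POS tag starts with any prefix."""
    def apply(tokens: List[Token]) -> List[Token]:
        return [t for t in tokens
                if not any(t.part_of_speech.startswith(p) for p in prefixes)]
    return apply


def analyze(text, char_filters, tokenize, token_filters) -> List[Token]:
    """Apply char filters, tokenize, then apply token filters in order."""
    for char_filter in char_filters:
        text = char_filter(text)
    tokens = tokenize(text)
    for token_filter in token_filters:
        tokens = token_filter(tokens)
    return tokens


result = analyze(
    "和布ちゃんこんにちは",
    char_filters=[regex_replace("和布", "wakame")],
    tokenize=toy_tokenize,
    token_filters=[pos_keep(["名詞"]), pos_stop(["名詞,接尾"])],
)
print([t.surface for t in result])  # prints ['wakame']
```

With this toy lexicon, the character filter rewrites 和布 to wakame, the keep filter retains the two noun tokens, and the stop filter then removes the noun suffix ちゃん. Results from the real library depend on the MeCab dictionary in use.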
Installation
Installing MeCab (required)
brew install mecab
brew install mecab-ipadic
Installing mecab-ipadic-NEologd (optional)
brew install git curl xz
git clone --depth 1 git@github.com:neologd/mecab-ipadic-neologd.git
cd mecab-ipadic-neologd
./bin/install-mecab-ipadic-neologd -n
For details, see the mecab-ipadic-neologd repository.
Installing mecab-python3 (required)
brew install swig
pip install mecab-python3
Installing wakame
pip install wakame
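After the steps above, a quick way to check that the Python packages are visible to the interpreter is to look up their modules without importing them. This is only a sketch using the standard library: `missing_modules` is a hypothetical helper, not part of wakame, and it does not verify that the MeCab binary or its dictionaries are installed correctly.

```python
from importlib.util import find_spec
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


# "MeCab" is the module installed by mecab-python3; "wakame" by wakame.
missing = missing_modules(["MeCab", "wakame"])
if missing:
    print("Not importable:", ", ".join(missing))
else:
    print("MeCab and wakame are importable.")
```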