Python tokenizers-collection包_程序模块 - PyPI

一个使用一组中文标记器的简单迭代器

tokenizers-collection的Python项目详细描述

中文分词器集合

https://img.shields.io/pypi/v/chinese_tokenzier_iterator.svg

https://img.shields.io/travis/howl-anderson/chinese_tokenzier_iterator.svg

一些中文分词器的简单封装和集合

Free software: MIT license
Documentation: https://chinese-tokenzier-iterator.readthedocs.io.

Features

TODO

使用

fromtokenizers_collection.configimporttokenizer_registryforname,tokenizerintokenizer_registry:print("Tokenizer: {}".format(name))tokenizer('input_file.txt','output_file.txt')

安装

pip install tokenizers_collection

更新许可文件与下载模型

因为其中有些模型需要更新许可文件（比如：pynlpir）或者需要下载模型文件（比如：pyltp），因此安装后需要执行特定的命令完成操作，这里已经将所有的操作封装成了一个函数，只需要执行类似如下的指令即可

python -m tokenizers_collection.helper

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2018-08-28)

First release on PyPI.

欢迎加入QQ群-->： 979659372

tokenizers-collection 0.1.2

tokenizers-collection的Python项目详细描述

中文分词器集合

Features

使用

安装

更新许可文件与下载模型

Credits

History

0.1.0 (2018-08-28)

推荐PyPI第三方库

catastrop

typedtensor

recordb

zhihu_oauth

vortexai

kylileo

mosaicode-lib-c-opencv

c7n-kube

emailreplyparser

ccfs

doufo

twitter.common.rpc

printqiantao

FastSync

cvopt

导航栏

项目链接

标签

维护者

最新PyPI项目

最新Python常见问题

tokenizers-collection 0.1.2

tokenizers-collection的Python项目详细描述

中文分词器集合

Features

使用

安装

更新许可文件与下载模型

Credits

History

0.1.0 (2018-08-28)

推荐PyPI第三方库

catastrop

typedtensor

recordb

zhihu_oauth

vortexai

kylileo

mosaicode-lib-c-opencv

c7n-kube

emailreplyparser

ccfs

doufo

twitter.common.rpc

printqiantao

FastSync

cvopt

导 航 栏

项目 链接

标 签

维护者

最新PyPI项目

最新Python常见问题

导航栏

项目链接

标签