Python simil包_程序模块 - PyPI

用于语义字符串相似性的cli

simil的Python项目详细描述

语义字符串相似性cli

simil是^{}字符串相似性引擎的cli接口。它使用en_vectors_web_lg数据集比较字符串的英语语义相似性。给定两个单词、短语或句子，simil将告诉您它们的含义有多相似

安装

首先安装simil本身：

$ pip3 install --user -U simil

现在安装spacy的一个web矢量模型：

$ python3 -m spacy download en_vectors_web_lg

您可以在en_vectors_web_lg、en_core_web_lg和en_core_web_md之间进行选择，（en_core_web_sm根本不包括字向量，并且不能与simil一起使用。）simil将使用您安装的最大模型，首选vectors模型而不是core模型。

我建议使用大向量模型（en_vectors_web_lg），但为了节省磁盘空间或内存使用，您可能需要使用较小的模型

用法：

$ sim first_file.txt second_file.txt # compare two files
$ sim -s "first string""second string"# compare two strings

输出是一个介于0和1之间的数字，表示这两个字符串的相似程度。

详细信息：

simil使用spacy用^{}训练的词向量模型，例如^{}。

这可能是一个大数据集，这会导致启动时间过长。因此simil在后台剥离一个进程来保存模型，并在客户机-服务器模型下使用它。这意味着，如果连续运行simil多次，则只有第一次运行比较慢。

这个后台进程确实占用了相当多的内存，通常大约2GB（对于en_vectors_web_lg模型）。不活动10分钟后，它将自动被终止，以避免无限期占用内存。您可以使用--timeout标志更改此超时的长度。

欢迎加入QQ群-->： 979659372

simil 0.0.2

simil的Python项目详细描述

语义字符串相似性cli

安装

用法：

详细信息：

推荐PyPI第三方库

django-crosswalk-client

odoo8-addons-oca-pos

os-xcode-tools

aiotraversal

dyfunconn

kw-mone

pyatool

cinnamon

kiwi-flight-events-oag-processing

steem_bot_checker

openProductionHW

plat

dab

uptodate

requirements.pip

导航栏

项目链接

标签

维护者

最新PyPI项目

最新Python常见问题

simil 0.0.2

simil的Python项目详细描述

语义字符串相似性cli

安装

用法：

详细信息：

推荐PyPI第三方库

django-crosswalk-client

odoo8-addons-oca-pos

os-xcode-tools

aiotraversal

dyfunconn

kw-mone

pyatool

cinnamon

kiwi-flight-events-oag-processing

steem_bot_checker

openProductionHW

plat

dab

uptodate

requirements.pip

导 航 栏

项目 链接

标 签

维护者

最新PyPI项目

最新Python常见问题

导航栏

项目链接

标签