wanish Python package - PyPI

An open-source implementation of Summly

Detailed description of the wanish Python project

About

This package summarizes a text by reducing an article to a few sentences that preserve its main ideas.
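
As a rough illustration of what reducing an article to a few sentences can mean, here is a minimal sketch of frequency-based extractive summarization. It is not wanish's actual algorithm, which this description does not specify; the function name `naive_summary` and the scoring rule are assumptions made for this example only.

```python
import re
from collections import Counter


def naive_summary(text, max_sentences=5):
    """Pick the highest-scoring sentences, returned in their original order.

    Hypothetical helper (not part of wanish): a sentence is scored by the
    average frequency of its words across the whole text.
    """
    # Split on sentence-ending punctuation followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    if len(sentences) <= max_sentences:
        return sentences

    word_counts = Counter(re.findall(r"\w+", text.lower()))

    def score(sentence):
        words = re.findall(r"\w+", sentence.lower())
        return sum(word_counts[w] for w in words) / len(words) if words else 0.0

    # Rank sentence indices by score, keep the best ones, then restore order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return [sentences[i] for i in keep]


sample = "Cats sleep. Cats eat fish. Dogs bark. Cats purr loudly."
short = naive_summary(sample, max_sentences=2)
```

The default of five sentences mirrors the default mentioned for the package's summary; everything else here is a simplification.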

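In addition, the package extracts the following from a document:

The canonical URL of the article
The title of the article
The URL of an image describing the article
The article with excess information removed (headers, footers, navigation, ads, etc.), based on schema.org structured data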
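
To show where such fields typically live in a page, the following sketch pulls a canonical URL, a title and an image URL out of raw HTML with the standard library. It is an illustration only, not wanish's implementation: the class `MetaExtractor`, the function `extract_meta`, and the choice of the `og:image` meta tag as the image source are all assumptions.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collect canonical URL, title and og:image from HTML (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.canonical_url = None
        self.title = None
        self.image_url = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.canonical_url = attrs.get("href")
        elif tag == "meta" and attrs.get("property") == "og:image":
            # Assumption: the article image is advertised via Open Graph.
            self.image_url = attrs.get("content")
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Keep only the first text chunk found inside <title>.
        if self._in_title and self.title is None:
            self.title = data.strip()


def extract_meta(html):
    """Return the three fields as a dict; missing ones are None."""
    parser = MetaExtractor()
    parser.feed(html)
    return {
        "url": parser.canonical_url,
        "title": parser.title,
        "image_url": parser.image_url,
    }


page = (
    "<html><head><title> Hello </title>"
    '<link rel="canonical" href="https://example.com/a">'
    '<meta property="og:image" content="https://example.com/i.png">'
    "</head><body></body></html>"
)
meta = extract_meta(page)
```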

DEMO

Installation

easy_install wanish
or
pip install wanish

Usage

from wanish import Wanish

wanish = Wanish()
wanish.perform_url(document_url)

# getting doc's source canonical url
url = wanish.url

# getting document's title
title = wanish.title

# getting url of related image if document has it
image_url = wanish.image_url

# getting two-letter code of the document's language (en, de, es...)
language_code = wanish.language

# getting a clean html page of a document with article
clean_html = wanish.clean_html

# getting a short summarized description of the article reduced to several sentences (5 by default)
description = wanish.description
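
If the extracted values need to be passed around together, one option is to copy them into a plain dict. The sketch below assumes only the attribute names shown in the usage example; the helper `collect_fields` and the stand-in object are hypothetical, and the stand-in is there so the snippet runs without network access or wanish installed.

```python
from types import SimpleNamespace

# Attribute names taken from the usage example.
FIELDS = ("url", "title", "image_url", "language", "clean_html", "description")


def collect_fields(extractor):
    """Copy the documented attributes into a plain dict.

    Attributes missing on the object are reported as None.
    """
    return {name: getattr(extractor, name, None) for name in FIELDS}


# Stand-in for a Wanish instance after perform_url() has run.
fake = SimpleNamespace(url="https://example.com/post", title="Example", language="en")
result = collect_fields(fake)
print(result["title"], result["image_url"])  # prints: Example None
```

With the real package, the same call would presumably be `collect_fields(wanish)` after `wanish.perform_url(document_url)`.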

Available kwargs for the Wanish() class (all of them are optional):

wanish = Wanish(
    url=document_url,
    positive_keywords=["main", "story"],
    negative_keywords=["banner", "adv", "similar", "top-ad"],
    summary_sentences_qty=5,
    headers={'user-agent': 'test-purposes/0.0.1'},
)

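url: allows passing the document's URL in the constructor. If set, processing starts automatically on initialization. Defaults to None.
positive_keywords: list of patterns to search for positively in classes and ids, for example: ["main", "story"]. Defaults to None.
negative_keywords: list of patterns to search for negatively in classes and ids, for example: ["banner", "adv", "similar", "top-ad"]. Defaults to None.
summary_sentences_qty: maximum number of sentences in the document's summary text. Defaults to 5.
headers: dict of additional custom headers for the GET request that fetches the article's web page. Defaults to None.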
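
The options above say only that the keyword lists are patterns searched for in element classes and ids; how wanish weighs a match is not stated. As one possible reading, the sketch below adds a point per positive pattern and subtracts one per negative pattern. The function `keyword_score` and its scoring rule are assumptions, not the package's code.

```python
import re


def keyword_score(class_and_id, positive_keywords=None, negative_keywords=None):
    """Return +1 per positive pattern and -1 per negative pattern found.

    `class_and_id` is an element's class and id attributes joined into one
    string. Hypothetical helper, not part of wanish.
    """
    score = 0
    for pattern in positive_keywords or []:
        if re.search(re.escape(pattern), class_and_id, re.IGNORECASE):
            score += 1
    for pattern in negative_keywords or []:
        if re.search(re.escape(pattern), class_and_id, re.IGNORECASE):
            score -= 1
    return score


positive = ["main", "story"]
negative = ["banner", "adv", "similar", "top-ad"]
article_score = keyword_score("main-story content", positive, negative)
ad_score = keyword_score("sidebar top-ad banner", positive, negative)
```

Under this rule an element whose class mentions both "main" and "story" scores higher than one whose class mentions "banner" or "top-ad", which matches the intent of the two lists.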

Special thanks

License: BSD License (BSD 3-clause)
Maintainer: gorschal
Version: 0.6.3
