一种数据结构,用于基于秩的统计数据的精确在线累积。

tdigest-cffi的Python项目详细描述


T-Digest CFFI

t-digest是一个数据结构 为了准确地在线累积基于排名的统计数据,例如 分位数和修剪平均数,由Ted Dunning设计。

t-摘要构造算法使用一维变量 k-means聚类以生成与 Q-文摘。这种t-摘要数据结构可用于估计分位数 或者计算其他等级统计。T-文摘比 q-digest是t-digest可以处理浮点值,而 q-摘要仅限于整数。只要稍加改动,T-消化就可以 处理任何有序集中具有类似于 卑鄙。由t-摘要产生的分位数估计的准确性可以是 数量级的精度高于 尽管t-digest存储在磁盘上时更紧凑。

这个包提供经过测试、性能良好、线程安全的python 3cffi 与最初为编写的Usman Masood改编的t-digest实现的绑定 redis-tdigest

安装

可以使用pip或包含的^{tt2}安装此包$ 脚本:

# Using pip
pip install tdigest-cffi

# Using setup.py
python setup.py install

用法

fromtdigestimportTDigest,RawTDigest# Thread-safe instance with default compression factordigest=TDigest()# Raw instance with default compression factordigest=RawTDigest()# Thread-safe instance with a custom compression factordigest=TDigest(compression=500)# Digest compressioncompression=digest.compression# Digest weightweight=digest.weight# Centroid countcentroid_count=digest.centroid_count# Compression countcompression_count=digest.compression_count# Insertion with unit weightdigest.insert(1000)# Insertion with custom weightdigest.insert(1000,2)# 99th percentile calculationquantile=digest.quantile(0.99)percentile=digest.percentile(99)# Cumulative distribution functioncdf=digest.cdf(1000)# P(X <= 1000)# Centroid extractionforcentroidindigest.centroids():print(centroid.mean,centroid.weight)# Digest mergingother=TDigest()other.insert(42)digest.merge(other)

许可证

BSD 3-Clause License

Copyright (c) 2018, Phil Demetriou
All rights reserved.

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:

* Redistributions of source code must retain the above copyright notice, this
  list of conditions and the following disclaimer.

* Redistributions in binary form must reproduce the above copyright notice,
  this list of conditions and the following disclaimer in the documentation
  and/or other materials provided with the distribution.

* Neither the name of the copyright holder nor the names of its
  contributors may be used to endorse or promote products derived from
  this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

欢迎加入QQ群-->: 979659372 Python中文网_新手群

推荐PyPI第三方库


热门话题
java根据两个数组的值对数组进行排序   具有自签名证书和NTLM代理的java Maven SSL repo错误   java自定义字体按钮不工作AndroidStudio   java通过Spring MVC web应用程序向客户端发送文本文件   Java Spring Web服务SOAP身份验证   ANT property environment=“env”无法在JAVA中检索它,但如果作为ANT命令运行,则可以正常工作   java是为spring mvc rest api或spring boot api对应用服务器的每个新请求创建的服务、存储库和组件的新实例吗?   java私有静态最终字符串未完成其工作   PKCS12的安全Java密钥重新处理   java JPA继承表每类SQLSyntaxErrorException