Handling large numbers in Python

def get_perplexity(test_set, model):
    perplexity = 1
    n = 0
    for word in test_set:
        n += 1
        perplexity = perplexity * 1 / get_prob(model, word)
    perplexity = pow(perplexity, 1/float(n))
    return perplexity

2 answers

User

#1 · edited 2024-04-26 07:37:22

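Repeated multiplication causes tricky numerical instability, because the running product needs more and more bits to represent and can eventually overflow. I suggest moving to log space and summing instead of multiplying: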

import math

def get_perplexity(test_set, model):
    log_perplexity = 0
    n = 0
    for word in test_set:
        n += 1
        log_perplexity -= math.log(get_prob(model, word))
    log_perplexity /= float(n)
    return math.exp(log_perplexity)

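This way every logarithm fits in a standard float, and the values neither blow up nor lose precision. You can also use the decimal module to get arbitrary precision: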

import decimal

def get_perplexity(test_set, model):
    with decimal.localcontext() as ctx:
        ctx.prec = 100  # set as appropriate
        log_perplexity = decimal.Decimal(0)
        n = 0
        for word in test_set:
            n += 1
            log_perplexity -= decimal.Decimal(get_prob(model, word)).ln()
        log_perplexity /= n  # a Decimal cannot be divided by a float
        return log_perplexity.exp()
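
To see why log space helps, here is a minimal sketch comparing the two approaches. The function names and the `probs` list are hypothetical stand-ins for `get_prob(model, word)` over a test set; they are not part of the original question.

```python
import math

def naive_perplexity(probs):
    # Running product of 1/p, as in the question.
    product = 1.0
    for p in probs:
        product *= 1.0 / p
    return product ** (1.0 / len(probs))

def log_perplexity(probs):
    # Sum of -log(p), averaged over the words, then exponentiated.
    total = -sum(math.log(p) for p in probs)
    return math.exp(total / len(probs))

# Hypothetical data: 200 words, each with probability 0.001.
probs = [0.001] * 200
print(naive_perplexity(probs))  # inf: the product overflowed the float range
print(log_perplexity(probs))    # close to 1000
```

With only 200 words the product already exceeds what a float can hold, while the sum of logs stays near 1381.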

User

#2 · edited 2024-04-26 07:37:22

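Since e+306 just means 10^306, you can write a class that keeps the number in two parts, a float value and a separate power of ten: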

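class BigPowerFloat:
    # The number is stored as value * 10**power.
    STEP_EXPONENT = 100
    POWER_STEP = 10.0 ** STEP_EXPONENT

    def __init__(self, input_value, power=0):
        self.value = float(input_value)
        self.power = power
        self._move_to_power()

    def _move_to_power(self):
        # Move factors of 10**100 out of the float part and into the power.
        # The upper bound stops the loop if value has already overflowed to inf.
        while self.POWER_STEP < abs(self.value) < float("inf"):
            self.value /= self.POWER_STEP
            self.power += self.STEP_EXPONENT
        # you can add a similar loop for very small values

    def __mul__(self, other):
        # other is assumed to be a plain int or float; returns a new object
        return BigPowerFloat(self.value * other, self.power)

    # TODO other operators for /, +, - ...

    def __str__(self):
        # one possible string form; customize as needed
        return "{} * 10**{}".format(self.value, self.power)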
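
Applied to the perplexity question, the same two-part idea could look like the sketch below. It is only an illustration: `perplexity_split` and `probs` are hypothetical names, and the list of probabilities stands in for calling `get_prob(model, word)` on each word. The n-th root is taken on the mantissa and on the power of ten separately, so the full product is never formed.

```python
import math

def perplexity_split(probs):
    # Keep the running product of 1/p as mantissa * 10**exponent.
    mantissa = 1.0
    exponent = 0
    for p in probs:
        mantissa *= 1.0 / p
        # Renormalize so the mantissa stays close to the range [1, 10).
        shift = math.floor(math.log10(mantissa))
        mantissa /= 10.0 ** shift
        exponent += shift
    n = len(probs)
    # n-th root of mantissa * 10**exponent, taken factor by factor.
    return mantissa ** (1.0 / n) * 10.0 ** (exponent / n)

# Hypothetical data: 200 words, each with probability 0.001.
print(perplexity_split([0.001] * 200))  # close to 1000
```

This assumes every probability is greater than 0 and at most 1, so the mantissa never drops below 1.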
