How do I store random bytes in a file using a character encoding?

Entropy = 6.251272 bits per byte. Optimum compression would reduce the size of this 471812 byte file by 21 percent. Chi square distribution for 471812 samples is 6545600.65, and randomly would exceed this value less than 0.01 percent of the times. Arithmetic mean value of data bytes is 138.9331 (127.5 = random). Monte Carlo value for Pi is 3.173294335 (error 1.01 percent). Serial correlation coefficient is 0.162915 (totally uncorrelated = 0.0).

Entropy = 7.999373 bits per byte. Optimum compression would reduce the size of this 313417 byte file by 0 percent. Chi square distribution for 31347 samples is 272.63, and randomly would exceed this value 25.00 percent of the times. Arithmetic mean value of data bytes is 127.6336 (127.5 = random). Monte Carlo value for Pi is 3.149475458 (error 0.25 percent). Serial correlation coefficient is -0.001209 (totally uncorrelated = 0.0).
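
For readers unfamiliar with the first figure in each report, here is a rough sketch of how a bits-per-byte entropy value can be computed. It assumes the figure is the Shannon entropy of the byte frequencies; the function name `entropy_bits_per_byte` and the sample inputs are made up for illustration and are not taken from ent itself.

```python
import math
from collections import Counter

def entropy_bits_per_byte(data):
    """Shannon entropy of a bytes object, in bits per byte (hypothetical helper)."""
    if not data:
        return 0.0
    total = len(data)
    counts = Counter(data)  # iterating over bytes yields ints 0-255
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())

# Every byte value appears equally often: the maximum of 8 bits per byte.
uniform = bytes(range(256)) * 4
# A single repeated value carries no information.
constant = bytes(1024)

print(entropy_bits_per_byte(uniform))   # 8.0
print(entropy_bits_per_byte(constant))  # 0.0
```

A file whose bytes are not uniformly distributed, for example because a text encoding restricts which byte values can appear, scores below 8.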

2 answers

User

#1 · edited 2024-05-29 04:10:42

Here is an example (in Python 3):

# check if the characters are matching Unicode
l1 = [chr(i) for i in range(128, 160)]
print("{}\n".format(l1))

s1 = " ".join(l1)

# display these characters for visual comparison
# before writing them to file
print("INITIAL:")
print(s1)

# write the UTF-8 encoded text in binary mode
with open("somefile", "wb") as pf:
    pf.write(s1.encode("utf-8"))

# read the raw bytes back
with open("somefile", "rb") as po:
    out = po.read()

s2 = out.decode('utf-8')

# display these characters for visual comparison    
# after writing them to file and reading them from it
print("AFTER:")
print(s2)

This tests two theories:

Can the characters 128 to 159 be encoded?
Can we write all of the data to a file in binary form?

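In the first demonstration, we can clearly see that the data does match the Unicode character map.

As for the second theory, we can clearly write the data in binary mode and read it back in its original form, as the script's output shows.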
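
One caveat, offered as a minimal sketch rather than as part of the original answer: the round trip works because UTF-8 is reversible, but the bytes on disk are not the values 128 to 159 themselves. UTF-8 encodes each of these code points as two bytes, which can be checked directly:

```python
# Compare the UTF-8 encoding of a code point with a single raw byte.
encoded = chr(128).encode("utf-8")  # text -> bytes via UTF-8
raw = bytes([128])                  # one byte with the value 128

print(encoded)       # b'\xc2\x80'
print(len(encoded))  # 2
print(raw)           # b'\x80'
print(len(raw))      # 1
```

So a file written this way holds different, and more, bytes than the values that were generated, which matters if the goal is to store random byte values unchanged.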

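User

#2 · edited 2024-05-29 04:10:42

It looks like timakro had the answer (thanks):

"To write a binary file, you should open it in binary mode, open(filename, "wb"), and write bytes-like objects to it. For example, to write a byte with the value 123: file.write(bytes([123]))." - timakro

When I write bytes([byte value from 0-255]) to the file, it gets the randomness scores that the ent program expects. So I changed Python 2's chr() to bytes() so that the program stores bytes in Python 3. No character encoding is needed.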
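
A minimal sketch of that approach, assuming the byte values come from Python's `random` module. The helper name `write_random_bytes`, the file name `random.bin`, and the seed are hypothetical and are not from the original program.

```python
import os
import random
import tempfile

def write_random_bytes(path, count, seed=None):
    """Write `count` pseudo-random byte values (0-255) to `path` in binary mode."""
    rng = random.Random(seed)
    with open(path, "wb") as f:
        for _ in range(count):
            # bytes([n]) is a single byte with the value n; no encoding is involved
            f.write(bytes([rng.randrange(256)]))

# Hypothetical output location in a fresh temporary directory.
path = os.path.join(tempfile.mkdtemp(), "random.bin")
write_random_bytes(path, 1000, seed=0)

size = os.path.getsize(path)
print(size)  # 1000
```

Each value ends up as exactly one byte on disk, so the file size equals the number of values written. The `random` module is not suitable for cryptographic use; `os.urandom(count)` returns a bytes object that can be written the same way.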
