Python高效混淆字符串

13 投票

4 回答

19842 浏览

数据工程师

提问于 2025-04-17 02:42

我想把一些Unicode文本搞得复杂一点，这样那些想要提取这些文本的人就会慢一些。理想情况下，我希望能用Python自带的模块或者一个小的附加库来实现；处理后的字符串长度要和原来的相同或者更短；而且“解密”的速度要尽可能快。

我试过各种字符交换和异或运算，但这些方法都很慢。用Base64和十六进制编码会让文本变得更大。目前我找到的最有效的方法是用zlib在最低压缩设置（1）下进行压缩。有没有更好的办法呢？

字符编码数据压缩加密算法 zlib 字符串混淆 unicode文本

4 个回答

使用带有十六进制编码的codecs，比如：

>>> codecs.encode(b'test/jimmy', 'hex')
b'746573742f6a696d6d79'
>>> codecs.decode(b'746573742f6a696d6d79', 'hex')
b'test/jimmy'

回答于 2025-04-17 由 Python大师

分享举报

这个方法使用了一种简单又快速的加密方式，适用于bytes对象。

# For Python 3 - strings are Unicode, print is a function

def obfuscate(byt):
    # Use same function in both directions.  Input and output are bytes
    # objects.
    mask = b'keyword'
    lmask = len(mask)
    return bytes(c ^ mask[i % lmask] for i, c in enumerate(byt))

def test(s):
    data = obfuscate(s.encode())
    print(len(s), len(data), data)
    newdata = obfuscate(data).decode()
    print(newdata == s)


simple_string = 'Just plain ASCII'
unicode_string = ('sensei = \N{HIRAGANA LETTER SE}\N{HIRAGANA LETTER N}'
                  '\N{HIRAGANA LETTER SE}\N{HIRAGANA LETTER I}')

test(simple_string)
test(unicode_string)

这是Python 2的版本：

# For Python 2

mask = 'keyword'
nmask = [ord(c) for c in mask]
lmask = len(mask)

def obfuscate(s):
    # Use same function in both directions.  Input and output are
    # Python 2 strings, ASCII only.
    return ''.join([chr(ord(c) ^ nmask[i % lmask])
                    for i, c in enumerate(s)])

def test(s):
    data = obfuscate(s.encode('utf-8'))
    print len(s), len(data), repr(data)
    newdata = obfuscate(data).decode('utf-8')
    print newdata == s


simple_string = u'Just plain ASCII'
unicode_string = (u'sensei = \N{HIRAGANA LETTER SE}\N{HIRAGANA LETTER N}'
                  '\N{HIRAGANA LETTER SE}\N{HIRAGANA LETTER I}')

test(simple_string)
test(unicode_string)

回答于 2025-04-17 由 Python大师

分享举报

老掉牙的ROT13技巧怎么样呢？

在Python 3中：

>>> import codecs
>>> x = 'some string'
>>> y = codecs.encode(x, 'rot13')
>>> y
'fbzr fgevat'
>>> codecs.decode(y, 'rot13')
u'some string'

在Python 2中：

>>> x = 'some string'
>>> y = x.encode('rot13')
>>> y
'fbzr fgevat'
>>> y.decode('rot13')
u'some string'

对于一个unicode字符串：

>>> x = u'國碼'
>>> print x
國碼
>>> y = x.encode('unicode-escape').encode('rot13')
>>> print y
\h570o\h78op
>>> print y.decode('rot13').decode('unicode-escape')
國碼

回答于 2025-04-17 由 Python大师

分享举报

Python高效混淆字符串

4 个回答

撰写回答