NumPy string encoding

>>> import numpy
>>> my_array = numpy.array(['apple', 'pear'], dtype='S5')
>>> print("Mary has an {} and a {}".format(my_array[0], my_array[1]))
Mary has an b'apple' and a b'pear'
>>> print("Mary has an {} and a {}".format(my_array[0].decode('utf-8'),
...                                        my_array[1].decode('utf-8')))
Mary has an apple and a pear

2 answers

User

Answer 1 · edited 2024-05-15 20:53:31

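This is not much different from decode, but astype works (and can be applied to the whole array rather than to each string). Keep in mind that the converted array is larger and stays in memory for as long as it is needed.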

In [538]: x=my_array.astype('U');"Mary has an {} and a {}".format(x[0],x[1])
Out[538]: 'Mary has an apple and a pear'
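
To make the size remark concrete, here is a minimal sketch (assuming numpy is imported as np) that converts the whole array once and compares the per-element storage of the two dtypes:

```python
import numpy as np

my_array = np.array(['apple', 'pear'], dtype='S5')

# One call converts every element from bytes to str.
x = my_array.astype('U')

sentence = "Mary has an {} and a {}".format(x[0], x[1])
print(sentence)

# Bytes per element: 5 for 'S5', 20 for the unicode copy (4 bytes per character).
print(my_array.itemsize, x.itemsize)
```

The unicode copy takes four times the space here, which only matters for large arrays.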

I could not find anything in the format syntax that forces formatting without the b prefix.
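
A quick check of the standard conversion flags illustrates this. The sketch uses a plain Python bytes object as a stand-in for an 'S' array element (an assumption made to keep the example independent of the NumPy version):

```python
value = b'apple'  # plain Python bytes, standing in for an 'S' array element

# Each standard conversion falls back to str(), repr() or ascii(),
# all of which include the b'' prefix for bytes.
results = [
    "{}".format(value),
    "{!s}".format(value),
    "{!r}".format(value),
    "{!a}".format(value),
]
print(results)

# Decoding first is what removes the prefix.
decoded = "{}".format(value.decode('utf-8'))
print(decoded)
```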

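https://stackoverflow.com/a/19864787/901925 shows how to customize a Formatter class by overriding the format_field method. I tried something similar with the convert_field method, but the calling syntax is still messy.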

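In [562]: def makeU(astr):
   .....:     # decode one bytes element to str
   .....:     return astr.decode('utf-8')
   .....:

In [563]: import string
   .....: class MyFormatter(string.Formatter):
   .....:     def convert_field(self, value, conversion):
   .....:         # custom conversion flag: {!q} decodes bytes to str
   .....:         if conversion == 'q':
   .....:             return makeU(value)
   .....:         else:
   .....:             return super(MyFormatter, self).convert_field(value, conversion)
   .....: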

In [564]: MyFormatter().format("Mary has an {!q} and a {!q}",my_array[0],my_array[1])
Out[564]: 'Mary has an apple and a pear'

Two other ways to format:

In [642]: "Mary has an {1} and a {0} or {1}".format(*my_array.astype('U'))
Out[642]: 'Mary has an pear and a apple or pear'

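This converts the array on the fly and unpacks its elements as positional arguments to format. It also works if the array is already unicode (uarray below is assumed to be a unicode version of my_array):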

In [643]: "Mary has an {1} and a {0} or {1}".format(*uarray.astype('U'))
Out[643]: 'Mary has an pear and a apple or pear'

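np.char (with numpy imported as np) has functions that apply string methods to the elements of a character array. With it, decode can be applied to the whole array: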

In [644]: "Mary has a {1} and an {0}".format(*np.char.decode(my_array))
Out[644]: 'Mary has a pear and an apple'

(This does not work if the array is already unicode.)
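
If the same code has to accept both kinds of array, one option is to check the dtype before decoding. The helper below is a hypothetical sketch (to_str_array is not a NumPy function) and assumes UTF-8 encoded bytes:

```python
import numpy as np

def to_str_array(arr, encoding='utf-8'):
    """Return a unicode ('U') array for either a bytes or a unicode input."""
    arr = np.asarray(arr)
    if arr.dtype.kind == 'S':
        # bytes array: decode every element in one call
        return np.char.decode(arr, encoding)
    # already unicode (or another type): astype('U') is enough
    return arr.astype('U')

my_array = np.array(['apple', 'pear'], dtype='S5')
uarray = np.array(['apple', 'pear'])  # unicode by default in Python 3

from_bytes = "Mary has a {1} and an {0}".format(*to_str_array(my_array))
from_unicode = "Mary has a {1} and an {0}".format(*to_str_array(uarray))
print(from_bytes)
print(from_unicode)
```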

If you work with string arrays a lot, np.char is worth studying.

User

Answer 2 · edited 2024-05-15 20:53:31

Given:

>>> my_array = numpy.array(['apple', 'pear'], dtype = 'S5')

You can decode on the fly:

>>> print("Mary has an {} and a {}".format(*map(lambda b: b.decode('utf-8'), my_array)))
Mary has an apple and a pear
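
When the arguments are a mix of bytes and other types, the same idea can be wrapped in a small function. This is only a sketch; format_decoded is a hypothetical helper name, and UTF-8 is assumed as the default encoding:

```python
import numpy as np

def format_decoded(template, *args, encoding='utf-8'):
    """Format template after decoding any bytes arguments (hypothetical helper)."""
    # Leave non-bytes arguments untouched so numbers and str pass through.
    decoded = [a.decode(encoding) if isinstance(a, bytes) else a for a in args]
    return template.format(*decoded)

my_array = np.array(['apple', 'pear'], dtype='S5')
line = format_decoded("{} has {} {}s and a {}", "Mary", 2, my_array[0], my_array[1])
print(line)
```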

Or you can create a dedicated formatter:

import string

class ByteFormatter(string.Formatter):
    def __init__(self, decoder='utf-8'):
        self.decoder = decoder

    def format_field(self, value, spec):
        # Decode bytes first so that any format spec is applied to the str
        if isinstance(value, bytes):
            value = value.decode(self.decoder)
        return super(ByteFormatter, self).format_field(value, spec)

>>> print(ByteFormatter().format("Mary has an {} and a {}", *my_array))
Mary has an apple and a pear
