将PIL图像传递到google cloud vision，无需保存和读取

Traceback (most recent call last): "C:\Users\...\vision_api.py", line 20, in get_text image = vision.Image(content) File "C:\...\venv\lib\site-packages\proto\message.py", line 494, in __init__ raise TypeError( TypeError: Invalid constructor input for Image:b'Ma\x81Ma\x81La\x81Ma\x81Ma\x81Ma\x81Ma\x81Ma\x81Ma\x81Ma\x81Ma\x81La\x81Ma\x81Ma\x81Ma\x81Ma\x80Ma\x81La\x81Ma\x81Ma\x81Ma\x80Ma\x81Ma\x81Ma\x81Ma\x8 ...

image = Image.open(path).convert('RGB') # Opening the saved image cropped_image = image.crop((30, 900, 510, 1200)) # Cropping the image vision_image = vision.Image(# I passed the different options) # Here I need to pass the image, but I don't know how client = vision.ImageAnnotatorClient() response = client.text_detection(image=vision_image) # Text detection using google-vision-api

Attributes: content (bytes): Image content, represented as a stream of bytes. Note: As with all ``bytes`` fields, protobuffers use a pure binary representation, whereas JSON representations use base64. Currently, this field only works for BatchAnnotateImages requests. It does not work for AsyncBatchAnnotateImages requests.

from PIL import Image from io import BytesIO from google.cloud import vision with open('images/screenshots/screenshot.png', 'rb') as image_file: data = image_file.read() try: image = vision.Image(content=data) print('worked') except TypeError: print('failed') im = Image.open('images/screenshots/screenshot.png') buffer = BytesIO() im.save(buffer, format='PNG') try: image = vision.Image(buffer.getvalue()) print('worked') except TypeError: print('failed')

b'\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR\x00\x00\x048\x00\x00\x07\x80\x08\x06\x00\x00\x00+a\xe7\n\x00\x00\x00\x04sBIT\x08\x08\x08\x08|\x08d\x88\x00\x00 \x00IDATx\x9c\xec\xbdy\xd8-\xc7Y\x1f\xf8\xab\xea>\xe7\xdb\xef\xaa\xbbk\xb3%\xcb\x8b\x16[\x12\xc6\xc8\xbb,\x1b\x03\x06\xc6\x8111\x93@2y\xc2381\x8b1\x90\x10\x9e\xf18\x93\x10\x0811\x84\x192\x0c3\x9e\x1020\x03\x03\xc3\xb0\x04\xf0C0\xc6\x96m\xc9\x96m\xed\xb2dI\x96\xaetu\xf7\xed\xdb\xcf\xe9\xae\x9a?j\xe9\xea\xbd\xba\xbb\xbaO\x9f\xef\x9e\xd7\xd6\xfd\xfat\xbf\xf5Vu-o\xbd\xf5\xeb\xb7\xde"\xef\xff\xc7\'8\x1c\x13\x07\x00\xd2\x82\xcc6\xe5\xc6\xa8B&' <class 'bytes'>

b'\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR\x00\x00\x048\x00\x00\x07\x80\x08\x06\x00\x00\x00+a\xe7\n\x00\x01\x00\x00IDATx\x9c\xec\xbdw\x80$\xc7u\x1f\xfc\xab\xea\xeeI\x9bw/\'\x1cr\xce\x04@\x10\x04A\x82`\x84\x95%J"\x95,\xcb\x1f%\x91T\xb0$*}\x1fM\xd9\x96\x95EY\x94(\xc9\xb6\x92i+\x90\x12\x83(3)0\x82\x08$rN\x07\\\xce\xb7\xb7yBw\xd5\xf7G\x85\xaeN3\xdd=\xdd\xb3\xb3{\xfb\xc8\xc3\xceLW\xbd\xca\xaf\xde\xfb\xf5\xabW\xe4{\xdeu\x84\xa3`\xe2\x00@J\xe0Y&\xdf\x00e($\x94\x94\'p\xcc\xc3\xda\xe7Y\x0c\xf1Te\x13\xbf\xcc>\xfa:]Y=x\x84\x7f\xe8\xc23u\x1f\x91l\xfd\x99' <class 'bytes'>

2条回答

网友
1楼 · 编辑于 2024-06-16 11:56:47

据我所知，您从一个PIL Image开始，希望在内存中获得一个PNG图像，而无需访问磁盘。所以你需要这个：
#!/usr/bin/env python3 from PIL import Image from io import BytesIO # Create PIL Image like you have - filled with red im = Image.new('RGB', (320,240), (255,0,0)) # Create in-memory PNG - like you want for Google Cloud Vision buffer = BytesIO() im.save(buffer, format="PNG") # Look at first few bytes PNG = buffer.getvalue() print(PNG[:20])
它会打印出来，如果您将图像以PNG格式写入磁盘，然后将其以二进制格式读回，则会得到这样的结果-除了这一点，它在内存中执行，而不进入磁盘：
b'\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR\x00\x00\x01@'

网友
2楼 · 编辑于 2024-06-16 11:56:47

最好有完整的错误堆栈和更准确的代码段。但形式呈现的信息似乎是两种不同“图像”的混淆。可能是复制/粘贴错误，因为tutorials有完全相同的行：
response = client.text_detection(image=image)
但是前面提到的教程image是由vision.Image()创建的，所以我认为在给出的代码中应该是：
response = client.text_detection(image=vision_image)
至少如果我正确理解了代码片段，image是PIL图像，而vision_image是应该传递给text_detection方法的视觉图像。因此，在vision.Image()中执行的任何操作都不会对错误消息产生影响

相关问题更多 >

编程相关推荐

热门问题

热门文章