如何从python中的Azure函数将xlsx blob读入pandas

2024-06-16 11:26:03 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在从azure函数中的blob读取.xslx数据。我的代码如下所示:

def main(techdatablob: func.InputStream, crmdatablob: func.InputStream, outputblob: func.Out[func.InputStream]):

    # Load in the tech and crm data
    crm_data = pd.read_excel(crmdatablob.read().decode('ISO-8859-1'))
    tech_data = pd.read_excel(techdatablob.read().decode('ISO-8859-1'))
   

问题是,当我尝试解码文件时,出现以下错误:

ValueError: Protocol not known: PK...

在“…”之后还有很多奇怪的字符。关于如何正确读取这些文件有什么想法吗


Tags: 文件函数readdataisoazureexceltech
1条回答
网友
1楼 · 发布于 2024-06-16 11:26:03

请参考我的代码,似乎您不需要添加decode('ISO-8859-1')

import logging
import pandas as pd
import azure.functions as func


def main(techdatablob: func.InputStream, crmdatablob: func.InputStream, outputblob: func.Out[func.InputStream]):
    logging.info(f"Python blob trigger function processed blob \n"
                 f"Name: {techdatablob.name}\n"
                 f"Blob Size: {techdatablob.length} bytes")

    # Load in the tech and crm data
    crm_data = pd.read_excel(crmdatablob.read())
    logging.info(f"{crm_data}")
    tech_data = pd.read_excel(techdatablob.read())
    logging.info(f"{tech_data}")

注意:您的function.json应该是这样的。否则,将发生错误

{
      "name": "techdatablob",
      "type": "blobTrigger",
      "direction": "in",
      "path": "path1/{name}",
      "connection": "example"
    },
    {
      "name": "crmdatablob",
      "dataType": "binary",
      "type": "blob",
      "direction": "in",
      "path": "path2/data.xlsx",
      "connection": "example"
    },
    {
      "name": "outputblob",
      "type": "blob",
      "direction": "out",
      "path": "path3/out.xlsx",
      "connection": "example"
    }

这与function.json之间的区别在于缺少dataType属性

enter image description here

我的测试结果是这样的,似乎没有问题

enter image description here

相关问题 更多 >