How do I save the HTML report generated by spark-df-profiling to Azure Blob storage?

1 answer

Anonymous user

Answer #1 · posted 2024-05-14 07:51:23

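After reading the source code of julioasotodv/spark-df-profiling, version v1.1.13, I solved this with the code below. First, see the official Azure Databricks documentation to learn how dbutils writes data to a given data source such as Azure Storage.

Here is my sample code. It works with my Azure Databricks workspace and Azure Storage account.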

# Give Spark the access key for the storage account
storage_account_name = '<your storage account name>'
storage_account_access_key = '<your storage account key>'
spark.conf.set(
  "fs.azure.account.key." + storage_account_name + ".blob.core.windows.net",
  storage_account_access_key)

# My sample pandas dataframe for testing
import pandas as pd
d = {'col1': [1, 2], 'col2': [3, 4]}
pd_df = pd.DataFrame(data=d)

import spark_df_profiling
from spark_df_profiling.templates import template
df = spark.createDataFrame(pd_df)
profile = spark_df_profiling.ProfileReport(df)
# Wrap the report body in the package's 'wrapper' template and write the page to Blob storage
dbutils.fs.put("wasbs://<your container name>@" + storage_account_name + ".blob.core.windows.net/test.html", template('wrapper').render(content=profile.html))
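The sample refers to the storage account in two places: the Spark configuration key and the `wasbs://` URL. A minimal sketch of helpers that derive both from the same values, so the two cannot drift apart. The function names `account_key_conf` and `wasbs_url` are hypothetical and are not part of dbutils or spark-df-profiling, and the example values are placeholders.

```python
def account_key_conf(storage_account_name: str) -> str:
    """Spark config key that holds the access key for a Blob storage account."""
    return f"fs.azure.account.key.{storage_account_name}.blob.core.windows.net"


def wasbs_url(container: str, storage_account_name: str, blob_path: str) -> str:
    """wasbs:// URL for a blob inside a container of the given account."""
    return (
        f"wasbs://{container}@{storage_account_name}"
        f".blob.core.windows.net/{blob_path.lstrip('/')}"
    )


# Placeholder values for illustration only, not real resources.
print(account_key_conf("mystorageaccount"))
print(wasbs_url("reports", "mystorageaccount", "test.html"))
```

In a notebook these would be passed to `spark.conf.set(account_key_conf(name), key)` and to `dbutils.fs.put(wasbs_url(container, name, "test.html"), html)`.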

I could tell it worked because the call returned True and reported 29806 bytes written to Azure Blob storage. I then confirmed the file in Azure Storage Explorer.

Hope it helps.
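`dbutils.fs.put` returns a boolean, and its `overwrite` argument defaults to false, so a second run against the same path may fail. A minimal sketch of a wrapper that overwrites by default and raises if the write is not confirmed. `save_html` is a hypothetical name, and `dbutils` is passed in explicitly as an assumption so the function can also be exercised outside a notebook with a stand-in object.

```python
def save_html(dbutils, url: str, html: str, overwrite: bool = True) -> None:
    """Write an HTML string to `url` via dbutils and raise if the write is not confirmed."""
    # dbutils.fs.put(file, contents, overwrite) returns True when the write succeeds.
    ok = dbutils.fs.put(url, html, overwrite)
    if not ok:
        raise IOError(f"dbutils.fs.put did not confirm the write to {url}")
```

In a Databricks notebook the call would look like `save_html(dbutils, "wasbs://<your container name>@<your storage account name>.blob.core.windows.net/test.html", template('wrapper').render(content=profile.html))`.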
