How to compose multiple files in a GCS bucket using a Cloud Function trigger

import logging
from google.cloud import storage as gcs

bucket_id = 'bucketname'
client = gcs.Client()
bucket = client.get_bucket(bucket_id)
name =  # value omitted in the original post
date =  # value omitted in the original post
outfile = f'bucketname/{name}_{date}.CSV'
blobs = []
for shard in ('XX', 'XY', 'XZ'):
    sfile = f'{name}{shard}_{date}'
    blob = bucket.blob(sfile)
    if not blob.exists():
        # this causes a retry in 60s
        raise ValueError(f'branch {sfile} not present')
    blobs.append(blob)
bucket.blob(outfile).compose(blobs)
logging.info(f'Successfully created {outfile}')
for blob in blobs:
    blob.delete()
logging.info('Deleted {} blobs'.format(len(blobs)))

1 answer

User

#1 · Posted 2024-04-26 03:05:16

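As far as I know, the Cloud Function is triggered by the google.storage.object.finalize event on an object in a specific GCS bucket.

In that case, your Cloud Function "signature" looks like this (taken from the Medium article you mentioned):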

def compose_shards(data, context):

data is a dictionary holding many details about the object (file). See some details here: Google Cloud Storage Triggers

For example, data["name"] is the name of the object that triggered the event.
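As a minimal sketch of reading that dictionary: the helper name describe_event and the sample payload below are hypothetical, and the "bucket" key is assumed to be present alongside "name" in the finalize event payload.

```python
def describe_event(data, context=None):
    """Return the bucket and object name carried by a finalize event payload."""
    # "name" is the object name; "bucket" is assumed to hold the bucket name.
    bucket_name = data["bucket"]
    object_name = data["name"]
    return bucket_name, object_name


# Hypothetical payload, trimmed to the two fields used here.
sample_event = {"bucket": "bucketname", "name": "reportXX_20240426"}
print(describe_event(sample_event))
```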

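If you know the pattern/template/rule by which those objects/shards are named, you can extract the relevant elements from the object/shard name and use them to build the target object/file name.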
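A rough sketch of that extraction, assuming the shards follow the {name}{shard}_{date} pattern from the question, with shard suffixes XX, XY and XZ and no underscore inside the date. The function name plan_compose and the sample object names are hypothetical, and the target name omits the 'bucketname/' prefix used in the question.

```python
import re

# Shard suffixes taken from the question; the pattern itself is an assumption.
SHARDS = ("XX", "XY", "XZ")
SHARD_PATTERN = re.compile(r"^(?P<name>.+)(?P<shard>XX|XY|XZ)_(?P<date>[^_]+)$")


def plan_compose(object_name):
    """Derive the shard names and the target name from one shard's name.

    Returns None when the object name does not look like a shard, for
    example when the event was fired by the composed output file itself.
    """
    match = SHARD_PATTERN.match(object_name)
    if match is None:
        return None
    name, date = match.group("name"), match.group("date")
    sources = [f"{name}{shard}_{date}" for shard in SHARDS]
    target = f"{name}_{date}.CSV"
    return sources, target


print(plan_compose("reportXY_20240426"))
print(plan_compose("report_20240426.CSV"))
```

Inside compose_shards(data, context), plan_compose(data["name"]) would then supply the names to pass to bucket.blob(...), so the function no longer needs hard-coded name and date values. Returning None for non-shard names matters because writing the composed file is itself likely to fire another finalize event.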
