BigQuery: Load from CSV, skip columns

job_body = {
    'projectId': projectId,
    'configuration': {
        'load': {
            'sourceUris': [sourceCSV],
            'schema': {
                'fields': [
                    {
                        'name': 'Field1',
                        'type': 'STRING'
                    },
                    {
                        # this would be the skipped field
                        # (wished-for syntax; 'skip' is not an existing schema option)
                        'name': None,
                        'skip': True
                    },
                    {
                        'name': 'Field2',
                        'type': 'STRING'
                    },
                ]
            },
            'destinationTable': {
                'projectId': projectId,
                'datasetId': datasetId,
                'tableId': targetTableId
            },
        }
    }
}

2 Answers

User

#1 · Edited 2024-05-16 20:25:22

This isn't possible at the moment, but it could make an interesting feature request. Feel free to add it at https://code.google.com/p/google-bigquery/issues/list.

In the meantime, I would do a two-step import:

Import the CSV as a new table with 3 columns.
Append the result of "SELECT column1, column2 FROM [newtable]" to the existing table.
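
The two steps above could be sketched as a pair of job bodies shaped like the `job_body` in the question. This is only an illustration under assumptions: the helper names, the staging table, and the generic `column1`..`column3` names are hypothetical, and the query uses the legacy-SQL bracket syntax from the answer. Submitting the jobs and waiting for the first one to finish is left out.

```python
def build_staging_load_job(project_id, dataset_id, source_csv, staging_table_id):
    """Step 1: load every CSV column into a new staging table."""
    return {
        'projectId': project_id,  # mirrors the shape used in the question
        'configuration': {
            'load': {
                'sourceUris': [source_csv],
                'schema': {
                    'fields': [
                        # Placeholder column names; all three columns are loaded.
                        {'name': 'column1', 'type': 'STRING'},
                        {'name': 'column2', 'type': 'STRING'},
                        {'name': 'column3', 'type': 'STRING'},
                    ]
                },
                'destinationTable': {
                    'projectId': project_id,
                    'datasetId': dataset_id,
                    'tableId': staging_table_id,
                },
            }
        },
    }


def build_append_query_job(project_id, dataset_id, staging_table_id,
                           target_table_id, columns):
    """Step 2: select only the wanted columns and append them to the target."""
    query = 'SELECT {} FROM [{}.{}]'.format(
        ', '.join(columns), dataset_id, staging_table_id)
    return {
        'projectId': project_id,
        'configuration': {
            'query': {
                'query': query,
                'destinationTable': {
                    'projectId': project_id,
                    'datasetId': dataset_id,
                    'tableId': target_table_id,
                },
                # Append to the existing table instead of overwriting it.
                'writeDisposition': 'WRITE_APPEND',
            }
        },
    }
```

Pass whichever columns should be kept as `columns`; the staging table can be deleted once the append job has finished.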

User

#2 · Edited 2024-05-16 20:25:22

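Felipe's suggestion should work. Another possibility, if you are able to modify the CSV that you are loading into BigQuery, is to use the ignoreUnknownValues flag on the load job: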

[Optional] Accept rows that contain values that do not match the schema. The unknown values are ignored. Default is false which treats unknown values as errors. For CSV this ignores extra values at the end of a line. For JSON this ignores named values that do not match any column name.

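However, using this flag would require either reordering the columns in the CSV, so that the unwanted values sit at the end of each line, or formatting the data as JSON.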
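
As a rough sketch of the CSV route, the unwanted columns can be moved to the end of each row before uploading, and the load configuration can then declare only the wanted fields with `ignoreUnknownValues` set. The helper names and the in-memory string handling are assumptions for illustration; a large file would more likely be streamed row by row.

```python
import csv
import io


def move_columns_to_end(csv_text, skip_indexes):
    """Rewrite CSV text so the columns at skip_indexes come last in every row."""
    skip = set(skip_indexes)
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator='\n')
    for row in reader:
        kept = [value for i, value in enumerate(row) if i not in skip]
        dropped = [value for i, value in enumerate(row) if i in skip]
        writer.writerow(kept + dropped)
    return out.getvalue()


def build_load_config(source_csv, project_id, dataset_id, table_id):
    """Load configuration that declares only the wanted leading columns."""
    return {
        'load': {
            'sourceUris': [source_csv],
            'sourceFormat': 'CSV',
            # Extra trailing values in each line are ignored instead of failing.
            'ignoreUnknownValues': True,
            'schema': {
                'fields': [
                    {'name': 'Field1', 'type': 'STRING'},
                    {'name': 'Field2', 'type': 'STRING'},
                ]
            },
            'destinationTable': {
                'projectId': project_id,
                'datasetId': dataset_id,
                'tableId': table_id,
            },
        }
    }
```

For the three-column file in the question, `move_columns_to_end(text, [1])` would push the middle column to the end so that the two-field schema lines up with the first two values of each row.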
