How do I create a DataFrame in Zeppelin using Python PySpark?

Posted 2024-05-19 22:25:39

# Keep only the records that mention 'Bicycle theft'
Bicycletheft_raw_data = Crime_data.filter(lambda x: 'Bicycle theft' in x)
Bicycletheft_raw_data.collect()
print(Bicycletheft_raw_data.count())



df1 = sqlCtx.createDataFrame(Bicycletheft_raw_data, ['CrimeID', 'Month', 'Reportedby', 'Fallswithin', 'Longitude', 'Latitude', 'Location', 'LSOAcode', 'LSOAname', 'Crimetype', 'Lastoutcomecategory'])

Py4JJavaError: An error occurred while calling o742.applySchemaToPythonRDD
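
One plausible cause, which the post does not confirm, is that `Crime_data` is an RDD of raw CSV lines (for example from `sc.textFile`). Each element is then a single string rather than a record with 11 fields, so it cannot be matched against the 11 column names. The sketch below splits each line into a tuple before calling `createDataFrame`. The helper names `parse_line` and `bicycle_theft_dataframe` are made up for illustration, and the assumption that the data is comma-separated text is mine, not the poster's.

```python
import csv

# Column names copied from the createDataFrame call in the question.
COLUMNS = ['CrimeID', 'Month', 'Reportedby', 'Fallswithin', 'Longitude',
           'Latitude', 'Location', 'LSOAcode', 'LSOAname', 'Crimetype',
           'Lastoutcomecategory']


def parse_line(line):
    """Split one CSV line into a tuple of fields (hypothetical helper).

    csv.reader is used instead of str.split(',') so that quoted fields
    containing commas stay in one piece.
    """
    return tuple(next(csv.reader([line])))


def bicycle_theft_dataframe(sql_ctx, crime_data):
    """Build a DataFrame of bicycle-theft rows from an RDD of CSV lines.

    Assumption: crime_data is an RDD of strings, one CSV record per line.
    """
    rows = (crime_data
            .filter(lambda line: 'Bicycle theft' in line)
            .map(parse_line)
            # Drop malformed lines whose field count does not match.
            .filter(lambda fields: len(fields) == len(COLUMNS)))
    return sql_ctx.createDataFrame(rows, COLUMNS)
```

In a Zeppelin `%pyspark` paragraph, where `sqlCtx` is already defined, this could be called as `df1 = bicycle_theft_dataframe(sqlCtx, Crime_data)`. The sketch is written for Python 3; the Python 2 `csv` module does not support Unicode input, so lines with non-ASCII characters would need to be encoded first.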

0 answers

No answers yet.
