有 Java 编程相关的问题?

你可以在下面搜索框中键入要查询的问题!

java Spark临时表不以直线显示

我在AWS EMR有一个spark cluster,并尝试使用thrift server启动以下代码:

...
JavaSparkContext jsc = new JavaSparkContext(SparkContext.getOrCreate());
HiveContext hiveContext = new HiveContext(jsc);
JavaRDD<Person> people = jsc.textFile("people.txt").map(
  new Function<String, Person>() {
    public Person call(String line) throws Exception {
      ...
    }
});
DataFrame schemaPeople = hiveContext.createDataFrame(people, Person.class);
schemaPeople.registerTempTable("people_temp");
schemaPeople.saveAsTable("people");

HiveThriftServer2.startWithContext(hiveContext);
...

我使用以下命令运行此代码: sudo ./sbin/start-thriftserver.sh --jars /home/ec2-user/some.jar --class spark.jobs.thrift.ThriftServerInit

启动thrift server后,我使用直线连接到它: !connect jdbc:hive2://localhost:10001,运行show tables;并获得一个结果:

+--------------+--------------+--+
|  tableName   | isTemporary  |
+--------------+--------------+--+
| people       | false        |
+--------------+--------------+--+

我希望看到一个临时表people_temp。为什么people_temp不存在


共 (1) 个答案