Create a table in Hive that uses ORC as its storage format:
create table user(id int,name string) stored as orc;
Write the file from Spark:
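// JSONObject.fromObject matches json-lib's API, so net.sf.json is assumed here
import net.sf.json.JSONObject
import org.apache.spark.sql.Row
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}

// Input file: one JSON object per line, each with an id and a name field
val jsons = "hdfs://localhost:9000/test/artist_orc.json"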
val people = sc.textFile(jsons)
val schemaString = "id name"
val schema = StructType(schemaString.split(" ").map { fieldName =>
  // name is a string column; every other field (here only id) is an integer
  if (fieldName == "name") StructField(fieldName, StringType, true)
  else StructField(fieldName, IntegerType, true)
})
// Parse each line as JSON, then build a Row whose fields match the schema order
val rowRDD = people.map(line => JSONObject.fromObject(line))
  .map(p => Row(p.getInt("id"), p.get("name")))
val hiveContext = new org.apache.spark.sql.hive.HiveContext(sc)
val peopleSchemaRDD = hiveContext.createDataFrame(rowRDD, schema)
peopleSchemaRDD.write.format("orc").save("hdfs://localhost:9000/user/xb/warehouse/artist_orc/adf")