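Problem description
I'm new to Hive and am creating my first table!
For some reason, all non-string values show up as NULL (including INT, BOOLEAN, etc.).
My data looks like this: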
58;"management";"married";"tertiary";"no";2143;"yes";"no";"unknown";5;"may";261;1;-1;0;"unknown";"no"
I used this to create the table:
create external table bank_dataset(
age tinyint,job string,education string,default BOOLEAN,balance INT,housing BOOLEAN,loan BOOLEAN,contact STRING,day STRING,month STRING,duration INT,campaign INT,pdays INT,previous INT,poutcome STRING,y BOOLEAN)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '\u003B'
STORED AS TEXTFILE
location '/user/marchenrisaad_gmail/Bank_Project'
tblproperties("skip.header.line.count"="1");
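Solution
Thanks for your comments! I still have one issue, though. For every row I now get all the data correctly, but it is followed by an extra column of NULL values. My code is below: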
create external table bank_dataset(age TINYINT,job string,education string,default BOOLEAN,balance INT,housing BOOLEAN,loan BOOLEAN,contact STRING,day INT,month STRING,duration INT,campaign INT,pdays INT,previous INT,poutcome STRING,y BOOLEAN)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
"separatorChar" = "\u003B","quoteChar" = '"'
)
STORED AS TEXTFILE
location '/user/marchenrisaad_gmail/Bank_Project'
tblproperties("skip.header.line.count"="1");
Any suggestions?
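Two things in the post may be worth checking. First, the sample row has 17 semicolon-separated fields while both table definitions declare 16 columns; the third field ("married") has no column, so everything after job is shifted by one position. Second, as far as the Hive documentation describes it, OpenCSVSerde reads every column as STRING regardless of the declared type, and the data stores booleans as "yes"/"no" rather than true/false. A minimal sketch of one possible way to handle both, not a tested fix: load everything as STRING and convert in a view. The names bank_dataset_raw, bank_dataset_typed and marital are assumptions made for illustration and do not appear in the original post. Whether this also removes the extra NULL column cannot be determined from the post.

```sql
-- Sketch only. bank_dataset_raw, bank_dataset_typed and the column name
-- "marital" are hypothetical; adjust them to the real header row.

-- Step 1: declare every column as STRING, one column per field in the file.
CREATE EXTERNAL TABLE bank_dataset_raw (
  age STRING,
  job STRING,
  marital STRING,      -- assumed name for the third field ("married" in the sample row)
  education STRING,
  `default` STRING,    -- backticks in case the name is treated as a keyword
  balance STRING,
  housing STRING,
  loan STRING,
  contact STRING,
  `day` STRING,
  `month` STRING,
  duration STRING,
  campaign STRING,
  pdays STRING,
  previous STRING,
  poutcome STRING,
  y STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
  "separatorChar" = "\u003B", "quoteChar" = '"'   -- same properties as in the post
)
STORED AS TEXTFILE
LOCATION '/user/marchenrisaad_gmail/Bank_Project'
TBLPROPERTIES ("skip.header.line.count" = "1");

-- Step 2: expose typed columns through a view.
-- "yes"/"no" are compared explicitly instead of being cast to BOOLEAN.
CREATE VIEW bank_dataset_typed AS
SELECT
  CAST(age AS TINYINT)      AS age,
  job,
  marital,
  education,
  (`default` = 'yes')       AS `default`,
  CAST(balance AS INT)      AS balance,
  (housing = 'yes')         AS housing,
  (loan = 'yes')            AS loan,
  contact,
  CAST(`day` AS INT)        AS `day`,
  `month`,
  CAST(duration AS INT)     AS duration,
  CAST(campaign AS INT)     AS campaign,
  CAST(pdays AS INT)        AS pdays,
  CAST(previous AS INT)     AS previous,
  poutcome,
  (y = 'yes')               AS y
FROM bank_dataset_raw;
```

Running DESCRIBE bank_dataset_raw and selecting a few rows from bank_dataset_typed should show whether the columns now line up with the file.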