问题描述
我该如何在HIVE中写这个?
该表具有重复项,并且基于第一个唯一列,我想从新表中排除重复的记录。
data new;
set old;
by Col_1 Col_2date Col_3date;
if Col_2date ^= Col_3date then do;
if first.Col_3Date ^= 1 then delete;
end;
run;
解决方法
以下内容如何:
select min(Obs) as Obs,Col_11,Col_2date,Col_3date
from your_table
group by Col_11,Col_3date