如果先执行,则先删除,然后删除:如何在HIVE中编写此代码?

问题描述

我该如何在HIVE中写这个? 该表具有重复项,并且基于第一个唯一列,我想从新表中排除重复的记录。

Sample tables

 data new;
    set old;
    by Col_1  Col_2date  Col_3date;

    if Col_2date ^=  Col_3date then do;
        if first.Col_3Date ^= 1 then delete;
    end;
run;

解决方法

以下内容如何:

select min(Obs)  as Obs,Col_11,Col_2date,Col_3date
  from your_table
 group by Col_11,Col_3date