hadoop - How to delete duplicate records from Hive table?


I am trying to learn how to delete duplicate records from a Hive table.

My Hive table is 'dynpart', with columns: id, name, technology

id  name  technology
1   abcd  hadoop
2   efgh  java
3   ijkl  mainframes
2   efgh  java

We have options like 'distinct' to use in a select query, but a select query only retrieves data from the table. Can anyone tell me how to use a delete query to remove the duplicate rows from a Hive table?

I am sure it is not recommended, or not standard, to delete/update records in Hive, but I want to learn how to do it.

You can use an insert overwrite statement to rewrite the table with only the distinct rows:

insert overwrite table dynpart select distinct * from dynpart;
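If you want to see which rows are duplicated before rewriting the table, or if duplicates should be decided by a key column rather than the whole row, something like the sketch below may help. It assumes that 'dynpart' is not partitioned (a partitioned table would likely need a partition clause on the insert), that your Hive version supports windowing functions, and that 'id' is the column that identifies a record. None of that is stated in the question, so adjust it to your schema.

```sql
-- Step 1 (optional): list the rows that occur more than once.
SELECT id, name, technology, COUNT(*) AS copies
FROM dynpart
GROUP BY id, name, technology
HAVING COUNT(*) > 1;

-- Step 2: keep exactly one row per id and overwrite the table with the result.
-- Assumption: 'id' identifies a record; the ORDER BY only makes the choice
-- of the surviving row deterministic when rows with the same id differ.
INSERT OVERWRITE TABLE dynpart
SELECT id, name, technology
FROM (
  SELECT id, name, technology,
         ROW_NUMBER() OVER (PARTITION BY id ORDER BY name, technology) AS rn
  FROM dynpart
) ranked
WHERE rn = 1;
```

With the sample data above, step 1 should report the row (2, efgh, java) with two copies, and step 2 should leave one row each for ids 1, 2 and 3. When the duplicates are exact copies of the whole row, the simpler select distinct version gives the same result.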
