Professional Documents
Culture Documents
Hive Advanced Con
Hive Advanced Con
joins
There are 2 types of joins
a.inner join
b.Outer join
Here default join is innerjoin
Note: to collect information(rows) from
more than one tables(2 tables)
examples
INNERJOIN:
Hive>select e.code,
e.ename,e.esal,d.dname ,d.dloc from
emp e join dept d on(e.dno=d.dno)
examples
Leftouterjoin:
Hive>select e.ecode, e.ename,
e.esal,d.dname, d.dloc from emp e
leftouterjoin dept d on(e.dno=d.dno);
RIGHTOUTERJOIN:
Hive>select e.ecode, e.ename,e.esal,
d.dname, d.dloc from emp e
rightouterjoin dept d
on(e.dno=d.dno)
FULLOUTERJOIN:
Hive>select
e.ecode,e.ename,e.esal,d.dname,
d.dloc from emp e fullouterjoin dept
d on(e.dno=d.dno);
Table to table
Insert overwrite table tablename
select * from
tablename
Hive to hdfs
Insert overwrite directory
directoryname/filename.txt select *
from tablename
Hive to localfilesystem
Insert overwrite local directory file.txt
select * from tablename;
joins
joins:
1) inner join:
first
11
12
13
second
11
13
14
join:
11
13
Outerjoin:
a. LEFTOUTER JOIN
11
12 null
13
b. Rightouterjoin:
11
13
null 14
3) full outer join:
11
12 null
13
null 14
--------------------------------------inner join:
select e.ecode,e.ename,e.esal,d.dname,d.dloc,d.dmid from emp1 e
full outer join dept1 d on (e.deptno=d.deptno);
Hive vs rdbms
Differences with Relational
database
Hiveql providing High latency.It does
not providing index concept because
of streaming access being massive
amount of data.
It does not support transactional
management it supports only batch
There is no updates because we have
write once concept