基于CentOS的HBase,Hive,Kylin,ZooKeeper,Kafka分布式集群搭建过程

一. HBase环境搭建

1. 编辑/opt/hbase-1.3.1/conf/hbase-env.sh,如下所示:

export JAVA_HOME=/usr/local/jdk1.8.0_141  
export HBASE_HOME=/opt/hbase-1.3.1  
export HBASE_LOG_DIR=/home/ssw/Data/hbase-logs
export HBASE_MANAGES_ZK=true
2. 编辑 /opt/hbase-1.3.1/conf/hbase-site.xml,如下所示:
<configuration>  
    <property>  
        <name>hbase.rootdir</name>
        <value>hdfs://master:9000/hbase</value>  
    </property>  
  
    <property>  
        <name>hbase.cluster.distributed</name>  
        <value>true</value>  
    </property>  
  
    <property>  
        <name>hbase.master</name>  
        <value>master:60000</value>  
    </property>  
  
    <property>  
        <name>hbase.zookeeper.quorum</name>  
        <value>master,slave1,slave2</value>  
    </property>  
  
    <property>  
        <name>hbase.zookeeper.property.dataDir</name>  
        <value>/home/ssw/Data/zookeeper</value>  
    </property>  
</configuration> 
3. 编辑 /opt/hbase-1.3.1/conf/regionservers,如下所示:
Slave1  
Slave2
4. 启动HBase集群
/opt/hbase-1.3.1/bin/start-hbase.sh
5. 查看集群是否启动成功

Master:HQuorumPeer;HMaster。Slave:HQuorumPeer;HRegionServer。


二. Hive环境搭建

1. 安装MysqL

wget http://repo.MysqL.com/MysqL-community-release-el7-5.noarch.rpm
sudo rpm -ivh MysqL-community-release-el7-5.noarch.rpm
sudo yum install MysqL-server
sudo chown -R root:root /var/lib/MysqL
service MysqLd restart
$ MysqL -u root
MysqL > use MysqL;
MysqL > update user set password=password('root') where user='root';
MysqL > exit;
service MysqLd restart
2. 为Hive建立相应的MysqL账户,并赋予足够的权限,如下所示:
create user 'hive' identified by 'MysqL';
grant all privileges on *.* to 'hive'@'%' with grant option;
flush privileges;
service MysqLd restart

3.建立Hive专用元数据库,用创建的hive账号登录,如下所示:

MysqL -uhive -p'MysqL'
MysqL> create database hive;

4.加载MysqL驱动包

mysql-connector-java-5.1.44-bin.jar放$HIVE_HOME$/lib目录下面。

5. 编辑/etc/profile,如下所示:

export HIVE_HOME=/opt/hive-2.3.0
export PATH=$PATH:$HIVE_HOME/bin

6. 编辑hive-config.sh,如下所示:

export JAVA_HOME=/usr/local/jdk1.8.0_141
export HIVE_HOME=/opt/hive-2.3.0
export HADOOP_HOME=/opt/hadoop-2.7.3
7. 编辑hive-site.xml,如下所示:
<property>
     <name>javax.jdo.option.ConnectionURL</name>
     <value>jdbc:MysqL://localhost:3306/hive?createDatabaseIfNotExist=true</value>
</property>
<property>
     <name>javax.jdo.option.ConnectionDriverName</name>
     <value>com.MysqL.jdbc.Driver</value>
</property>
<property>
     <name>javax.jdo.option.ConnectionUserName</name>
     <value>hive</value>
</property>
<property>
     <name>javax.jdo.option.ConnectionPassword</name>
     <value>MysqL</value>
</property>
8. 编辑hive-env.sh,如下所示:
export HOME==/opt/hadoop-2.7.3
export HIVE_CONF_DIR=/opt/hive-2.3.0/conf
export HIVE_AUX_JARS_PATH=/opt/hive-2.3.0/lib

9.验证Hive配置是否成功,如下所示:

hive> create table test_table (id int,name string,no int);
MysqL -uhive -p'MysqL'; show databases; use hive; show tables; select * from TBLS;

说明:Hive将元数据库存储在MysqL中,元数据包括表的名字,表的列和分区及其属性,表的属性(是否为外部表

等),表的数据所在目录。


三. Kylin环境搭建


四. ZooKeeper环境搭建


五. Kafka环境搭建


六. 遇到的问题

1.create directory /tmp/hive/XXX. Name node is in safe mode.

解析:hdfs dfsadmin -safemode leave

2.Failed: SemanticException org.apache.hadoop.hive.ql.Metadata.HiveException: java.lang.RuntimeException:

Unable to instantiate org.apache.hadoop.hive.ql.Metadata.SessionHivemetastoreClient.

解析:用hive -hiveconf hive.root.logger=DEBUG,console排查问题。

3.org.apache.hadoop.hive.metastore.api.MetaException: Version information not found in metastore.


参考文献:

[1]Kylin安装配置实验:http://blog.csdn.net/wzy0623/article/details/51283352

[2] Access denied for user 'hive'@'localhost' :http://blog.csdn.net/adminyou/article/details/48328417

[3]Hive完全分布式集群安装过程(元数据库:MysqL):http://www.aboutyun.com/thread-6902-1-1.html

[4]Hive常见问题汇总:http://www.jb51.cc/article/p-hbqhenbv-bgk.html

相关文章

Centos下搭建性能监控Spotlight
CentOS 6.3下Strongswan搭建IPSec VPN
在CentOS6.5上安装Skype与QQ
阿里云基于centos6.5主机VPN配置
CentOS 6.3下配置multipah
CentOS安装、配置APR和tomcat-native