暂无图片
暂无图片
暂无图片
暂无图片
暂无图片

实时数据处理与湖仓一体化:Flink CDC + Paimon + Doris详细搭建指南

数码百科 2024-08-23
789

Flink CDC、Paimon 和 Doris 搭建开始之前,

请确保您有以下环境:

•  Linux 服务器若干台(根据集群规模)

•  Java 8 或更高版本

•  MySQL 5.7以上(用于 Flink CDC)


Flink 1.18 安装步骤:

1.  下载 Flink 1.18:

wget https://archive.apache.org/dist/flink/flink-1.18.0/flink-1.18.0-bin-scala_2.12.tgz


2.  解压并设置环境变量:

tar -zxvf flink-1.18.0-bin-scala_2.12.tgz


mv flink-1.18.0 opt/flink


echo 'export FLINK_HOME=/opt/flink' >> etc/profile

echo 'export PATH=$PATH:$FLINK_HOME/bin' >> etc/profile

source etc/profile


Flink CDC 配置步骤:

1.  下载 Flink CDC Connector for Flink 1.18(以 MySQL 为例):

wget https://repo1.maven.org/maven2/com/ververica/flink-connector-mysql-cdc/2.4.1/flink-connector-mysql-cdc-2.4.1.jar


cp flink-connector-mysql-cdc-2.4.1.jar $FLINK_HOME/lib/


2.  创建 Flink CDC 作业配置文件(mysql-to-doris.yaml):

source:

type: mysql

hostname: localhost

port: 3306

username: root

password: 123456

tables: app_db..*

server-id: 5400-5404

server-time-zone: UTC


sink:

type: doris

fenodes: 127.0.0.1:8030

username: root

password: ""

database: test_db

table: sink_table

table.create.properties.light_schema_change: true

table.create.properties.replication_num: 1


pipeline:

name: Sync MySQL Database to Doris

parallelism: 2


3.  提交 Flink CDC 作业:

$FLINK_HOME/bin/flink run -c com.alibaba.ververica.connectors.mysql.MySqlConnector \

-p 2 \

-Dconnector.mysql.hostname=localhost \

-Dconnector.mysql.port=3306 \

-Dconnector.mysql.username=root \

-Dconnector.mysql.password=123456 \

-Dconnector.mysql.database-name=app_db \

-Dconnector.mysql.table-name=.* \

-Dconnector.mysql.server-id=5400-5404 \

-Dconnector.mysql.server-time-zone=UTC \

-Dconnector.doris.fenodes=127.0.0.1:8030 \

-Dconnector.doris.username=root \

-Dconnector.doris.password= \

-Dconnector.doris.database=test_db \

-Dconnector.doris.table=sink_table \

-Dconnector.doris.table.create.properties.light-schema-change=true \

-Dconnector.doris.table.create.properties.replication-num=1 \

-Djob.name=Sync MySQL Database to Doris \

$FLINK_HOME/lib/flink-connector-mysql-cdc-2.4.1.jar


步骤 2: 搭建 Paimon

2.1 安装 Paimon

1.  下载 Paimon for Flink 1.18:

wget https://repository.apache.org/content/groups/snapshots/org/apache/paimon/paimon-flink-1.18/0.5-SNAPSHOT/paimon-flink-1.18-0.5-20230802.034234-105.jar


cp paimon-flink-1.18-0.5-20230802.034234-105.jar $FLINK_HOME/lib/


2.  启动 Flink 集群:

$FLINK_HOME/bin/start-cluster.sh


3.  使用 Flink SQL 客户端:

$FLINK_HOME/bin/sql-client.sh embedded


4.  创建 Paimon Catalog:

CREATE CATALOG paimon_catalog WITH (
 'type' = 'paimon',
 'warehouse' = 'file:///path/to/your/local/warehouse'
);


USE CATALOG paimon_catalog;


步骤 3: 搭建 Doris

3.1 安装 Doris

1.  下载并解压 Doris:

wget https://apache.doris.apache.org/downloads/main/2.1.1/apache-doris-2.1.1.tar.gz


tar -zxvf apache-doris-2.1.1.tar.gz -C opt/module/

mv opt/module/apache-doris-2.1.1 opt/module/doris


2.  配置 FE(Frontend):

vim opt/module/doris/fe/conf/fe.conf


添加或修改以下配置:

LOG_DIR = ${DORIS_HOME}/log

meta_dir = opt/module/doris/doris-meta

priority_networks = 192.168.180.152/24


3.  创建元数据目录:

mkdir -p opt/module/doris/doris-meta


4.  配置环境变量:

echo 'export DORIS_HOME=/opt/module/doris' >> etc/profile


echo 'export PATH=$PATH:$DORIS_HOME/be/bin' >> etc/profile


echo 'export PATH=$PATH:$DORIS_HOME/extensions/apache_hdfs_broker/bin' >> etc/profile


echo 'export PATH=$PATH:$DORIS_HOME/fe/bin' >> /etc/profile


source /etc/profile


5.  启动 FE:

/opt/module/doris/fe/bin/start_fe.sh --daemon


6.  配置 BE(Backend):

vim /opt/module/doris/be/conf/be.conf


添加或修改以下配置:

priority_networks = 192.168.180.153/24

storage_root_path = /opt/module/doris/be/storage.HDD;/opt/module/doris/be/storage.SSD


7.  创建存储目录:

mkdir -p /opt/module/doris/be/storage.HDD

mkdir -p /opt/module/doris/be/storage.SSD


8.  修改 start_be.sh 文件:

cd /opt/module/doris/be/bin/


vim start_be.sh


export JAVA_HOME=/opt/module/jdk1.8.0_212


9.  启动 BE:

/opt/module/doris/be/bin/start_be.sh --daemon


10.  连接 FE 和 BE:

使用 MySQL 客户端连接到 FE:

mysql -uroot -P9030 -p -h192.168.1.152


添加 BE 节点:

ALTER SYSTEM ADD BACKEND "192.168.1.153:9050";


11.  可选部署 Broker:

启动 Broker:

start_broker.sh --daemon


添加 Broker 节点:

ALTER SYSTEM ADD BROKER broker_name "192.168.1.152:8000";

文章转载自数码百科,如果涉嫌侵权,请发送邮件至:contact@modb.pro进行举报,并提供相关证据,一经查实,墨天轮将立刻删除相关内容。

评论