一、Canal概述
1、Canal是什么?
Canal是阿里巴巴开源的一个组件,主要用途是基于 MySQL 数据库增量日志解析,提供增量数据订阅和消费。canal的介绍,在github 上的官方文档介绍的很好,我这边就不介绍了。感兴趣的查看git地址:https://github.com/alibaba/canal
2、Canal版本
Canal 1.1.4版本,迎来最重要的WebUI能力,引入canal-admin工程,支持面向WebUI的canal动态 管理能力,支持配置、任务、日志等在线白屏运维能力,具体文档:Canal Admin Guide。
二、Canal同步到ES
1、服务器规划
本地测试准备2台服务器
| 服务器 | 部署的服务 |
|---|---|
| a-lf-bigdata | mysql、canal-server |
| b-lf-bigdata | mysql、canal-admin、canal-adapter、es、kibana |
2、安装mysql
A. 分别在两台机器安装mysql
(1)安装MySQL的yum仓库
yum -y localinstall https://dev.mysql.com/get/mysql80-community-release-el7-3.noarch.rpm
- 1
(2)安装MySQL
yum -y install mysql-community-server
- 1
(3)设置为开机启动
systemctl enable mysqld
- 1
(4)启动MySQL
systemctl start mysqld
- 1
(5)查看MySQL状态
systemctl status mysqld
- 1
(6)查看root临时密码
grep 'temporary password' /var/log/mysqld.log
- 1
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-YEP80Tpf-1631081169438)(/Users/juzi/Library/Application Support/typora-user-images/image-20210907203206489.png)]](/https://i-blog.csdnimg.cn/blog_migrate/3adb34b4077baf2476299a48e5ee8c39.png)
(7)修改root密码
mysql -uroot -p
ALTER USER 'root'@'localhost' IDENTIFIED BY 'Root_12root';
SHOW VARIABLES LIKE 'validate_password%';
set global validate_password.policy=0;
set global validate_password.length=1;
ALTER USER 'root'@'localhost' IDENTIFIED BY 'root%123';
exit
- 1
- 2
- 3
- 4
- 5
- 6
- 7
B. MySQL创建用户
1、在a-lf-bigdata的MySQL创建采集数据的用户
mysql -uroot -p
set global validate_password.policy=0;
set global validate_password.length=1;
CREATE USER canal IDENTIFIED BY '2wsxVFR_';
-- GRANT SELECT, REPLICATION SLAVE, REPLICATION CLIENT ON *.* TO 'canal'@'%';
GRANT ALL PRIVILEGES ON *.* TO 'canal'@'%' ;
FLUSH PRIVILEGES;
exit
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
在a-lf-bigdata的MySQL开启Binlog格式
vi /etc/my.cnf
- 1
增加如下配置
server-id=1
log-bin=mysql-bin
binlog-format=ROW
binlog-ignore-db=information_schema
binlog-ignore-db=mysql
binlog-ignore-db=performance_schema
binlog-ignore-db=sys
- 1
- 2
- 3
- 4
- 5
- 6
- 7
-
log-bin用于指定binlog日志文件名前缀,默认存储在/var/lib/mysql 目录下。
-
server-id用于标识唯一的数据库,不能和别的服务器重复,建议使用ip的最后一段,默认值也不可以。
-
binlog-ignore-db:表示同步的时候忽略的数据库。
-
binlog-do-db:指定需要同步的数据库(如果没有此项,表示同步所有的库)
登录mysql查看:
mysql -uroot -p
show master status;
- 1
- 2
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-zJwXuft1-1631081115828)(/Users/juzi/Library/Application Support/typora-user-images/image-20210907203938217.png)]](/https://i-blog.csdnimg.cn/blog_migrate/40c851fd2c62e4d8270aa911b47d2e4b.png)
如图,binlog就成功开启了。
禁用explicit_defaults_for_timestamp
mysql -uroot -p
SHOW VARIABLES LIKE '%explicit_defaults_for_timestamp%';
set persist explicit_defaults_for_timestamp=0;
SHOW VARIABLES LIKE '%explicit_defaults_for_timestamp%';
- 1
- 2
- 3
- 4
重启MySQL
systemctl status mysqld
- 1
2、在b-lf-bigdata的MySQL创建用户
mysql -uroot -p
set global validate_password.policy=0;
set global validate_password.length=1;
CREATE USER canaladmin IDENTIFIED BY '2wsxVFR_';
GRANT ALL ON canal_manager.* TO 'canaladmin'@'%';
FLUSH PRIVILEGES;
exit
- 1
- 2
- 3
- 4
- 5
- 6
- 7
3、安装ES
1、下载安装包
下载ES:
curl -L -O https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-7.4.0-linux-x86_64.tar.gz
curl -L -O https://artifacts.elastic.co/downloads/kibana/kibana-7.4.0-linux-x86_64.tar.gz
- 1
- 2
2、安装ES
(1)为服务器创建一个普通用户
useradd hadoop
passwd hadoop
- 1
- 2
然后赋予sudo权限
(2)安装ES
- 解压ES
cd /data/liufei/es
tar -zxf elasticsearch-7.4.0-linux-x86_64.tar.gz
ln -s elasticsearch-7.4.0 elasticsearch
- 1
- 2
- 3
- 配置环境变量
# 编辑配置文件
vi /etc/profile
# 新增ES配置
#elasticsearch
export ES_HOME=/data/liufei/es/elasticsearch
export PATH=${ES_HOME}/bin:$PATH
# 是配置生效
source /etc/profile
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 基础配置
调整最大虚拟内存
# 编辑配置文件
vi /etc/sysctl.conf
# 增加配置
vm.max_map_count=262144
- 1
- 2
- 3
- 4
保存退出后执行命令使配置生效
sysctl -p
- 1
- 修改权限
将ES目录的所有者赋予给hadoop用户
chown -R hadoop:hadoop elasticsearch
chown -R hadoop:hadoop elasticsearch-7.4.0
- 1
- 2
切换到hadoop用户
su hadoop
- 1
- 配置ES
# 编辑配置文件
vi $ES_HOME/config/elasticsearch.yml
# 修改配置
network.host: 0.0.0.0
discovery.seed_hosts: [“b-lf-bigdata”]
- 1
- 2
- 3
- 4
- 5
- 6
- 启动ES
$ES_HOME/bin/elasticsearch -d
- 1
- 确认ES启动成功
curl http://b-lf-bigdata:9200
- 1
(2)安装kibana
- 解压kibana
cd /data/liufei/es
tar xzvf kibana-7.4.0-linux-x86_64.tar.gz
ln -s kibana-7.4.0-linux-x86_64 kibana
- 1
- 2
- 3
- 配置环境变量
# 编辑配置文件
vi /etc/profile
# 新增ES配置
#kibana
export KIBANA_HOME=/data/liufei/es/kibana
export PATH=${KIBANA_HOME}/bin:$PATH
# 是配置生效
source /etc/profile
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 配置Kibana
# 编辑配置文件
vi $KIBANA_HOME/config/kibana.yml
# 修改配置
server.host: “0.0.0.0”
elasticsearch.hosts: [“http://b-lf-bigdata:9200”]
- 1
- 2
- 3
- 4
- 5
- 6
- 启动Kibana
nohup $KIBANA_HOME/bin/kibana > $KIBANA_HOME/kibana.out 2>&1 &
- 1
- 查看Kibana状态
http://b-lf-bigdata:5601
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-aQMttZup-1631081115829)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908105624893.png)]](/https://i-blog.csdnimg.cn/blog_migrate/16f588394ac8a4b2c3eab4442f2eaed2.png)
- 新建索引
put /test_test
{
"mappings": {
"properties": {
"name": {
"type": "text"
},
"age": {
"type": "integer"
},
"modified_time": {
"type": "date"
}
}
}
}
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
4、安装Canal
(1)下载Canal
Canal下载地址:https://github.com/alibaba/canal/releases
wget https://github.com/alibaba/canal/releases/download/canal-1.1.5/canal.admin-1.1.5.tar.gz
wget https://github.com/alibaba/canal/releases/download/canal-1.1.5/canal.deployer-1.1.5.tar.gz
wget https://github.com/alibaba/canal/releases/download/canal-1.1.5/canal.adapter-1.1.5.tar.gz
- 1
- 2
- 3
- 4
(2)安装Canal-admin
- 解压
tar -zxf canal.admin-1.1.5.tar.gz
cd canal-admin
- 1
- 2
目录结构如下
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-S3nD9Cyq-1631081115830)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908111454405.png)]](/https://i-blog.csdnimg.cn/blog_migrate/26e46dddff639cf898172ceba1b788db.png)
- 配置环境变量
# 编辑配置文件
vi /etc/profile
# 增加配置
#canal-admin
export CANAL_ADMIN_HOME=/data/liufei/cacal/canal-admin
export PATH=
C
A
N
A
L
A
D
M
I
N
H
O
M
E
<
/
s
p
a
n
>
/
b
i
n
:
<
s
p
a
n
c
l
a
s
s
=
"
t
o
k
e
n
e
n
v
i
r
o
n
m
e
n
t
c
o
n
s
t
a
n
t
"
>
{CANAL_ADMIN_HOME}</span>/bin:<span class="token environment constant">
CANALADMINHOME</span>/bin:<spanclass="tokenenvironmentconstant">PATH
# 使环境变量生效
source /etc/profile
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 修改配置
vi $CANAL_ADMIN_HOME/conf/application.yml
# 修改配置如下
server:
port: 8089
spring:
jackson:
date-format: yyyy-MM-dd HH:mm:ss
time-zone: GMT+8
spring.datasource:
address: b-lf-bigdata:3306
database: canal_manager
username: canaladmin
password: 2wsxVFR_
driver-class-name: com.mysql.jdbc.Driver
url: jdbc:mysql://
s
p
r
i
n
g
.
d
a
t
a
s
o
u
r
c
e
.
a
d
d
r
e
s
s
<
/
s
p
a
n
>
/
<
s
p
a
n
c
l
a
s
s
=
"
t
o
k
e
n
v
a
r
i
a
b
l
e
"
>
{spring.datasource.address}</span>/<span class="token variable">
spring.datasource.address</span>/<spanclass="tokenvariable">{spring.datasource.database}?
useUnicode=true&characterEncoding=UTF-
8&useSSL=false&allowPublicKeyRetrieval=true
hikari:
maximum-pool-size: 30
minimum-idle: 1
#这里指的是canal-server和canal-admin双线通信的用户名密码 canal:
adminUser: admin
adminPasswd: 123456
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
- 21
- 22
- 23
- 24
- 替换MySQL驱动包
因为我们使用的是MySQL 5.7,所以需要使用8的驱动
- 初始化数据库
mysql -uroot -p
source /data/liufei/canal/canal-admin/conf/canal_manager.sql
- 1
- 2
- 启动canal-admin
sh $CANAL_ADMIN_HOME/bin/startup.sh
- 1
查看日志:
[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-lomrs253-1631081115831)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908132642239.png)]
- Canal-admin启动成功,访问地址
http://b-lf-bigdata:8089
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-a6B9Z0jp-1631081115831)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908132827793.png)]](/https://i-blog.csdnimg.cn/blog_migrate/dc410929997b6e6f55abe053aa519d2c.png)
(3)安装Canal-server
- 解压
tar -zxvf canal.deployer-1.1.5.tar.gz
cd canal-server
- 1
- 2
- 配置环境变量
vi /etc/profile
# 新增配置
#canal-server
export CANAL_SERVER_HOME=/home/hadoop/app/canal-server
export PATH=
C
A
N
A
L
S
E
R
V
E
R
H
O
M
E
<
/
s
p
a
n
>
/
b
i
n
:
<
s
p
a
n
c
l
a
s
s
=
"
t
o
k
e
n
e
n
v
i
r
o
n
m
e
n
t
c
o
n
s
t
a
n
t
"
>
{CANAL_SERVER_HOME}</span>/bin:<span class="token environment constant">
CANALSERVERHOME</span>/bin:<spanclass="tokenenvironmentconstant">PATH
# 使环境变量生效
source /etc/profile
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 修改配置
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-pyig8ly3-1631081115831)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908133848856.png)]](/https://i-blog.csdnimg.cn/blog_migrate/1213a4453d5ac8e9f5696eb524726b71.png)
vi canal.properties
# register ip
canal.register.ip = b-lf-bigdata
# canal admin config
canal.admin.manager = b-lf-bigdata:8089
canal.admin.port = 11110
canal.admin.user = admin
canal.admin.passwd = 6BB4837EB74329105EE4568DDA7DC67ED2CA2AD9
# admin auto register
canal.admin.register.auto = true
canal.admin.register.cluster =
canal.admin.register.name =
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 更换MySQL 驱动包
- 启动
$CANAL_SERVER_HOME/bin/startup.sh
- 1
- 查看日志
(4)配置Instance
- 登录canal_admin
- 进入Instance 管理页,新建instance
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-z5kin8mL-1631081115832)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908134211085.png)]](/https://i-blog.csdnimg.cn/blog_migrate/fb5759895c16275f6af1d2f9645de70c.png)
- 载入模版
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-sXDUJQII-1631081115832)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908134240376.png)]](/https://i-blog.csdnimg.cn/blog_migrate/c75214184b9ce6d4eb041e3f89c1c939.png)
- 修改模版
canal.instance.master.address=a-lf-bigdata:3306
canal.instance.dbUsername=canal
canal.instance.dbPassword=2wsxVFR_
- 1
- 2
- 3
- 启动instance,查看日志
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-0BeueDOm-1631081115832)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908134456642.png)]](/https://i-blog.csdnimg.cn/blog_migrate/5b16af3b3ca68468b3e0c039741b57bd.png)
(5)创建调试数据库
在a-lf-bigdata上操作
mysql -uroot -p
创建数据
CREATE DATABASE IF NOT EXISTS test DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
use test;
DROP TABLE IF EXISTS test;
CREATE TABLE test (
uid INT UNSIGNED AUTO_INCREMENT,
name VARCHAR(100) NOT NULL,
age int(3) DEFAULT NULL,
modified_time timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP ON UPDATE
CURRENT_TIMESTAMP,
PRIMARY KEY (uid)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
(6)安装ClientAdapter
- 解压
tar -zxf canal.adapter-1.1.5.tar.gz
cd canal-adapter
- 1
- 2
- 配置环境变量
vi /etc/profile
# 新增配置
#canal-adapter
export CANAL_ADAPTER_HOME=/data/liufei/cacal/canal-adapter
export PATH=
C
A
N
A
L
A
D
A
P
T
E
R
H
O
M
E
<
/
s
p
a
n
>
/
b
i
n
:
<
s
p
a
n
c
l
a
s
s
=
"
t
o
k
e
n
e
n
v
i
r
o
n
m
e
n
t
c
o
n
s
t
a
n
t
"
>
{CANAL_ADAPTER_HOME}</span>/bin:<span class="token environment constant">
CANALADAPTERHOME</span>/bin:<spanclass="tokenenvironmentconstant">PATH
# 使环境变量生效
source /etc/profile
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 修改服务配置
vi $CANAL_ADAPTER_HOME/conf/application.yml
server:
port: 8081
spring:
jackson:
date-format: yyyy-MM-dd HH:mm:ss
time-zone: GMT+8
default-property-inclusion: non_null
canal.conf:
mode: tcp #tcp kafka rocketMQ rabbitMQ
flatMessage: true
zookeeperHosts:
syncBatchSize: 1000
retries: 0
timeout:
accessKey:
secretKey:
consumerProperties:
# canal tcp consumer
canal.tcp.server.host: a-lf-bigdata:11111
canal.tcp.zookeeper.hosts:
canal.tcp.batch.size: 500
canal.tcp.username:
canal.tcp.password:
srcDataSources:
defaultDS:
url: jdbc:mysql://a-lf-bigdata:3306/test?useUnicode=true
username: canal
password: 2wsxVFR_
canalAdapters:
- instance: test_to_es # canal instance Name or mq topic name
groups:- groupId: g1
outerAdapters:- name: logger
# - name: rdb
# key: mysql1
# properties:
# jdbc.driverClassName: com.mysql.jdbc.Driver
# jdbc.url: jdbc:mysql://127.0.0.1:3306/mytest2?useUnicode=true
# jdbc.username: root
# jdbc.password: 121212
# - name: rdb
# key: oracle1
# properties:
# jdbc.driverClassName: oracle.jdbc.OracleDriver
# jdbc.url: jdbc:oracle:thin:@localhost:49161:XE
# jdbc.username: mytest
# jdbc.password: m121212
# - name: rdb
# key: postgres1
# properties:
# jdbc.driverClassName: org.postgresql.Driver
# jdbc.url: jdbc:postgresql://localhost:5432/postgres
# jdbc.username: postgres
# jdbc.password: 121212
# threads: 1
# commitSize: 3000
# - name: hbase
# properties:
# hbase.zookeeper.quorum: 127.0.0.1
# hbase.zookeeper.property.clientPort: 2181
# zookeeper.znode.parent: /hbase - name: es6
hosts: b-lf-bigdata:9300 # 127.0.0.1:9200 for rest mode
properties:
mode: transport # or rest
# # security.auth: test:123456 # only used for rest mode
cluster.name: es
# - name: kudu
# key: kudu
# properties:
# kudu.master.address: 127.0.0.1 # ‘,’ split multi address
- name: logger
- groupId: g1
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
- 21
- 22
- 23
- 24
- 25
- 26
- 27
- 28
- 29
- 30
- 31
- 32
- 33
- 34
- 35
- 36
- 37
- 38
- 39
- 40
- 41
- 42
- 43
- 44
- 45
- 46
- 47
- 48
- 49
- 50
- 51
- 52
- 53
- 54
- 55
- 56
- 57
- 58
- 59
- 60
- 61
- 62
- 63
- 64
- 65
- 66
- 67
- 68
- 69
- 70
- 71
- 72
- 73
- 74
- 75
- 76
修改es6 目录下的配置,新建test_to_es.yml
dataSourceKey: defaultDS
destination: test_to_es
groupId: g1
esMapping:
_index: test_test
_type: _doc
_id: _id
upsert: true
sql: "select a.uid as _id, a.name, a.age, a.modified_time from test a"
commitBatch: 2
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 替换MySQL驱动包
- 启动adapter
$CANAL_ADAPTER_HOME/bin/startup.sh
$CANAL_ADAPTER_HOME/bin/stop.sh
$CANAL_ADAPTER_HOME/bin/restart.sh
- 1
- 2
- 3
(7)测试
- 新增数据
INSERT INTO test.test (name, age) VALUES ("张三",20);
INSERT INTO test.test (name, age) VALUES ("李四",21);
INSERT INTO test.test (name, age) VALUES ("王五",22);
INSERT INTO test.test (name, age) VALUES ("赵六",23);
INSERT INTO test.test (name, age) VALUES ("马七",24);
- 1
- 2
- 3
- 4
- 5
- 查看es
![[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-tOpWA0Bh-1631081115833)(/Users/juzi/Library/Application Support/typora-user-images/image-20210908140415725.png)]](/https://i-blog.csdnimg.cn/blog_migrate/67d6b9a863cb1a585895e23f99d70fc7.png)


2099

被折叠的 条评论
为什么被折叠?



