为提高效率,提问时请提供以下信息,问题描述清晰可优先响应。
【DM版本】DM8
【操作系统】CentOS 8
【CPU】X86
【问题描述】
实时主备集群/读写分离集群,集群节点的数据库实例服务、数据守护进程,都不能开机自启。
【安装规划1】
实时主备集群、读写分离集群部署完成后,集群节点的数据库实例服务、数据守护进程,开机都能正常自启。
节点1
192.168.1.173 对外服务IIP
192.168.1.173 心跳IP
节点2
192.168.1.174 对外服务IIP
192.168.1.174 心跳IP
[dmdba@dm03 ~]$ cat /dmdata/DAMENG/dmmal.ini
MAL_CHECK_INTERVAL = 10
MAL_CONN_FAIL_INTERVAL = 10
MAL_TEMP_PATH = /dmdata/malpath
MAL_BUF_SIZE = 512
MAL_SYS_BUF_SIZE = 2048
MAL_COMPRESS_LEVEL = 0
[MAL_INST1]
MAL_INST_NAME = GRP1_RT_01
MAL_HOST = 192.168.1.173
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.173
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
[MAL_INST2]
MAL_INST_NAME = GRP1_RT_02
MAL_HOST = 192.168.1.174
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.174
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
【安装规划2】
实时主备集群、读写分离集群部署完成后,集群节点的数据库实例服务、数据守护进程,开机都不能自启。
节点1
192.168.1.173 对外服务IIP
10.0.0.3 心跳IP
节点2
192.168.1.174 对外服务IIP
10.0.0.4 心跳IP
【疑问1】
安装规划1进行改善,增加心跳IP地址,dmmal.ini文件修改为安装规划2的dmmal.ini
测试结论:开机任然都能正常自启。
【疑问2】
安装规划2,由于开机不能自启,取消心跳IP地址,dmmal.ini文件修改为安装规划1的dmmal.ini
测试结论:开机却还是不能自启。
【疑问3】it's not local ip是什么意思?
[root@dm03 ~]# hostname -I
192.168.1.173 10.0.0.3
[dmdba@dm03 ~]$ cat /dmdata/DAMENG/dmmal.ini
MAL_CHECK_INTERVAL = 10
MAL_CONN_FAIL_INTERVAL = 10
MAL_TEMP_PATH = /dmdata/malpath
MAL_BUF_SIZE = 512
MAL_SYS_BUF_SIZE = 2048
MAL_COMPRESS_LEVEL = 0
[MAL_INST1]
MAL_INST_NAME = GRP1_RT_01
MAL_HOST = 10.0.0.3
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.173
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
[MAL_INST2]
MAL_INST_NAME = GRP1_RT_02
MAL_HOST = 10.0.0.4
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.174
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
[root@dm03 ~]# cat /etc/sysconfig/network-scripts/ifcfg-enp0s3
TYPE=Ethernet
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=static
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
NAME=enp0s3
UUID=cb526a34-9c21-4385-b953-c97e189aaa33
DEVICE=enp0s3
ONBOOT=yes
IPADDR=192.168.1.173
NETMASK=255.255.255.0
GATEWAY=192.168.1.1
DNS1=192.168.1.1
[root@dm03 ~]# cat /etc/sysconfig/network-scripts/ifcfg-enp0s8
TYPE=Ethernet
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=static
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
NAME=enp0s8
UUID=f68a5ae1-2eca-4860-8ccb-285cdeb54764
DEVICE=enp0s8
ONBOOT=yes
IPADDR=10.0.0.3
NETMASK=255.255.255.0
[root@dm03 ~]# ifconfig
enp0s3: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 192.168.1.173 netmask 255.255.255.0 broadcast 192.168.1.255
inet6 fe80::a00:27ff:feeb:399e prefixlen 64 scopeid 0x20<link>
ether 08:00:27:eb:39:9e txqueuelen 1000 (Ethernet)
RX packets 1010 bytes 82715 (80.7 KiB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 744 bytes 268409 (262.1 KiB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
enp0s8: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 10.0.0.3 netmask 255.255.255.0 broadcast 10.0.0.255
inet6 fe80::a00:27ff:fed7:6ac prefixlen 64 scopeid 0x20<link>
ether 08:00:27:d7:06:ac txqueuelen 1000 (Ethernet)
RX packets 206 bytes 14928 (14.5 KiB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 47 bytes 3644 (3.5 KiB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536
inet 127.0.0.1 netmask 255.0.0.0
inet6 ::1 prefixlen 128 scopeid 0x10<host>
loop txqueuelen 1000 (Local Loopback)
RX packets 24 bytes 2040 (1.9 KiB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 24 bytes 2040 (1.9 KiB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
【安装规划2——报错——演示步骤】
【场景1】
根据安装规划2安装实时主备集群、读写分离集群,安装后,集群节点的数据库实例服务、数据守护进程,开机都不能自启。
[dmdba@dm03 ~]$ cat /dmdata/DAMENG/dmmal.ini
MAL_CHECK_INTERVAL = 10
MAL_CONN_FAIL_INTERVAL = 10
MAL_TEMP_PATH = /dmdata/malpath
MAL_BUF_SIZE = 512
MAL_SYS_BUF_SIZE = 2048
MAL_COMPRESS_LEVEL = 0
[MAL_INST1]
MAL_INST_NAME = GRP1_RT_01
MAL_HOST = 10.0.0.3
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.173
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
[MAL_INST2]
MAL_INST_NAME = GRP1_RT_02
MAL_HOST = 10.0.0.4
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.174
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
2025-01-02 14:55:09.466 [ERROR] database P0000000988 T0000000000000000988 self instance(GRP1_RT_01) mal_ip(10.0.0.3) config error, it's not local ip
2025-01-02 14:55:09.503 [INFO] database P0000000988 T0000000000000000988 INI parameter DPC_2PC changed, the original value 1, new value 0
2025-01-02 14:55:10.624 [FATAL] database P0000000988 T0000000000000000988 dmserver startup failed, code = -2158 [Invalid IP]
2025-01-02 14:55:10.677 [FATAL] database P0000000988 T0000000000000000988 nsvr_ini_file_read failed, [code: -2158]
【场景2】
根据安装规划2安装实时主备集群、读写分离集群,安装后,集群节点的数据库实例服务、数据守护进程,开机都不能自启。
根据安装规划1调整dmmal.ini文件,问题一样。
[dmdba@dm03 ~]$ cat /dmdata/DAMENG/dmmal.ini
MAL_CHECK_INTERVAL = 10
MAL_CONN_FAIL_INTERVAL = 10
MAL_TEMP_PATH = /dmdata/malpath
MAL_BUF_SIZE = 512
MAL_SYS_BUF_SIZE = 2048
MAL_COMPRESS_LEVEL = 0
[MAL_INST1]
MAL_INST_NAME = GRP1_RT_01
MAL_HOST = 192.168.1.173
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.173
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
[MAL_INST2]
MAL_INST_NAME = GRP1_RT_02
MAL_HOST = 192.168.1.174
MAL_PORT = 5336
MAL_INST_HOST = 192.168.1.174
MAL_INST_PORT = 5236
MAL_DW_PORT = 5436
MAL_INST_DW_PORT = 5536
2025-01-02 15:03:23.897 [ERROR] database P0000000979 T0000000000000000979 self instance(GRP1_RT_01) mal_ip(192.168.1.173) config error, it's not local ip
2025-01-02 15:03:23.945 [INFO] database P0000000979 T0000000000000000979 INI parameter DPC_2PC changed, the original value 1, new value 0
2025-01-02 15:03:25.115 [FATAL] database P0000000979 T0000000000000000979 dmserver startup failed, code = -2158 [Invalid IP]
2025-01-02 15:03:25.200 [FATAL] database P0000000979 T0000000000000000979 nsvr_ini_file_read failed, [code: -2158]
您好 开机后,手动启动能成功吗
如果手动启动成功 加上您提供的信息 猜测可能是达梦相关服务启动早于网卡相关服务。建议在开机启动项目中,把达梦服务启动放在最后面试试。