为提高效率,提问时请提供以下信息,问题描述清晰可优先响应。
【DM版本】:dm8_20231226_HWarm920_kylin10_sp1_64.iso
【操作系统】:kylin10_sp1 (虚拟机部署)
【CPU】: 鲲鹏920
【问题描述】*:搭建达梦数据库dsc集群时,可以正常拉起ccs和asm服务,但是无法拉起DSC实例
下面是拉起DSC实例出现的提示:
[dmdba@dm-dsc01 bin]$ ./dmserver /home/dmdba/dm8/config/dsc1/dm.ini dcr_ini=/home/dmdba/dm8/config/dmdcr.ini
file dm.key not found, use default license!
version info: develop
DM Database Server 64 V8 03134284132-20231226-213242-20081 startup...
Normal of FAST
Normal of DEFAULT
Normal of RECYCLE
Normal of KEEP
Normal of ROLL
Database mode = 0, oguid = 0
License will expire on 2024-12-26
hlck_sys_init, init g_drm_dest:[0, 1]
lbs_sys_init, the length of g_master_map is 1117, fill it use ok_ep_arr:[0, 1], n_ok_ep:2!
check CSS cmd: START NOTIFY, cmd_seq: 12
Control Node change from 255 to 254
check CSS cmd: DCR_LOAD, cmd_seq: 13
check CSS cmd: EP START, cmd_seq: 16
Control Node change from 254 to 0
[dmdba@dm-dsc01 bin]$
css信息如下:
[dmdba@dm-dsc01 bin]$ ./dmcss dcr_ini=/home/dmdba/dm8 /config/dmdcr.ini
DMCSS V8
DMCSS IS READY
[2024-07-05 16:21:00:398] [CSS]: 设置EP CSS1[0]为控制节点
[2024-07-05 16:22:18:406] [ASM]: 设置EP ASM1[0]为控制节点
[2024-07-05 16:22:18:411] [ASM]: 设置命令[START NOTIFY], 目标站点 ASM1[0], 命令序号[2]
[2024-07-05 16:22:19:415] [ASM]: 设置命令[EP START], 目标站点 ASM1[0], 命令序号[3]
[2024-07-05 16:22:20:448] [ASM]: 设置命令[NONE], 目标站点 ASM1[0], 命令序号[0]
[2024-07-05 16:22:20:571] [ASM]: 设置命令[EP START], 目标站点 ASM2[1], 命令序号[5]
[2024-07-05 16:22:23:387] [ASM]: 设置命令[NONE], 目标站点 ASM2[1], 命令序号[0]
[2024-07-05 16:22:23:495] [ASM]: 设置命令[EP OPEN], 目标站点 ASM1[0], 命令序号[10]
[2024-07-05 16:22:23:498] [ASM]: 设置命令[EP OPEN], 目标站点 ASM2[1], 命令序号[11]
[2024-07-05 16:22:24:432] [ASM]: 设置命令[NONE], 目标站点 ASM1[0], 命令序号[0]
[2024-07-05 16:22:28:387] [ASM]: 设置命令[NONE], 目标站点 ASM2[1], 命令序号[0]
[2024-07-05 16:22:28:390] [ASM]: 设置命令[EP REAL OPEN], 目标站点 ASM1[0], 命令序号[13]
[2024-07-05 16:22:28:392] [ASM]: 设置命令[EP REAL OPEN], 目标站点 ASM2[1], 命令序号[14]
[2024-07-05 16:22:29:428] [ASM]: 设置命令[NONE], 目标站点 ASM1[0], 命令序号[0]
[2024-07-05 16:22:33:385] [ASM]: 设置命令[NONE], 目标站点 ASM2[1], 命令序号[0]
show
css current time:2024-07-05 16:23:02
======= group[name = CSS, seq = 0, type = CSS, Control Node = 0] ===================
ep: inst_name seqno port mode sys_status vtd_status is_ok active guid ts
CSS1 0 9341 Control Node OPEN WORKING OK TRUE 825021 825 103
CSS2 1 9343 Normal Node OPEN WORKING OK TRUE 827595 8276 75
[2024-07-05 16:29:18:397] [DB]: 设置EP DSC1[0]为控制节点
[2024-07-05 16:29:18:401] [DB]: 设置命令[START NOTIFY], 目标站点 DSC1[0], 命令序号[2]
[2024-07-05 16:29:19:406] [DB]: 设置命令[DCR_LOAD], 目标站点 DSC1[0], 命令序号[3]
[2024-07-05 16:29:19:408] [DB]: 设置命令[DCR_LOAD], 目标站点 DSC2[1], 命令序号[4]
[2024-07-05 16:29:19:515] [DB]: 设置命令[NONE], 目标站点 DSC1[0], 命令序号[0]
[2024-07-05 16:29:19:518] [DB]: 设置命令[NONE], 目标站点 DSC2[1], 命令序号[0]
[2024-07-05 16:29:19:523] [DB]: 设置命令[EP START], 目标站点 DSC1[0], 命令序号[6]
[2024-07-05 16:29:39:838] Instance DB [DSC1] has not been detected for about 10 seconds, CSS may probably exclude the instance from the cluster after 50 seconds
[2024-07-05 16:29:39:838] Instance DB [DSC2] has not been detected for about 10 seconds, CSS may probably exclude the instance from the cluster after 50 seconds
show
css current time:2024-07-05 16:29:56
======= group[name = CSS, seq = 0, type = CSS, Control Node = 0] ===================
ep: inst_name seqno port mode sys_status vtd_status is_ok active guid ts
CSS1 0 9341 Control Node OPEN WORKING OK TRUE 825021 825352
CSS2 1 9343 Normal Node OPEN WORKING OK TRUE 827595 827924
[2024-07-05 16:30:00:460] Instance DB [DSC1] has not been detected for about 20 seconds, CSS may probably exclude the instance from the cluster after 40 seconds
[2024-07-05 16:30:00:460] Instance DB [DSC2] has not been detected for about 20 seconds, CSS may probably exclude the instance from the cluster after 40 seconds
检查一下日志里面的报错信息,看日志信息是不是未监控到实例的状态。
考虑是因为实例宕机、网络问题、监控配置错误等原因。
部署可以参考:
https://eco.dameng.com/document/dm/zh-cn/ops/DSC-installation-cluster.html
https://eco.dameng.com/document/dm/zh-cn/pm/dsc-build.html