1.主备ip已替换为示例地址
2.恢复后需要手动切换主备服务器
3.数据库不停机,应用在线进行切换验证
监视器、主备环境操作步骤如下:
[dmdba@ecsDBjsq ~]$ cd /data/dmdbms/bin
[dmdba@ecsDBjsq bin]$ ./dmmonitor dmmonitor_noc.ini
#================================================================================#
login
username:SYSDBA
password:
[monitor] 2023-12-21 14:30:45: Login dmmonitor success!
Input command illegal, please input help to get the help information!
SHOW
2023-12-21 14:30:51
#================================================================================#
GROUP OGUID MON_CONFIRM MODE MPP_FLAG
GRP1 26180 FALSE AUTO FALSE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.205 5256 2023-12-21 14:30:51 GLOBAL VALID OPEN ECSZ OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.205 5236 OK ECSZ OPEN PRIMARY 0 0 REALTIME VALID 1808039 2721899 1808040 2721900 NONE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.206 5256 2023-12-21 14:30:51 GLOBAL VALID OPEN ECSB OK 1 1 OPEN STANDBY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.206 5236 OK ECSB OPEN STANDBY 0 0 REALTIME VALID 4807 2721898 4807 2721898 NONE
DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808038, 1808038, 1808039], (RLSN, SLSN, KLSN)[2721898, 2721898, 2721899], N_TSK[0], TSK_MEM_USE[512]
REDO_LSN_ARR: (2721898, 2721898, 2721898, 2721898)
#================================================================================#
--故障切换
停守护进程
停实例
--故障切换后
跟项目组确认连通测试,观察206库,可通过命令netstat -natp | grep 5236 查看应用连接状态是否已进来。
#================================================================================#
show
2023-12-21 14:36:50
#================================================================================#
GROUP OGUID MON_CONFIRM MODE MPP_FLAG
GRP1 26180 FALSE AUTO FALSE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.206 5256 2023-12-21 14:36:49 GLOBAL VALID OPEN ECSB OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.206 5236 OK ECSB OPEN PRIMARY 0 0 REALTIME VALID 1808208 2740933 1808208 2740933 NONE
ERROR DATABASE:
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.205 5256 2023-12-21 14:33:07 GLOBAL VALID ERROR ECSZ OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.205 5236 OK ECSZ OPEN PRIMARY 0 0 REALTIME VALID 1808098 2731382 1808098 2731382 NONE
#================================================================================#
tip命令监控
tip
[monitor] 2023-12-21 14:47:21: [!!! TIP: This dmmonitor is not in CONFIRM mode, if PRIMARY instance error occurs in AUTO watch mode, this dmmonitor cannot do auto takeover !!!]
[monitor] 2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] cannot join other instances, dmwatcher status is OPEN, SYSOPENHISTORY status is VALID
[monitor] 2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] has no command to execute currently
[monitor] 2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL
[monitor] 2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] can join instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE]
[monitor] 2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] has no command to execute currently
[monitor] 2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL
[monitor] 2023-12-21 14:47:21: Group(GRP1) current active instances are OK
[monitor] 2023-12-21 14:47:21: All groups' current active instances are OK!
--手动切换主备
#================================================================================#
choose switchover GRP1
Can choose one of the following instances to do switchover:
1: ECSZ
switchover GRP1.ECSZ
[monitor] 2023-12-21 14:50:13: Start to switchover instance ECSZ
[monitor] 2023-12-21 14:50:13: Notify dmwatcher(ECSB) switch to SWITCHOVER status
[monitor] 2023-12-21 14:50:13: Dmwatcher process ECSB status switching [OPEN-->SWITCHOVER]
[monitor] 2023-12-21 14:50:13: Switch dmwatcher ECSB to SWITCHOVER status success
[monitor] 2023-12-21 14:50:13: Notify dmwatcher(ECSZ) switch to SWITCHOVER status
[monitor] 2023-12-21 14:50:13: Dmwatcher process ECSZ status switching [OPEN-->SWITCHOVER]
[monitor] 2023-12-21 14:50:14: Switch dmwatcher ECSZ to SWITCHOVER status success
[monitor] 2023-12-21 14:50:14: Instance ECSB start to execute sql SP_SET_GLOBAL_DW_STATUS(0, 6)
[monitor] 2023-12-21 14:50:15: Instance ECSB execute sql SP_SET_GLOBAL_DW_STATUS(0, 6) success
[monitor] 2023-12-21 14:50:15: Instance ECSZ start to execute sql SP_SET_GLOBAL_DW_STATUS(0, 6)
[monitor] 2023-12-21 14:50:15: Instance ECSZ execute sql SP_SET_GLOBAL_DW_STATUS(0, 6) success
[monitor] 2023-12-21 14:50:15: Instance ECSB start to execute sql ALTER DATABASE MOUNT
[monitor] 2023-12-21 14:50:15: Instance ECSB execute sql ALTER DATABASE MOUNT success
[monitor] 2023-12-21 14:50:15: Instance ECSZ start to execute sql SP_APPLY_KEEP_PKG()
[monitor] 2023-12-21 14:50:16: Instance ECSZ execute sql SP_APPLY_KEEP_PKG() success
[monitor] 2023-12-21 14:50:16: Instance ECSZ start to execute sql ALTER DATABASE MOUNT
[monitor] 2023-12-21 14:50:16: Instance ECSZ execute sql ALTER DATABASE MOUNT success
[monitor] 2023-12-21 14:50:16: Instance ECSB start to execute sql ALTER DATABASE STANDBY
[monitor] 2023-12-21 14:50:17: Instance ECSB execute sql ALTER DATABASE STANDBY success
[monitor] 2023-12-21 14:50:17: Instance ECSZ start to execute sql ALTER DATABASE PRIMARY
[monitor] 2023-12-21 14:50:18: Instance ECSZ execute sql ALTER DATABASE PRIMARY success
[monitor] 2023-12-21 14:50:18: Notify instance ECSZ to change all arch status to be invalid
[monitor] 2023-12-21 14:50:18: Succeed to change all instances arch status to be invalid
[monitor] 2023-12-21 14:50:18: Instance ECSB start to execute sql ALTER DATABASE OPEN FORCE
[monitor] 2023-12-21 14:50:18: Instance ECSB execute sql ALTER DATABASE OPEN FORCE success
[monitor] 2023-12-21 14:50:18: Instance ECSZ start to execute sql ALTER DATABASE OPEN FORCE
[monitor] 2023-12-21 14:50:18: Instance ECSZ execute sql ALTER DATABASE OPEN FORCE success
[monitor] 2023-12-21 14:50:18: Instance ECSB start to execute sql SP_SET_GLOBAL_DW_STATUS(6, 0)
[monitor] 2023-12-21 14:50:18: Instance ECSB execute sql SP_SET_GLOBAL_DW_STATUS(6, 0) success
[monitor] 2023-12-21 14:50:18: Instance ECSZ start to execute sql SP_SET_GLOBAL_DW_STATUS(6, 0)
[monitor] 2023-12-21 14:50:18: Instance ECSZ execute sql SP_SET_GLOBAL_DW_STATUS(6, 0) success
[monitor] 2023-12-21 14:50:18: Notify dmwatcher(ECSB) switch to OPEN status
[monitor] 2023-12-21 14:50:18: Dmwatcher process ECSB status switching [SWITCHOVER-->OPEN]
[monitor] 2023-12-21 14:50:19: Switch dmwatcher ECSB to OPEN status success
[monitor] 2023-12-21 14:50:19: Notify dmwatcher(ECSZ) switch to OPEN status
[monitor] 2023-12-21 14:50:20: Dmwatcher process ECSZ status switching [SWITCHOVER-->OPEN]
[monitor] 2023-12-21 14:50:20: Switch dmwatcher ECSZ to OPEN status success
[monitor] 2023-12-21 14:50:20: Notify group(GRP1)'s dmwatcher to do clear
[monitor] 2023-12-21 14:50:20: Clean request of dmwatcher processer ECSZ success
[monitor] 2023-12-21 14:50:21: Clean request of dmwatcher processer ECSB success
[monitor] 2023-12-21 14:50:21: Switchover instance ECSZ success
2023-12-21 14:50:21
#================================================================================#
GROUP OGUID MON_CONFIRM MODE MPP_FLAG
GRP1 26180 FALSE AUTO FALSE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.205 5256 2023-12-21 14:50:20 GLOBAL VALID OPEN ECSZ OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.205 5236 OK ECSZ OPEN PRIMARY 0 0 REALTIME VALID 1808627 2750990 1808627 2750990 NONE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.206 5256 2023-12-21 14:50:20 GLOBAL VALID OPEN ECSB OK 1 1 OPEN STANDBY DSC_OPEN REALTIME INVALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.206 5236 OK ECSB OPEN STANDBY 0 0 REALTIME INVALID 1808624 2741809 1808624 2741809 NONE
DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808624, 1808624, 1808624], (RLSN, SLSN, KLSN)[2741809, 2741809, 2741809], N_TSK[0], TSK_MEM_USE[0]
REDO_LSN_ARR: (2741809, 2741809, 2741809, 2741809)
#================================================================================#
[monitor] 2023-12-21 14:50:22: Dmwatcher process ECSZ status switching [OPEN-->RECOVERY]
WTIME WSTATUS INST_OK INAME ISTATUS IMODE RSTAT N_OPEN FLSN CLSN
2023-12-21 14:50:22 RECOVERY OK ECSZ OPEN PRIMARY VALID 7 2750994 2750994
[monitor] 2023-12-21 14:50:25: Dmwatcher process ECSZ status switching [RECOVERY-->OPEN]
WTIME WSTATUS INST_OK INAME ISTATUS IMODE RSTAT N_OPEN FLSN CLSN
2023-12-21 14:50:25 OPEN OK ECSZ OPEN PRIMARY VALID 7 2750995 2750995
Input command illegal, please input help to get the help information!
Input command illegal, please input help to get the help information!
show
2023-12-21 14:57:31
#================================================================================#
GROUP OGUID MON_CONFIRM MODE MPP_FLAG
GRP1 26180 FALSE AUTO FALSE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.205 5256 2023-12-21 14:57:30 GLOBAL VALID OPEN ECSZ OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.205 5236 OK ECSZ OPEN PRIMARY 0 0 REALTIME VALID 1808850 2751664 1808851 2751668 NONE
<<DATABASE GLOBAL INFO:>>
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.206 5256 2023-12-21 14:57:30 GLOBAL VALID OPEN ECSB OK 1 1 OPEN STANDBY DSC_OPEN REALTIME VALID
EP INFO:
INST_IP INST_PORT INST_OK INAME ISTATUS IMODE DSC_SEQNO DSC_CTL_NODE RTYPE RSTAT FSEQ FLSN CSEQ CLSN DW_STAT_FLAG
192.168.101.206 5236 OK ECSB OPEN STANDBY 0 0 REALTIME VALID 1808624 2751663 1808624 2751663 NONE
DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808849, 1808849, 1808850], (RLSN, SLSN, KLSN)[2751663, 2751663, 2751664], N_TSK[0], TSK_MEM_USE[512]
REDO_LSN_ARR: (2751663, 2751663, 2751663, 2751663)
#================================================================================#
--主库切换回205后
跟项目组确认连通测试,观察205库,可通过命令netstat -natp | grep 5236 查看应用连接状态是否已进来。
文章
阅读量
获赞