A Manual Primary/Standby Switchover via the Monitor in a Live Environment

hlwg 2024/04/25

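1. The primary and standby IPs have been replaced with example addresses.
2. After recovery, the primary and standby roles must be switched back manually.
3. The database stays up throughout; the application verifies the switchover online.

The steps performed on the monitor and the primary/standby environment are as follows: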


[dmdba@ecsDBjsq ~]$ cd /data/dmdbms/bin
[dmdba@ecsDBjsq bin]$ ./dmmonitor dmmonitor_noc.ini 

#================================================================================#

login
username:SYSDBA
password:
[monitor]         2023-12-21 14:30:45: Login dmmonitor success!


Input command illegal, please input help to get the help information!

SHOW
2023-12-21 14:30:51 
#================================================================================#
GROUP            OGUID       MON_CONFIRM     MODE            MPP_FLAG  
GRP1             26180       FALSE           AUTO            FALSE     


<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.205        5256         2023-12-21 14:30:51  GLOBAL    VALID     OPEN           ECSZ             OK        1     1     OPEN        PRIMARY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.205        5236       OK        ECSZ             OPEN        PRIMARY   0          0            REALTIME  VALID    1808039         2721899         1808040         2721900         NONE                  

<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.206        5256         2023-12-21 14:30:51  GLOBAL    VALID     OPEN           ECSB             OK        1     1     OPEN        STANDBY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.206        5236       OK        ECSB             OPEN        STANDBY   0          0            REALTIME  VALID    4807            2721898         4807            2721898         NONE                  

DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808038, 1808038, 1808039], (RLSN, SLSN, KLSN)[2721898, 2721898, 2721899], N_TSK[0], TSK_MEM_USE[512] 
REDO_LSN_ARR: (2721898, 2721898, 2721898, 2721898)


#================================================================================#
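The role check done by eye on the SHOW output above can be sketched in Python. This is a minimal sketch, not part of any DM tooling: it assumes the output has been captured as text, that each DATABASE GLOBAL INFO data row directly follows its header line, and that WTIME occupies two whitespace-separated tokens. The names `GLOBAL_COLUMNS`, `parse_global_rows`, `roles`, `WDATE`, and `WCLOCK` are made up for illustration.

```python
# Column names of a "DATABASE GLOBAL INFO" row. WTIME is printed as
# "date time", so it is split into two hypothetical columns here.
GLOBAL_COLUMNS = [
    "DW_IP", "MAL_DW_PORT", "WDATE", "WCLOCK", "WTYPE", "WCTLSTAT", "WSTATUS",
    "INAME", "INST_OK", "N_EP", "N_OK", "ISTATUS", "IMODE", "DSC_STATUS",
    "RTYPE", "RSTAT",
]

def parse_global_rows(show_output):
    """Return one dict per data row that follows a DW_IP header line."""
    rows = []
    lines = show_output.splitlines()
    for i, line in enumerate(lines):
        if line.startswith("DW_IP") and i + 1 < len(lines):
            tokens = lines[i + 1].split()
            if len(tokens) == len(GLOBAL_COLUMNS):
                rows.append(dict(zip(GLOBAL_COLUMNS, tokens)))
    return rows

def roles(rows):
    """Map instance name -> (IMODE, WSTATUS, RSTAT)."""
    return {r["INAME"]: (r["IMODE"], r["WSTATUS"], r["RSTAT"]) for r in rows}

# Rows copied from the SHOW output above, with column padding collapsed.
SAMPLE = """\
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.205 5256 2023-12-21 14:30:51 GLOBAL VALID OPEN ECSZ OK 1 1 OPEN PRIMARY DSC_OPEN REALTIME VALID
DW_IP MAL_DW_PORT WTIME WTYPE WCTLSTAT WSTATUS INAME INST_OK N_EP N_OK ISTATUS IMODE DSC_STATUS RTYPE RSTAT
192.168.101.206 5256 2023-12-21 14:30:51 GLOBAL VALID OPEN ECSB OK 1 1 OPEN STANDBY DSC_OPEN REALTIME VALID
"""

if __name__ == "__main__":
    for name, state in roles(parse_global_rows(SAMPLE)).items():
        print(name, state)
```

Including WSTATUS in the result matters for later snapshots: an instance can still be listed as PRIMARY while its watcher status is ERROR.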

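--Failover
Stop the watcher daemon (dmwatcher)
Stop the instance

--After failover
Confirm the connectivity test with the project team and watch the database on 206. The command netstat -natp | grep 5236 shows whether application connections have arrived.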
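The netstat check can also be summarized per client. The sketch below is only an illustration: it assumes the usual Linux `netstat -natp` column layout (protocol, two queue counters, local address, foreign address, state, PID/program), and the sample lines, client addresses, and PIDs are invented. `established_clients` is a hypothetical helper name.

```python
from collections import Counter

def established_clients(netstat_output, port=5236):
    """Count ESTABLISHED connections to the local port, grouped by client IP."""
    clients = Counter()
    for line in netstat_output.splitlines():
        fields = line.split()
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue  # skip headers and non-TCP lines
        local, remote, state = fields[3], fields[4], fields[5]
        local_port = local.rsplit(":", 1)[-1]
        if local_port == str(port) and state == "ESTABLISHED":
            clients[remote.rsplit(":", 1)[0]] += 1
    return clients

# Invented sample; only the server address and port come from the article.
SAMPLE = """\
tcp        0      0 0.0.0.0:5236            0.0.0.0:*               LISTEN      2101/dmserver
tcp        0      0 192.168.101.206:5236    192.168.101.50:48212    ESTABLISHED 2101/dmserver
tcp        0      0 192.168.101.206:5236    192.168.101.50:48214    ESTABLISHED 2101/dmserver
tcp        0      0 192.168.101.206:5236    192.168.101.51:51022    TIME_WAIT   -
tcp        0      0 192.168.101.206:22      192.168.101.9:60000     ESTABLISHED 900/sshd
"""

if __name__ == "__main__":
    for ip, count in established_clients(SAMPLE).items():
        print(ip, count)
```

An empty result after the failover would suggest the application has not yet reconnected to the new primary.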

#================================================================================#

show
2023-12-21 14:36:50 
#================================================================================#
GROUP            OGUID       MON_CONFIRM     MODE            MPP_FLAG  
GRP1             26180       FALSE           AUTO            FALSE     


<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.206        5256         2023-12-21 14:36:49  GLOBAL    VALID     OPEN           ECSB             OK        1     1     OPEN        PRIMARY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.206        5236       OK        ECSB             OPEN        PRIMARY   0          0            REALTIME  VALID    1808208         2740933         1808208         2740933         NONE                  

ERROR DATABASE:

<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.205        5256         2023-12-21 14:33:07  GLOBAL    VALID     ERROR          ECSZ             OK        1     1     OPEN        PRIMARY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.205        5236       OK        ECSZ             OPEN        PRIMARY   0          0            REALTIME  VALID    1808098         2731382         1808098         2731382         NONE                  

#================================================================================#

Monitoring with the tip command

tip
[monitor]         2023-12-21 14:47:21: [!!! TIP: This dmmonitor is not in CONFIRM mode, if PRIMARY instance error occurs in AUTO watch mode, this dmmonitor cannot do auto takeover !!!]

[monitor]         2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] cannot join other instances, dmwatcher status is OPEN, SYSOPENHISTORY status is VALID
[monitor]         2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] has no command to execute currently
[monitor]         2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL

[monitor]         2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] can join instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE]
[monitor]         2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] has no command to execute currently
[monitor]         2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL

[monitor]         2023-12-21 14:47:21: Group(GRP1) current active instances are OK

[monitor]         2023-12-21 14:47:21: All groups' current active instances are OK!
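To check a captured tip report without reading every line, something like the following could be used. It is a rough sketch that relies only on the line shapes visible in the output above; `summarize_tip` and `OK_LINE` are hypothetical names, and other DM versions may word these messages differently.

```python
import re

# Matches lines such as:
#   Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, ...
OK_LINE = re.compile(r"Instance (\w+)\[(\w+), (\w+), ISTAT_SAME:(\w+)\] is OK")

def summarize_tip(tip_output):
    """Return ({instance: (mode, status)}, all_groups_ok) from tip output."""
    instances = {}
    all_ok = False
    for line in tip_output.splitlines():
        m = OK_LINE.search(line)
        if m:
            name, mode, status, _ = m.groups()
            instances[name] = (mode, status)
        if "All groups' current active instances are OK" in line:
            all_ok = True
    return instances, all_ok

# Lines copied from the tip output above.
SAMPLE = """\
[monitor]         2023-12-21 14:47:21: Instance ECSB[PRIMARY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL
[monitor]         2023-12-21 14:47:21: Instance ECSZ[STANDBY, OPEN, ISTAT_SAME:TRUE] is OK, dmwatcher status is OPEN, dw_type is GLOBAL
[monitor]         2023-12-21 14:47:21: All groups' current active instances are OK!
"""

if __name__ == "__main__":
    print(summarize_tip(SAMPLE))
```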

--Manual primary/standby switchover

#================================================================================#

choose switchover GRP1
Can choose one of the following instances to do switchover:
1: ECSZ


switchover GRP1.ECSZ

[monitor]         2023-12-21 14:50:13: Start to switchover instance ECSZ
[monitor]         2023-12-21 14:50:13: Notify dmwatcher(ECSB) switch to SWITCHOVER status
[monitor]         2023-12-21 14:50:13: Dmwatcher process ECSB status switching [OPEN-->SWITCHOVER] 
[monitor]         2023-12-21 14:50:13: Switch dmwatcher ECSB to SWITCHOVER status success
[monitor]         2023-12-21 14:50:13: Notify dmwatcher(ECSZ) switch to SWITCHOVER status
[monitor]         2023-12-21 14:50:13: Dmwatcher process ECSZ status switching [OPEN-->SWITCHOVER] 
[monitor]         2023-12-21 14:50:14: Switch dmwatcher ECSZ to SWITCHOVER status success
[monitor]         2023-12-21 14:50:14: Instance ECSB start to execute sql SP_SET_GLOBAL_DW_STATUS(0, 6)
[monitor]         2023-12-21 14:50:15: Instance ECSB execute sql SP_SET_GLOBAL_DW_STATUS(0, 6) success
[monitor]         2023-12-21 14:50:15: Instance ECSZ start to execute sql SP_SET_GLOBAL_DW_STATUS(0, 6)
[monitor]         2023-12-21 14:50:15: Instance ECSZ execute sql SP_SET_GLOBAL_DW_STATUS(0, 6) success
[monitor]         2023-12-21 14:50:15: Instance ECSB start to execute sql ALTER DATABASE MOUNT
[monitor]         2023-12-21 14:50:15: Instance ECSB execute sql ALTER DATABASE MOUNT success
[monitor]         2023-12-21 14:50:15: Instance ECSZ start to execute sql SP_APPLY_KEEP_PKG()
[monitor]         2023-12-21 14:50:16: Instance ECSZ execute sql SP_APPLY_KEEP_PKG() success
[monitor]         2023-12-21 14:50:16: Instance ECSZ start to execute sql ALTER DATABASE MOUNT
[monitor]         2023-12-21 14:50:16: Instance ECSZ execute sql ALTER DATABASE MOUNT success
[monitor]         2023-12-21 14:50:16: Instance ECSB start to execute sql ALTER DATABASE STANDBY
[monitor]         2023-12-21 14:50:17: Instance ECSB execute sql ALTER DATABASE STANDBY success
[monitor]         2023-12-21 14:50:17: Instance ECSZ start to execute sql ALTER DATABASE PRIMARY
[monitor]         2023-12-21 14:50:18: Instance ECSZ execute sql ALTER DATABASE PRIMARY success
[monitor]         2023-12-21 14:50:18: Notify instance ECSZ to change all arch status to be invalid
[monitor]         2023-12-21 14:50:18: Succeed to change all instances arch status to be invalid
[monitor]         2023-12-21 14:50:18: Instance ECSB start to execute sql ALTER DATABASE OPEN FORCE
[monitor]         2023-12-21 14:50:18: Instance ECSB execute sql ALTER DATABASE OPEN FORCE success
[monitor]         2023-12-21 14:50:18: Instance ECSZ start to execute sql ALTER DATABASE OPEN FORCE
[monitor]         2023-12-21 14:50:18: Instance ECSZ execute sql ALTER DATABASE OPEN FORCE success
[monitor]         2023-12-21 14:50:18: Instance ECSB start to execute sql SP_SET_GLOBAL_DW_STATUS(6, 0)
[monitor]         2023-12-21 14:50:18: Instance ECSB execute sql SP_SET_GLOBAL_DW_STATUS(6, 0) success
[monitor]         2023-12-21 14:50:18: Instance ECSZ start to execute sql SP_SET_GLOBAL_DW_STATUS(6, 0)
[monitor]         2023-12-21 14:50:18: Instance ECSZ execute sql SP_SET_GLOBAL_DW_STATUS(6, 0) success
[monitor]         2023-12-21 14:50:18: Notify dmwatcher(ECSB) switch to OPEN status
[monitor]         2023-12-21 14:50:18: Dmwatcher process ECSB status switching [SWITCHOVER-->OPEN] 
[monitor]         2023-12-21 14:50:19: Switch dmwatcher ECSB to OPEN status success
[monitor]         2023-12-21 14:50:19: Notify dmwatcher(ECSZ) switch to OPEN status
[monitor]         2023-12-21 14:50:20: Dmwatcher process ECSZ status switching [SWITCHOVER-->OPEN] 
[monitor]         2023-12-21 14:50:20: Switch dmwatcher ECSZ to OPEN status success
[monitor]         2023-12-21 14:50:20: Notify group(GRP1)'s dmwatcher to do clear
[monitor]         2023-12-21 14:50:20: Clean request of dmwatcher processer ECSZ success
[monitor]         2023-12-21 14:50:21: Clean request of dmwatcher processer ECSB success
[monitor]         2023-12-21 14:50:21: Switchover instance ECSZ success
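The switchover log above lists each SQL statement the monitor ran on each instance. A small sketch for extracting that sequence and the total elapsed time from a saved log follows; it assumes the "[monitor] timestamp: message" line shape seen above, and `summarize_switchover` is a hypothetical helper, not a DM utility.

```python
import re
from datetime import datetime

LINE = re.compile(r"^\[monitor\]\s+(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}): (.*)$")
SQL_DONE = re.compile(r"^Instance (\w+) execute sql (.+) success$")

def summarize_switchover(log_text):
    """Return ([(instance, sql), ...] in log order, elapsed seconds)."""
    stamps, steps = [], []
    for raw in log_text.splitlines():
        m = LINE.match(raw.strip())
        if not m:
            continue
        stamps.append(datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S"))
        done = SQL_DONE.match(m.group(2).strip())
        if done:
            steps.append((done.group(1), done.group(2)))
    # Elapsed time between the first and last timestamped line.
    elapsed = (stamps[-1] - stamps[0]).total_seconds() if stamps else 0.0
    return steps, elapsed

# A few lines copied from the log above.
SAMPLE = """\
[monitor]         2023-12-21 14:50:13: Start to switchover instance ECSZ
[monitor]         2023-12-21 14:50:15: Instance ECSB execute sql SP_SET_GLOBAL_DW_STATUS(0, 6) success
[monitor]         2023-12-21 14:50:17: Instance ECSB execute sql ALTER DATABASE STANDBY success
[monitor]         2023-12-21 14:50:18: Instance ECSZ execute sql ALTER DATABASE PRIMARY success
[monitor]         2023-12-21 14:50:21: Switchover instance ECSZ success
"""

if __name__ == "__main__":
    steps, elapsed = summarize_switchover(SAMPLE)
    for instance, sql in steps:
        print(instance, sql)
    print("elapsed seconds:", elapsed)
```

In this run the whole exchange, from "Start to switchover" to "Switchover instance ECSZ success", spans 14:50:13 to 14:50:21.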

2023-12-21 14:50:21 
#================================================================================#
GROUP            OGUID       MON_CONFIRM     MODE            MPP_FLAG  
GRP1             26180       FALSE           AUTO            FALSE     
<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.205        5256         2023-12-21 14:50:20  GLOBAL    VALID     OPEN           ECSZ             OK        1     1     OPEN        PRIMARY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.205        5236       OK        ECSZ             OPEN        PRIMARY   0          0            REALTIME  VALID    1808627         2750990         1808627         2750990         NONE                  

<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.206        5256         2023-12-21 14:50:20  GLOBAL    VALID     OPEN           ECSB             OK        1     1     OPEN        STANDBY   DSC_OPEN       REALTIME  INVALID  

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.206        5236       OK        ECSB             OPEN        STANDBY   0          0            REALTIME  INVALID  1808624         2741809         1808624         2741809         NONE                  

DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808624, 1808624, 1808624], (RLSN, SLSN, KLSN)[2741809, 2741809, 2741809], N_TSK[0], TSK_MEM_USE[0] 
REDO_LSN_ARR: (2741809, 2741809, 2741809, 2741809)
#================================================================================#

[monitor]         2023-12-21 14:50:22: Dmwatcher process ECSZ status switching [OPEN-->RECOVERY] 
                  WTIME                WSTATUS        INST_OK   INAME            ISTATUS     IMODE     RSTAT    N_OPEN   FLSN            CLSN            
                  2023-12-21 14:50:22  RECOVERY       OK        ECSZ             OPEN        PRIMARY   VALID    7        2750994         2750994         

[monitor]         2023-12-21 14:50:25: Dmwatcher process ECSZ status switching [RECOVERY-->OPEN] 
                  WTIME                WSTATUS        INST_OK   INAME            ISTATUS     IMODE     RSTAT    N_OPEN   FLSN            CLSN            
                  2023-12-21 14:50:25  OPEN           OK        ECSZ             OPEN        PRIMARY   VALID    7        2750995         2750995         


Input command illegal, please input help to get the help information!


Input command illegal, please input help to get the help information!

show
2023-12-21 14:57:31 
#================================================================================#
GROUP            OGUID       MON_CONFIRM     MODE            MPP_FLAG  
GRP1             26180       FALSE           AUTO            FALSE     


<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.205        5256         2023-12-21 14:57:30  GLOBAL    VALID     OPEN           ECSZ             OK        1     1     OPEN        PRIMARY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.205        5236       OK        ECSZ             OPEN        PRIMARY   0          0            REALTIME  VALID    1808850         2751664         1808851         2751668         NONE                  

<<DATABASE GLOBAL INFO:>>
DW_IP               MAL_DW_PORT  WTIME                WTYPE     WCTLSTAT  WSTATUS        INAME            INST_OK   N_EP  N_OK  ISTATUS     IMODE     DSC_STATUS     RTYPE     RSTAT    
192.168.101.206        5256         2023-12-21 14:57:30  GLOBAL    VALID     OPEN           ECSB             OK        1     1     OPEN        STANDBY   DSC_OPEN       REALTIME  VALID    

EP INFO:
INST_IP             INST_PORT  INST_OK   INAME            ISTATUS     IMODE     DSC_SEQNO  DSC_CTL_NODE RTYPE     RSTAT    FSEQ            FLSN            CSEQ            CLSN            DW_STAT_FLAG          
192.168.101.206        5236       OK        ECSB             OPEN        STANDBY   0          0            REALTIME  VALID    1808624         2751663         1808624         2751663         NONE                  

DATABASE(ECSB) APPLY INFO FROM (ECSZ), REDOS_PARALLEL_NUM (4):
DSC_SEQNO[0], (RSEQ, SSEQ, KSEQ)[1808849, 1808849, 1808850], (RLSN, SLSN, KLSN)[2751663, 2751663, 2751664], N_TSK[0], TSK_MEM_USE[512] 
REDO_LSN_ARR: (2751663, 2751663, 2751663, 2751663)


#================================================================================#
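Right after the switchover the standby ECSB showed RSTAT INVALID and an FLSN well behind the primary; by 14:57:31 it was VALID and one LSN behind. That comparison can be written as a small check. The sketch below is an illustration only: the `Snapshot` type, the `standby_caught_up` helper, and the `max_gap` threshold of 10 are assumptions, not values from DM documentation, and FLSN is read here simply as the figure printed in the EP INFO rows.

```python
from dataclasses import dataclass

@dataclass
class Snapshot:
    taken_at: str        # time of the show output
    primary_flsn: int    # FLSN of the primary (ECSZ) in EP INFO
    standby_flsn: int    # FLSN of the standby (ECSB) in EP INFO
    standby_rstat: str   # RSTAT of the standby

def lsn_gap(s):
    """How far the standby FLSN trails the primary FLSN."""
    return s.primary_flsn - s.standby_flsn

def standby_caught_up(s, max_gap=10):
    """True when the standby archive is VALID and the gap is small.

    max_gap is an arbitrary illustrative threshold.
    """
    return s.standby_rstat == "VALID" and lsn_gap(s) <= max_gap

# Values taken from the two outputs after the switchover above.
AFTER_SWITCH = Snapshot("2023-12-21 14:50:21", 2750990, 2741809, "INVALID")
LATER = Snapshot("2023-12-21 14:57:31", 2751664, 2751663, "VALID")

if __name__ == "__main__":
    for snap in (AFTER_SWITCH, LATER):
        print(snap.taken_at, lsn_gap(snap), standby_caught_up(snap))
```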

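--After the primary is switched back to 205
Confirm the connectivity test with the project team and watch the database on 205. The command netstat -natp | grep 5236 shows whether application connections have arrived.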
