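To help us respond efficiently, please provide the following information when asking a question. Clearly described problems are answered first.
[DM version]: dm8
[Operating system]: Kylin 10
[Problem description]*: The DM service stopped on its own while in use.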
systemctl status DmServiceDMSERVER.service
● DmServiceDMSERVER.service - DM Instance Service(DmServiceDMSERVER).
Loaded: loaded (/usr/lib/systemd/system/DmServiceDMSERVER.service; enabled; vendor preset: disabled)
Active: failed (Result: signal) since Sat 2023-04-08 09:11:23 CST; 5min ago
Process: 2630703 ExecStop=/home/dmdba/dmdbms/bin/DmServiceDMSERVER stop (code=exited, status=0/SUCCESS)
Process: 2630760 ExecStart=/home/dmdba/dmdbms/bin/DmServiceDMSERVER start (code=exited, status=0/SUCCESS)
Main PID: 2630784 (code=killed, signal=FPE)
Checking the service status:
systemctl -a |grep DM
● DmServiceDMSERVER.service loaded failed failed DM Instance Service(DmServiceDMSERVER).
Log contents:
ee_space(536645632), n_ep(1)
2023-04-08 09:04:17.676 [INFO] database P0002630784 T0000000000002630801 checkpoint end, 0 pages flushed, used_space[217088], free_space[536645632].
2023-04-08 09:07:17.689 [INFO] database P0002630784 T0000000000002630866 checkpoint requested by CKPT_INTERVAL, rlog free space[536581120], used space[281600]
2023-04-08 09:07:17.690 [INFO] database P0002630784 T0000000000002630866 checkpoint generate by ckpt_interval
2023-04-08 09:07:17.690 [INFO] database P0002630784 T0000000000002630801 checkpoint begin, used_space[282112], free_space[536580608]...
2023-04-08 09:07:17.714 [INFO] database P0002630784 T0000000000002630801 ckpt2_log_adjust: full_status: 160, ptx_reserved: 0
2023-04-08 09:07:17.714 [INFO] database P0002630784 T0000000000002630801 ckpt2_log_adjust: ckpt_lsn(54471262), ckpt_fil(1), ckpt_off(54417920), cur_lsn(54471876), l_next_seq(976859), g_next_seq(976859), cur_free(54690816), total_space(536862720), used_space(272896), free_space(536589824), n_ep(1)
2023-04-08 09:07:17.714 [INFO] database P0002630784 T0000000000002630801 checkpoint end, 0 pages flushed, used_space[272896], free_space[536589824].
2023-04-08 09:10:17.725 [INFO] database P0002630784 T0000000000002630866 checkpoint requested by CKPT_INTERVAL, rlog free space[536522240], used space[340480]
2023-04-08 09:10:17.725 [INFO] database P0002630784 T0000000000002630866 checkpoint generate by ckpt_interval
2023-04-08 09:10:17.725 [INFO] database P0002630784 T0000000000002630801 checkpoint begin, used_space[340992], free_space[536521728]...
2023-04-08 09:10:17.754 [INFO] database P0002630784 T0000000000002630801 ckpt2_log_adjust: full_status: 160, ptx_reserved: 0
2023-04-08 09:10:17.754 [INFO] database P0002630784 T0000000000002630801 ckpt2_log_adjust: ckpt_lsn(54471307), ckpt_fil(1), ckpt_off(54439424), cur_lsn(54472035), l_next_seq(976940), g_next_seq(976940), cur_free(54758912), total_space(536862720), used_space(319488), free_space(536543232), n_ep(1)
2023-04-08 09:10:17.754 [INFO] database P0002630784 T0000000000002630801 checkpoint end, 0 pages flushed, used_space[319488], free_space[536543232].
2023-04-08 09:11:19.740 [FATAL] database P0002630784 T0000000000001795196 sigterm_handler receive signal 11
The last log entry reports signal 11.
Please help analyze why the service stopped.
https://eco.dameng.com/community/question/cc2663ca397695a515217fdee837333b
Please refer to this Q&A.
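First check whether the operating system's OOM killer terminated the database service. Then check whether the database generated a core file, and analyze the stack information in it.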
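The OOM check suggested above could be sketched roughly as follows. The helper name find_oom_events is made up for illustration, and the search patterns are an assumption based on the messages the Linux kernel usually prints when the OOM killer acts; the exact wording can vary between kernel versions.

```shell
#!/bin/sh
# Hypothetical helper (not part of DM or the OS): print the lines of a log
# file that look like OOM-killer activity.
# $1: path to a log file, for example /var/log/messages.
find_oom_events() {
    grep -iE 'out of memory|oom-killer|killed process' "$1"
}

# Possible usage on the affected host (run as root if the log is restricted):
#   find_oom_events /var/log/messages
# The kernel ring buffer can be searched the same way:
#   dmesg -T | grep -iE 'out of memory|oom-killer|killed process'
```

If nothing matches around 09:11 on 2023-04-08, an OOM kill is less likely and the core file becomes the main lead.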
Check the system log file /var/log/messages for more detailed information, and check whether a coredump file was generated.
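A minimal sketch of the coredump check follows. Both function names are made up for illustration. The directory to search is an assumption: core files often land in the process working directory (possibly /home/dmdba/dmdbms/bin here), but the actual location depends on core_pattern.

```shell
#!/bin/sh
# Hypothetical helper: show the settings that decide whether and where a
# core file is written. Note that ulimit -c reports the limit of the
# current shell, not of the running service.
show_core_settings() {
    echo "core size limit (ulimit -c): $(ulimit -c)"
    if [ -r /proc/sys/kernel/core_pattern ]; then
        echo "core_pattern: $(cat /proc/sys/kernel/core_pattern)"
    fi
}

# Hypothetical helper: list regular files named core* in a directory
# (searching at most two levels deep).
# $1: directory to search.
find_core_files() {
    find "$1" -maxdepth 2 -type f -name 'core*' 2>/dev/null
}

# Possible usage on the affected host (the path is an assumption):
#   show_core_settings
#   find_core_files /home/dmdba/dmdbms/bin
# The limit applied to the unit itself can be read from systemd:
#   systemctl show DmServiceDMSERVER.service -p LimitCORE
```

A core size limit of 0 means no core file was written for this crash, so the limit would need to be raised before the problem recurs.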