
64
1.
OS Stall Monitoring by using OpenIPMI
This section explains OS Stall Monitoring by using OpenIPMI.
F u n c t i o n s
You can monitor OS Stall condition by regularly updating watchdog timer (timer for software stall monitoring)
mounted machine. In case there is no response due to OS stall or, timer is not updated or other reasons,
Watchdog timer expires and the system reboot automatically.
Confirm the movement situation of OpenIPMI by all means before setting this
chapter. Because it uses a server management driver for OS stall monitoring when
"mainte" is displayed by lsmod command, it is not necessary to set this chapter.
S e t t i n g s
You can set timeout period, update interval, action after timeout. The parameter is as follows.
Timeout Period: timeout
Period Value in which whether OS stall generation is judged. You can set it in number of seconds.
Default Value is 60 seconds. It is possible to be set from 10 seconds.
You can set it in /etc/sysconfig/ipmi
Action after Timeout: action
You can select how to restore after timeout.
Default Value is reset. You can set it in /etc/sysconfig/ipmi
Reset system and try to reboot.
System power is shut down.
First power OFF and power ON just after that.
Update Interval: interval
Interval value which timer update. You can set it in number of seconds.
Default Value is 10 seconds. It is possible to be set within 1-59 seconds.
You can set it in /etc/watchdog.conf
By the system load situation of the machine, Even if OS is not a state of the
stall, watchdog timer can not be updated, so there is a possibility that the
time-out is generated. After it evaluates it in the state of a high load in the
system requirements, set the stall monitoring.
Comentarios a estos manuales