IPMI provides access to system status information and to the event log of hardware failures. It can also power nodes on, power them off, and reboot them remotely.
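
The remote operations mentioned above can be sketched as a small wrapper around `ipmitool`, assuming `ipmitool` is installed on the machine issuing the commands and that each node has a BMC reachable over the network. The host name `node1-bmc`, the user `admin`, and the function name `ipmi_remote` are hypothetical placeholders, not values taken from this setup.

```shell
# Hypothetical BMC address and user; replace with the values for your node.
BMC_HOST="${BMC_HOST:-node1-bmc}"
BMC_USER="${BMC_USER:-admin}"
# Command to run; overridable so the wrapper can be exercised without hardware.
IPMITOOL="${IPMITOOL:-ipmitool}"

# Run an ipmitool subcommand against a remote BMC over the lanplus
# (IPMI v2.0) interface. -E makes ipmitool read the password from the
# IPMI_PASSWORD environment variable instead of the command line.
ipmi_remote() {
    "$IPMITOOL" -I lanplus -H "$BMC_HOST" -U "$BMC_USER" -E "$@"
}
```

With `IPMI_PASSWORD` exported, typical calls would look like this:

login0:~# ipmi_remote chassis power status
login0:~# ipmi_remote chassis power on
login0:~# ipmi_remote chassis power off
login0:~# ipmi_remote chassis power cycle
login0:~# ipmi_remote sel list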

login0:~# yum install ipmitool
login0:~# modprobe ipmi_devintf
login0:~# modprobe ipmi_si
login0:~# ipmitool sensor
CPU Temp 1       | 38.000     | degrees C  | ok    | na        | na        | na        | 76.000    | 78.000    | 80.000    
CPU Temp 2       | 36.000     | degrees C  | ok    | na        | na        | na        | 76.000    | 78.000    | 80.000    
Sys Temp         | 33.000     | degrees C  | ok    | na        | na        | na        | 76.000    | 78.000    | 80.000    
CPU1 Vcore       | 0.928      | Volts      | ok    | 0.560     | 0.576     | 0.592     | 1.208     | 1.224     | 1.240     
CPU2 Vcore       | 0.928      | Volts      | ok    | 0.560     | 0.576     | 0.592     | 1.208     | 1.224     | 1.240     
1.5V             | 1.528      | Volts      | ok    | 1.312     | 1.328     | 1.344     | 1.656     | 1.672     | 1.688     
5V               | 5.088      | Volts      | ok    | 4.416     | 4.448     | 4.480     | 5.536     | 5.568     | 5.600     
12V              | 12.084     | Volts      | ok    | 10.653    | 10.706    | 10.759    | 13.250    | 13.303    | 13.356    
5VSB             | 5.056      | Volts      | ok    | 4.416     | 4.448     | 4.480     | 5.536     | 5.568     | 5.600     
-12V             | -12.700    | Volts      | ok    | -10.500   | -10.600   | -10.700   | -13.300   | -13.400   | -13.500   
3.3V             | 3.264      | Volts      | ok    | 2.880     | 2.904     | 2.928     | 3.672     | 3.696     | 3.720     
3.3VSB           | 3.264      | Volts      | ok    | 2.880     | 2.904     | 2.928     | 3.672     | 3.696     | 3.720     
VBAT             | 3.240      | Volts      | ok    | 2.880     | 2.904     | 2.928     | 3.672     | 3.696     | 3.720     
Fan1             | 6400.000   | RPM        | ok    | 200.000   | 300.000   | 400.000   | na        | na        | na        
Fan2             | 6400.000   | RPM        | ok    | 200.000   | 300.000   | 400.000   | na        | na        | na        
Fan3             | 6400.000   | RPM        | ok    | 200.000   | 300.000   | 400.000   | na        | na        | na        
Intrusion        | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
Power Supply     | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
CAT Error        | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
IOH Error        | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
CPU Overheat     | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
Thermal Trip1    | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na        
Thermal Trip2    | 0x0        | discrete   | 0x0000| na        | na        | na        | na        | na        | na   
login0:~#
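
Reading the status column by eye gets tedious on many nodes. A minimal sketch of a filter that lists only the sensors needing attention is shown here, assuming the pipe-separated column layout of the output above. The function name `ipmi_sensor_alerts` is made up for illustration, and treating `na` as "no reading" rather than as a fault is an assumption.

```shell
# Print every threshold sensor whose status column is not "ok".
# Reads `ipmitool sensor` output on standard input.
ipmi_sensor_alerts() {
    awk -F'|' '
        {
            # Trim surrounding blanks from the four fields we use:
            # name, reading, unit, status.
            for (i = 1; i <= 4; i++) gsub(/^[ \t]+|[ \t]+$/, "", $i)
        }
        # Skip discrete sensors (their 4th column is a hex state, not a
        # status) and, by assumption, sensors that report no reading ("na").
        $3 != "discrete" && $4 != "ok" && $4 != "na" {
            print $1 ": " $2 " " $3 " (" $4 ")"
        }
    '
}
```

Used as `ipmitool sensor | ipmi_sensor_alerts`, it prints nothing for the output above, since every threshold sensor reports `ok`.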

The system is OK. Possible statuses are: