IBM 150 User Manual

Page of 286
80
 
RS/6000 43P 7043 Models 150 and 260 Handbook
3.2.3.3  Fault Monitoring Functions
Built-in Self-Test (BIST) and Power-on Self-Test (POST) checks processor, 
L2 cache, memory and associated hardware, that are required for proper 
booting of the operating system every time the system is powered on. If a 
non-critical error is detected, or if the error(s) occur in the resources which 
can be removed from the system configuration, the booting process will 
proceed to completion. The error(s) are logged in the system non-volatile 
RAM.
Disk drive fault tracking that can alert the system administrator of potential 
disk failure before it impacts customer operation.
The AIX log facility where hardware and software failures are recorded and 
analyzed (by Error Log Analysis routine) to provide warning to the system 
administrator on the causes of system problems. This also enables IBM 
service representatives to bring along needed replacement hardware 
components when a service call is placed, thus minimizing system repair 
time.
3.2.3.4  Mutual Surveillance
The service processor can monitor the operation of the firmware during the 
boot process, and it can monitor the operating system for loss of control. It 
also allows the operating system to monitor for service processor activity. The 
service processor can take appropriate action, including calling for service, 
when it detects that the firmware or the operating system has lost control. 
Likewise, the operating system can request a service processor repair action 
if necessary.
3.2.3.5  Environmental Monitoring Functions
The following is a list of the environmental monitoring functions.
  • Temperature monitoring that increases the fan speed rotation when 
ambient temperature is above the normal operating range
  • Temperature monitoring to warn the system administrator of potential 
environmental related problems (for example, air conditioning and air 
circulation around the system) so that appropriate corrective actions can 
be taken before a critical failure threshold is reached, and to provide 
orderly system shutdown when operating temperature exceeds the critical 
level
  • Fan speed monitoring to provide warning and an orderly system shutdown 
when the speed is out of operational specification
  • DC voltages monitoring to provide warning and an orderly system 
shutdown when the voltage(s) are out of operational specification