Cisco UCS B250 M2 Extended Memory Blade Server Troubleshooting Guide
UCS Memory Error Management
UCS Enhanced Memory Error Management
Uncorrectable memory errors are reported by the BIOS, which creates an entry in the CIMC system event log (SEL). A single
uncorrectable error results in a fault for both the DIMM and the server, each showing an overall status of
“Degraded”. The status details for an individual memory module show that a DIMM requires replacement once it
encounters an uncorrectable ECC error (see Figure 2).
Figure 2: Uncorrectable ECC errors require replacement of the module; thus, DIMM overall status is reported
as “Degraded”
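The status rule described above can be sketched as a small model. This is an illustration only, not UCSM code: the class names, the slot labels, and the "Operable" label for a healthy component are assumptions made for the example. Only the "Degraded" status and the single-error threshold come from the text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Dimm:
    location: str                  # hypothetical slot label, e.g. "A1"
    uncorrectable_errors: int = 0  # uncorrectable ECC errors logged for this DIMM

    @property
    def overall_status(self) -> str:
        # A single uncorrectable error is enough to degrade the DIMM.
        # "Operable" is an assumed label for the healthy state.
        return "Degraded" if self.uncorrectable_errors > 0 else "Operable"


@dataclass
class Server:
    dimms: List[Dimm] = field(default_factory=list)

    @property
    def overall_status(self) -> str:
        # The server is faulted together with any degraded DIMM.
        if any(d.overall_status == "Degraded" for d in self.dimms):
            return "Degraded"
        return "Operable"

    def dimms_to_replace(self) -> List[str]:
        # Any DIMM with an uncorrectable ECC error is a replacement candidate.
        return [d.location for d in self.dimms if d.uncorrectable_errors > 0]
```

In this sketch, incrementing `uncorrectable_errors` on one DIMM flips both that DIMM and the server to "Degraded", which mirrors the behavior described for the DIMM and server faults.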
DIMM Blacklisting
UCSM version 2.2 introduced DIMM Blacklisting, an opt-in feature that helps prevent repeated uncorrectable memory
errors. When the feature is enabled, UCSM "blacklists" any DIMM that encounters an uncorrectable error during
OS runtime. This prevents repeated crashes caused by further uncorrectable errors on the same DIMM
before troubleshooting or corrective maintenance can take place. DIMM Blacklisting is enabled in the default
Global Memory Policy (see Figure 3).
Figure 3: Blacklisting enablement in Global Memory Policy
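The blacklisting behavior can be pictured with a minimal sketch. It is a toy model, not the UCSM implementation or API: `MemoryPolicy`, `BlacklistTracker`, and the slot labels are hypothetical names, and treating a blacklisted DIMM as excluded from the usable set is an assumption about how the repeated crashes are avoided.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class MemoryPolicy:
    # Blacklisting is opt-in, so this hypothetical policy defaults to disabled.
    blacklisting_enabled: bool = False


@dataclass
class BlacklistTracker:
    policy: MemoryPolicy
    blacklisted: Set[str] = field(default_factory=set)

    def record_uncorrectable_error(self, dimm_id: str, during_os_runtime: bool) -> bool:
        """Record an uncorrectable error; return True if the DIMM is blacklisted."""
        # Only errors seen during OS runtime trigger blacklisting,
        # and only when the policy has been enabled.
        if self.policy.blacklisting_enabled and during_os_runtime:
            self.blacklisted.add(dimm_id)
        return dimm_id in self.blacklisted

    def usable_dimms(self, installed: List[str]) -> List[str]:
        # Assumption: a blacklisted DIMM is kept out of service until it is serviced.
        return [d for d in installed if d not in self.blacklisted]
```

With the policy disabled, `record_uncorrectable_error` never blacklists anything, which reflects the opt-in nature of the feature.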