Folks,
I have 5 machines getting the following errors:
here's what mcelog spat out
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 0 data cache bit57 = processor context corrupt
bit59 = misc error valid
bit61 = error uncorrected
bit62 = error overflow (multiple errors)
bus error 'generic participation, request timed out
generic error mem transaction
generic access, level generic'
STATUS fa00000000070f0f MCGSTATUS 0
MCE 1
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 4 0 data cache bit57 = processor context corrupt
bit59 = misc error valid
bit61 = error uncorrected
bit62 = error overflow (multiple errors)
bus error 'local node observed, request didn't time out
generic error mem transaction
generic access, level generic'
STATUS fa00001000020c0f MCGSTATUS 0
What's going on here? Do i have bad memory or CPUs are heating up?
Can someone shine a light on this? Please help