Re: amd64 sata_nv (massive) memory corruption

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Robert Hancock
Date: Monday, August 4, 2008 - 11:36 pm

Linas Vepstas wrote:

EDAC is mainly useful for detecting non-fatal problems (i.e. things like 
  corrupted transfers that were detected and retried, or ECC errors that 
could be corrected) which might indicate a problem but might go 
unnoticed otherwise. Usually, fatal problems that get detected by 
hardware wouldn't be unnoticeable - they would typically raise NMIs and 
cause funny kernel messages, cause machine check exceptions or just lock 
up or reset the machine.

Of course, if you don't have ECC memory, and you have bad RAM or memory 
timing problems, nothing can detect this at all, and EDAC wouldn't help you.


I think just about all chipsets (more specifically, memory controllers, 
this includes AMD CPUs) support ECC and use the bits, at least if all 
the installed memory supports it. I've never heard of a board that 
couldn't handle ECC at all.

--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Re: amd64 sata_nv (massive) memory corruption, Robert Hancock, (Sun Aug 3, 8:22 pm)
Re: amd64 sata_nv (massive) memory corruption, Linas Vepstas, (Mon Aug 4, 10:29 pm)
Re: amd64 sata_nv (massive) memory corruption, Robert Hancock, (Mon Aug 4, 11:36 pm)
Re: amd64 sata_nv (massive) memory corruption, Alan Cox, (Tue Aug 5, 5:29 am)