Re: ECC and DMA to/from disk controllers

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Alan Cox <alan@...>, linux-os (Dick Johnson) <linux-os@...>, Robert Hancock <hancockr@...>
Cc: Linux Kernel Mailing List <linux-kernel@...>, Bruce Allen <bruce.allen@...>
Date: Wednesday, September 12, 2007 - 11:37 pm

Alan, Robert, Dick,

Thank you all for the informed and helpful response!

Alan, I'll pass your comments on to Peter Kelemen.  Not sure if he follows 
LKML.  I think he'll be interested in your characterization of the error 
types.  I'll point him to the thread.  (I think Peter and his 
collaborators are fairly aware of the undetected error rates in standard 
ethernet TCP/IP traffic which as I recall is about one undetected 
single-bit error per 4TB transfered.  I am pretty sure they have ruled 
this out since they have checksums computed after any network transfers.)

Robert, Dick, if I have understood correctly, in response to my specific 
question, RAID controllers on PCI cards will DMA data into memory over a 
PCI bus using one parity bit per 32 data bits for protection.  This does 
provide some protection against errors in the data transfer, but much less 
protection than typical RAM ECC which has one ECC byte for each eight data 
bytes. As I recall, many older motherboards disabled parity on the PCI 
bus, so even this protection may be inactive in many cases. From a few 
minutes of on-line research, I have the impression that PCI-e has better 
ECC protection against address/data errors than PCI but I am not certain.

Thanks again!

Cheers,
 	Bruce
-
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Re: ECC and DMA to/from disk controllers, Robert Hancock, (Mon Sep 10, 9:09 pm)
Re: ECC and DMA to/from disk controllers, Bruce Allen, (Wed Sep 12, 11:37 pm)