Dear LKML,

Apologies in advance for potential misuse of LKML, but I don't know where else to ask.

An ongoing study on datasets of several petabytes has shown that 'silent data corruption' can occur at rates much larger than one might naively expect from the expected error rates in RAID arrays and the expected probability of single-bit uncorrected errors in hard disks. The origin of this data corruption is still unknown. See for example

http://cern.ch/Peter.Kelemen/talk/2007/kelemen-2007-C5-Silent_Corruptions.pdf

In thinking about this, I began to wonder about the following. Suppose that a (possibly RAID) disk controller correctly reads data from disk and has correct data in the controller memory and buffers. However, when that data is DMA'd into system memory, some errors occur (cosmic rays, electrical noise, etc.). Am I correct that these errors would NOT be detected, even on a 'reliable' server with ECC memory? In other words, the ECC bits would be calculated in server memory based on incorrect data from the disk.

The alternative is that disk controllers (or at least ones that are meant to be reliable) DMA both the data AND the ECC byte into system memory, so that if an error occurs in this transfer, it would most likely be picked up and corrected by the ECC mechanism. But I don't think that 'this is how it works'.

Could someone knowledgeable please confirm or contradict?

Cheers,
Bruce
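
P.S. To make the scenario concrete, here is a toy sketch in Python. It is only an illustration under loud assumptions: `parity_bits` is a hypothetical stand-in for memory ECC (one parity bit per byte, far simpler than real ECC hardware), `flip_bit` simulates a single-bit error during the transfer, and the CRC32 plays the role of a check value that travels with the data from the controller. It does not model any real controller or DMA engine.

```python
import zlib


def parity_bits(data: bytes) -> bytes:
    """Hypothetical stand-in for memory ECC: one parity bit per byte."""
    return bytes(bin(b).count("1") & 1 for b in data)


def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with a single bit inverted."""
    buf = bytearray(data)
    buf[bit_index // 8] ^= 1 << (bit_index % 8)
    return bytes(buf)


# Data as the controller holds it after a correct read from disk.
controller_buffer = b"payload read correctly from disk"
# Check value computed at the source, before the transfer.
source_crc = zlib.crc32(controller_buffer)

# One bit flips while the data is on its way into system memory.
system_memory = flip_bit(controller_buffer, 13)

# The memory check bits are computed from whatever arrived, so they are
# consistent with the already-corrupted data.
stored_parity = parity_bits(system_memory)
memory_check_passes = parity_bits(system_memory) == stored_parity

# A check value carried along from the source does notice the change.
end_to_end_check_passes = zlib.crc32(system_memory) == source_crc

print(memory_check_passes, end_to_end_check_passes)
```

In this toy model the memory-side check passes while the source-side check fails, which is exactly the situation I am asking about: check bits generated after the transfer can only protect the data from that point on.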
