Re: Hardware Error Kernel Mini-Summit

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Nils Carlson
Date: Monday, June 14, 2010 - 12:47 pm

On Jun 14, 2010, at 1:49 PM, Andi Kleen wrote:

A lot of core edac doesn't reflect modern motherboards it's true.

I do have motherboard schematics, or rather, we build our own
boards. But the point is valid, a lot of people don't make their own
hardware. On the other hand, the people who do use this part of
EDAC perhaps aren't your typical home computer users?


This is true, and this is the way things are going on
our end as well. I guess that would mean
one driver that hooks into all frameworks though?
So you wouldn't go to the EDAC sysfs directory
to find everything to do with the same piece of hardware
anymore, but would have to go the n different
directories looking for all the pieces? I don't really
like that...


But all new hardware will look the way the hardware
designers want it to, so our interface will be a moving
target? Maybe it's time to let hardware makers provide
a board specification with device tree and memory
layout? (Pure speculation)

There is a use-case. A lot has to do with how different patrol
scrub rates work, some just go through memory at a constant
speed (MB/s), others vary according to load. The thing is,
different applications want their memory scrubbed within
different time frames, and as the amount of memory on boards
varies and the bios doesn't vary this implies the need for setting
scrub rate from userspace.

Patrol scrubbing is normally used because it discovers errors
faster in seldom accessed memory allowing a DIMM with
too many errors to be replaced faster. Some applications
like to use demand scrubbing as well, and some consider
it to increase memory latency too much.

<snip>

Oh, a hodge podge is much more than just single bit
correctable error reporting... :-) You never know what
you'll find in the sysfs directory for a given memory
controller.

/Nils Carlson
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Mon May 17, 11:23 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon May 17, 3:41 pm)
Re: Hardware Error Kernel Mini-Summit, Hidetoshi Seto, (Mon May 17, 11:52 pm)
Re: Hardware Error Kernel Mini-Summit, Borislav Petkov, (Tue May 18, 6:06 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Tue May 18, 9:44 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Tue May 18, 9:50 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Tue May 18, 9:52 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Tue May 18, 10:06 am)
Re: Hardware Error Kernel Mini-Summit, Joe Perches, (Tue May 18, 10:42 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Tue May 18, 10:59 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue May 18, 11:10 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue May 18, 11:45 am)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 11:53 am)
Re: Hardware Error Kernel Mini-Summit, Joe Perches, (Tue May 18, 11:57 am)
RE: Hardware Error Kernel Mini-Summit, Luck, Tony, (Tue May 18, 12:08 pm)
Re: Hardware Error Kernel Mini-Summit, Borislav Petkov, (Tue May 18, 12:18 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 12:30 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 12:34 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 1:42 pm)
Re: Hardware Error Kernel Mini-Summit, Tony Luck, (Tue May 18, 2:37 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 3:00 pm)
Re: Hardware Error Kernel Mini-Summit, Eric W. Biederman, (Tue May 18, 3:14 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue May 18, 3:28 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 3:29 pm)
Re: Hardware Error Kernel Mini-Summit, Eric W. Biederman, (Tue May 18, 6:14 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Tue May 18, 11:39 pm)
Re: Hardware Error Kernel Mini-Summit, Borislav Petkov, (Tue May 18, 11:46 pm)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Wed May 19, 12:09 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Wed May 19, 2:03 am)
Re: Hardware Error Kernel Mini-Summit, Mauro Carvalho Chehab, (Wed May 19, 4:54 am)
Re: Hardware Error Kernel Mini-Summit, Tony Luck, (Wed May 19, 10:30 am)
Re: Hardware Error Kernel Mini-Summit, Ingo Molnar, (Thu May 20, 5:37 am)
Re: Hardware Error Kernel Mini-Summit, Russ Anderson, (Mon May 24, 8:55 am)
Re: Hardware Error Kernel Mini-Summit, Russ Anderson, (Mon May 24, 9:21 am)
Re: Hardware Error Kernel Mini-Summit, Russ Anderson, (Mon May 24, 10:13 am)
Re: Hardware Error Kernel Mini-Summit, Tony Luck, (Mon May 24, 10:35 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon May 24, 11:26 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon May 24, 11:31 am)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Mon Jun 14, 3:03 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon Jun 14, 4:49 am)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Mon Jun 14, 12:47 pm)
Re: Hardware Error Kernel Mini-Summit, Eric W. Biederman, (Mon Jun 14, 1:06 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon Jun 14, 1:21 pm)
RE: Hardware Error Kernel Mini-Summit, Luck, Tony, (Mon Jun 14, 1:21 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon Jun 14, 1:36 pm)
Re: Hardware Error Kernel Mini-Summit, Tony Luck, (Mon Jun 14, 2:34 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon Jun 14, 11:44 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Mon Jun 14, 11:56 pm)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Tue Jun 15, 1:06 am)
Re: Hardware Error Kernel Mini-Summit, Borislav Petkov, (Tue Jun 15, 3:01 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue Jun 15, 4:41 am)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Tue Jun 15, 5:21 am)
RE: Hardware Error Kernel Mini-Summit, Luck, Tony, (Tue Jun 15, 11:15 am)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Tue Jun 15, 11:38 am)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue Jun 15, 12:35 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Tue Jun 15, 12:37 pm)
Re: Hardware Error Kernel Mini-Summit, Nils Carlson, (Tue Jun 15, 1:48 pm)
Re: Hardware Error Kernel Mini-Summit, Tony Luck, (Tue Jun 15, 3:33 pm)
Re: Hardware Error Kernel Mini-Summit, Andi Kleen, (Wed Jun 16, 2:40 am)