Re: [patch] ext2/3: document conditions when reliable operation is possible

From: Theodore Tso
Date: Thursday, August 27, 2009 - 5:24 am

On Thu, Aug 27, 2009 at 12:19:02AM -0500, Rob Landley wrote:

I'm not convinced that information which needs to be known by System
Administrators is best documented in the kernel Documentation
directory.  Should there be a HOWTO document on stuff like that?
Sure, if someone wants to put something like that together, having
free documentation about ways to set up your storage stack in a sane
way is not a bad thing.  

It should be noted that these sorts of issues are discussed in various
books targeted at System Administrators, and in Usenix's System
Administration tutorials.  The computer industry is highly
specialized, and so just because an OS kernel hacker might not be
familiar with these issues, doesn't mean that professionals whose job
it is to run data centers don't know about these things!  Similarly,
you could be a whiz at Linux's networking stack, but you might not
know about certain pitfalls in configuring a Cisco router using IOS;
does that mean we should have an IOS tutorial in the kernel
documentation directory?  I'm not so sure about that!


Sure, but the fact that we don't currently say much about storage
stacks doesn't mean we should accept a patch that might actively
mislead people.   I'm NACK'ing the patch on that basis.


Sounds like they were using really cheap UPSes; certainly not the kind
I would expect to find in a data center.  And if a company's system
administrator is using the cheapest possible consumer-grade UPSes,
then yes, they might have a problem.  Even an educational institution
like MIT, where I was a network administrator some 15 years ago, had
proper UPSes, *and* we had a diesel generator which kicked in after 15
seconds --- and we tested the diesel generator every Friday morning,
to make sure it worked properly.


Even my home RAID array uses hot-plug SATA disks, so I can replace a
failed disk without shutting down my system.  (And yes, I have a
backup battery for the hardware RAID, and the firmware runs periodic
tests on it; the hardware RAID card also will send me e-mail if a RAID
array drive fails and it needs to use my hot-spare.  At that point, I
order a new hard drive, secure in the knowledge that the system can
still suffer another drive failure before falling into degraded mode.
And no, this isn't some expensive enterprise RAID setup; this is just
a mid-range Areca RAID card.)


If you want to document that as a property of RAID arrays, sure.  But
it's not something that should live in Documentation/filesystems/ext2.txt
and Documentation/filesystems/ext3.txt.  The MD RAID howto might be a
better place, since it's far more likely that users will read it.  How
many system administrators read what's in the kernel's Documentation
directory, after all?  This is basic information about how RAID
works; it's not necessarily something that someone would *expect* to
be in kernel documentation, nor would they necessarily go looking for
it there.  And the reality is that it's not like most people go reading
Documentation/* for pleasure.  :-)

BTW, the RAID write atomicity issue and the possibility that failures
cause data loss *is* documented in the Wikipedia article on RAID.
It's not written as direct practical advice to a system
administrator (you'd have to go to a book that is really targeted at
system administrators to find that sort of thing).

       		      	      	   	   - Ted