Re: [PATCH -mm 13/25] Noreclaim LRU Infrastructure

From: Andrew Morton
Date: Sunday, June 8, 2008 - 11:10 pm

On Sun, 8 Jun 2008 20:56:29 -0400 Rik van Riel <riel@redhat.com> wrote:


Well.  Intrusiveness doesn't matter much.  But no, you're dead wrong -
this stuff is far more risky than timer changes.  Because things like
the timer changes are trivial to detect errors in - it either works or
it doesn't.

Whereas reclaim problems can take *years* to identify and are often
very hard for the programmers to understand, reproduce and diagnose.


Earlier discussion about removing these pages from ALL LRUs reached a
quite detailed stage, but nobody seemed to finish any code.


We shouldn't rewrite core VM to cater for incorrectly configured
systems.


I see handwavy non-bug-reports loosely associated with a vast pile of
code and vague expressions of hope that one will fix the other.

Where's the meat in this, Rik?  This is engineering.

Do you or do you not have a test case which demonstrates this problem? 
It doesn't sound terribly hard.  Where are the before-and-after test
results?


I'd love to see hard evidence that they have been.  And that doesn't
mean getting palmed off on wikis and random blog pages.

Also, it is incumbent upon us to consider the other design proposals,
such as removing anon pages from the LRUs and removing mlocked pages
from the LRUs.


That actually isn't a fundamental problem.

It _becomes_ a problem because we try to treat the two types of pages
differently.

Stupid question: did anyone try setting swappiness=100?  What happened?
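
For anyone wanting to run that experiment, here is a minimal sketch of
setting the knob from C.  It assumes the standard Linux sysctl file
/proc/sys/vm/swappiness; the helper names set_swappiness() and
get_swappiness() are hypothetical, and the 0..100 bound is an assumption
that matches kernels of this era.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>

/* Standard sysctl file on Linux; a scratch file can be passed for testing. */
#define SWAPPINESS_PATH "/proc/sys/vm/swappiness"

/*
 * Write `value` to the swappiness file at `path`.
 * Returns 0 on success, -1 on failure with errno set.
 * The 0..100 range check is an assumption, not taken from the thread.
 */
int set_swappiness(const char *path, int value)
{
    FILE *f;
    int ok;

    assert(path != NULL);
    if (value < 0 || value > 100) {
        errno = EINVAL;
        return -1;
    }
    f = fopen(path, "w");
    if (!f)
        return -1;      /* usually a permission error unless run as root */
    ok = fprintf(f, "%d\n", value) > 0;
    if (fclose(f) != 0)
        ok = 0;
    return ok ? 0 : -1;
}

/* Read the current value back; returns -1 if it cannot be read. */
int get_swappiness(const char *path)
{
    FILE *f;
    int value = -1;

    assert(path != NULL);
    f = fopen(path, "r");
    if (!f)
        return -1;
    if (fscanf(f, "%d", &value) != 1)
        value = -1;
    fclose(f);
    return value;
}
```

Calling set_swappiness(SWAPPINESS_PATH, 100) as root should have the same
effect as running "sysctl vm.swappiness=100"; the before-and-after reclaim
behaviour still has to be measured separately.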


Well for starters that system was grossly misconfigured.  It is
incumbent upon you, in your design document (that thing we call a
changelog) to justify why the VM design needs to be altered to cater
for such misconfigured systems.  It just drives me up the wall having
to engage in a 20-email discussion to be able to squeeze these little
revelations out.  Only to have them lost again later.

Secondly, I expect that removal of mlocked pages from the LRU (as was
discussed a year or two ago and perhaps implemented by Andrea) along
with swappiness=100 might get us towards a fix.  Don't know.


Unless I missed it, none of your patch descriptions even attempt to
describe these fundamental issues.  It's all buried in 20-deep email
threads.


I see no fundamental reason why we need to put mlocked or SHM_LOCKED
pages onto a VM LRU at all.
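
To make the cost concrete, here is a toy user-space model, not kernel
code.  It counts how many list entries a reclaim pass has to examine to
find a given number of reclaimable pages, first with locked pages left
on the list and then with them kept off it.  struct toy_page, toy_scan()
and toy_strip_locked() are hypothetical names invented for this sketch.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical toy page: only the one flag this model cares about. */
struct toy_page {
    int locked;     /* 1 = mlocked or SHM_LOCKED, cannot be reclaimed */
};

/*
 * Walk `lru` from index 0 (treated as the coldest entry) and count how
 * many entries must be examined to find `want` reclaimable pages.
 * The number of reclaimable pages actually found is stored in *found.
 */
size_t toy_scan(const struct toy_page *lru, size_t n, size_t want,
                size_t *found)
{
    size_t scanned = 0, got = 0;

    assert(lru != NULL && found != NULL);
    while (scanned < n && got < want) {
        if (!lru[scanned].locked)
            got++;
        scanned++;
    }
    *found = got;
    return scanned;
}

/*
 * Build the list that a "locked pages never go on the LRU" design would
 * scan: copy only the unlocked pages into `out` (capacity >= n).
 * Returns the length of the stripped list.
 */
size_t toy_strip_locked(const struct toy_page *lru, size_t n,
                        struct toy_page *out)
{
    size_t i, m = 0;

    assert(lru != NULL && out != NULL);
    for (i = 0; i < n; i++)
        if (!lru[i].locked)
            out[m++] = lru[i];
    return m;
}
```

In this model every locked page sitting ahead of a reclaimable one is
pure scanning overhead, and stripping them out removes that overhead.
The model says nothing about the bookkeeping needed to put pages back
when they are unlocked, which is where the real design work lies.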

One cause of problems is that we attempt to prioritise anon pages over
file-backed pagecache.  And we prioritise mmapped pages, which your patches
don't address, do they?  Stopping doing that would, I expect, prevent a
range of these problems.  It would introduce others, probably.


-mm isn't a very useful testing place any more, I'm afraid.  The
patches would be better off in linux-next, but then they would screw up
all the other pending MM patches, and it's probably a bit early for
getting them into linux-next.

Once I get sections of -mm feeding into linux-next, things will be better.


That helps.