login
Login
/
Register
Search
Search this site:
Forums
News
Blogs
Features
Site
Home
»
Mailing list archives
»
linux-kernel
»
2010
»
May
»
3
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
view
thread
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
[view in full thread]
From: Peter Zijlstra
Subject:
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
Date: Monday, May 3, 2010 - 9:55 am
On Mon, 2010-05-03 at 12:18 -0400, Rik van Riel wrote:
quoted text
> From: Rik van Riel <riel@redhat.com> > > Both the page migration code and the transparent hugepage patches expect > 100% reliable rmap lookups and use page_lock_anon_vma(page) to prevent > races with mmap, munmap, expand_stack, etc. > > Specifically, try_to_unmap indirectly calls vma_address, which uses the > difference between vma->vm_start and vma->vm_pgoff, which can race when a > stack is expanded downwards. VMA splitting and merging present similar > issues. > > With the new anon_vma code, one VMA can be attached to multiple anon_vmas, > however mmap, munmap, expand_stack and others only took one anon_vma->lock. > This patch changes things so we take the anon_vma locks for all of the > anon_vmas attached to a VMA in the code that try_to_unmap would otherwise > race against: mmap, munmap, expand_stack, etc. > > Unfortunately, this leads to a lock ordering conflict with the page_table_lock, > which protected the "same_vma" list in the anon_vma_chain. Replacing that > lock with a new lock (mm->anon_vma_chain_lock), which is taken higher up in > the mm locking hierarchy, solves that issue. This changes the locking rules > for the "same_vma" list to be either mm->mmap_sem for write, or mm->mmap_sem > for read plus the new mm->anon_vma_chain lock. This limits the place where > the new lock is taken to 2 locations - anon_vma_prepare and expand_downwards. > > Document the locking rules for the same_vma list in the anon_vma_chain and > remove the anon_vma_lock call from expand_upwards, which does not need it.
quoted text
> diff --git a/mm/mmap.c b/mm/mmap.c > index 456ec6f..81850fc 100644 > --- a/mm/mmap.c > +++ b/mm/mmap.c
quoted text
> @@ -1705,12 +1707,11 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > return -EFAULT; > > /* > - * We must make sure the anon_vma is allocated > - * so that the anon_vma locking is not a noop. > + * Unlike expand_downwards, we do not need to take the anon_vma lock, > + * because we leave vma->vm_start and vma->pgoff untouched. > + * This means rmap lookups of pages inside this VMA stay valid > + * throughout the stack expansion. > */ > - if (unlikely(anon_vma_prepare(vma))) > - return -ENOMEM; > - anon_vma_lock(vma); > > /* > * vma->vm_start/vm_end cannot change under us because the caller > @@ -1721,7 +1722,6 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > if (address < PAGE_ALIGN(address+4)) > address = PAGE_ALIGN(address+4); > else { > - anon_vma_unlock(vma); > return -ENOMEM; > } > error = 0; > @@ -1737,7 +1737,6 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > if (!error) > vma->vm_end = address; > } > - anon_vma_unlock(vma); > return error; > } > #endif /* CONFIG_STACK_GROWSUP || CONFIG_IA64 */
This does leave me worrying about concurrent faults poking at vma->vm_end without synchronization. --
unsubscribe notice
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to
majordomo@vger.kernel.org
More majordomo info at
http://vger.kernel.org/majordomo-info.html
Please read the FAQ at
http://www.tux.org/lkml/
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
Messages in current thread:
[PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 9:18 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 9:41 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 9:53 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Peter Zijlstra
, (Mon May 3, 9:55 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 10:02 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Peter Zijlstra
, (Mon May 3, 10:11 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 10:17 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 10:18 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 10:58 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 11:13 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 11:19 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 11:38 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Mel Gorman
, (Tue May 4, 6:12 am)
Navigation
Mailing list archives
Recent posts
Popular discussions
linux-kernel
:
Mel Gorman
Re: [PATCH 1/4] vmstat: remove zone->lock from walk_zones_in_node
Guenter Roeck
Re: [lm-sensors] Location for thermal drivers
David Woodhouse
Re: RFC: Moving firmware blobs out of the kernel.
Siddha, Suresh B
Re: [PATCH 2.6.21 review I] [11/25] x86: default to physical mode on hotplug CPU k...
Peter Zijlstra
Re: [patch 4/6] mm: merge populate and nopage into fault (fixes nonlinear)
git-commits-head
:
Linux Kernel Mailing List
[MIPS] Fix potential latency problem due to non-atomic cpu_wait.
Linux Kernel Mailing List
USB: rename USB_SPEED_VARIABLE to USB_SPEED_WIRELESS
Linux Kernel Mailing List
lib/vsprintf.c: fix bug omitting minus sign of numbers (module_param)
Linux Kernel Mailing List
[Bluetooth] Initiate authentication during connection establishment
Linux Kernel Mailing List
[POWERPC] 4xx: Add ppc40x_defconfig
linux-netdev
:
MERCEDES
Your mail id has won 950,000.00 in the MERCEDES Benz Online Promo.for claims send:
David Miller
Re: [PATCH] xen/netfront: do not mark packets of length < MSS as GSO
David Miller
Re: skb_segment() questions
Shan Wei
[RFC PATCH net-next 2/5]IPv6:netfilter: Send an ICMPv6 "Fragment Reassembly Timeou...
Stanislaw Gruszka
[PATCH 1/4] bnx2x: use smp_mb() to keep ordering of read write operations
git
:
Nicolas Sebrecht
git-svn died of signal 11 (was "3 failures on test t9100 (svn)")
Junio C Hamano
Re: [PATCH 2/2] Add url.<base>.pushInsteadOf: URL rewriting for push only
Martin Langhoff
Re: [PATCH] GIT commit statistics.
Alexandre Julliard
[PATCH] gitweb: Put back shortlog instead of graphiclog in the project list.
Josh Triplett
[PATCH 2/2] Add url.<base>.pushInsteadOf: URL rewriting for push only
openbsd-misc
:
Taisto Qvist XX
Re: AMD GEODE LX-800 just works with kernel from install42.iso and kernelpanics wi...
Nico Meijer
Re: gOS Develop Kit with VIA pc-1 Processor Platform VIA C7-D
Andreas Bihlmaier
Re: jetway board sensors (Fintek F71805F)
admin
Drive a 2009 car from R799p/m
Antti Harri
Re: how to create a sha256 hash
Colocation donated by:
Syndicate