In the speculative page cache lookup protocol, page_count(page) is set to 0
while a radix-tree modification is in progress (truncation, migration, etc.).

During page migration, a page fault on a page under migration does the
following:
 - look up the page table
 - find that the entry is a migration_entry_pte
 - decode the pfn from the migration_entry_pte and get the page via
   pfn_to_page(pfn)
 - wait until the page is unlocked

It currently does get_page() -> wait_on_page_locked() -> put_page().

In page migration's radix-tree replacement, page_freeze_refs() ->
page_unfreeze_refs() is called. page_count(page) drops to zero and must
stay zero for the duration of the radix-tree replacement.

If get_page() is called against a page under radix-tree replacement, the
kernel panics. To avoid this, we must not increment page_count() if it is
zero. This patch uses get_page_unless_zero().

Even if get_page_unless_zero() fails, the caller just retries the fault,
at the cost of being a bit busier.

Change log v1->v2:
 - rewrote the patch description and added comments.

From: Daisuke Nishimura <nishimura@mxp.nes.nec.co.jp>
Signed-off-by: Daisuke Nishimura <nishimura@mxp.nes.nec.co.jp>
Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>

---
 mm/migrate.c |   11 +++++++++--
 1 file changed, 9 insertions(+), 2 deletions(-)

Index: test-2.6.26-rc5-mm3/mm/migrate.c
===================================================================
--- test-2.6.26-rc5-mm3.orig/mm/migrate.c
+++ test-2.6.26-rc5-mm3/mm/migrate.c
@@ -242,8 +242,15 @@ void migration_entry_wait(struct mm_stru
 		goto out;
 
 	page = migration_entry_to_page(entry);
-
-	get_page(page);
+	/*
+	 * Once radix-tree replacement of page migration started, page_count
+	 * *must* be zero. And, we don't want to call wait_on_page_locked()
+	 * against a page without get_page().
+	 * So, we use get_page_unless_zero(), here. Even failed, page fault
+	 * will occur again.
+	 */
+	if (!get_page_unless_zero(page))
+		goto out;
 	pte_unmap_unlock(ptep, ptl);
 	wait_on_page_locked(page);
 	put_page(page);
--
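
For readers unfamiliar with the "increment unless zero" idiom the patch
relies on, here is a minimal userspace sketch using C11 atomics. It is not
the kernel implementation: struct fake_page, fake_get_page_unless_zero()
and fake_put_page() are hypothetical names invented for this illustration,
and only the reference count is modelled. A count of zero stands for a page
whose references are frozen during radix-tree replacement.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for struct page; only the refcount is modelled. */
struct fake_page {
	atomic_int count;
};

/*
 * Take a reference only if the count is currently non-zero.
 * Returns true if the reference was taken, false if the count was 0
 * (i.e. the page is frozen and must not be touched).
 */
static bool fake_get_page_unless_zero(struct fake_page *page)
{
	int old = atomic_load(&page->count);

	while (old != 0) {
		/* On failure, 'old' is reloaded with the current value. */
		if (atomic_compare_exchange_weak(&page->count, &old, old + 1))
			return true;
	}
	return false;
}

/* Drop a reference previously taken; the count must be positive. */
static void fake_put_page(struct fake_page *page)
{
	int old = atomic_fetch_sub(&page->count, 1);

	assert(old > 0);
}
```

With a frozen count the helper leaves the count at zero and reports
failure, which corresponds to the "goto out" path in the hunk above: the
fault handler returns and the fault is simply taken again later.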
