Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable review

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Linus Torvalds <torvalds@...>
Cc: Pavel Machek <pavel@...>, Romano Giannetti <romanol@...>, Chris Wright <chrisw@...>, Chuck Ebbert <cebbert@...>, Linux Kernel Mailing List <linux-kernel@...>, <stable@...>, Justin Forbes <jmforbes@...>, Zwane Mwaikambo <zwane@...>, Theodore Ts'o <tytso@...>, Randy Dunlap <rdunlap@...>, Dave Jones <davej@...>, Chuck Wolber <chuckw@...>, Chris Wedgwood <reviews@...>, Michael Krufky <mkrufky@...>, <akpm@...>, <alan@...>, Rafael J. Wysocki <rjw@...>
Date: Thursday, May 24, 2007 - 9:55 pm

Hi Linus.

On Thu, 2007-05-24 at 17:37 -0700, Linus Torvalds wrote:
e=20

Let me see if I can help. I'll probably fail miserably, but I can only
try :)

First, let me agree with you that for the atomic copy itself, the
freezer is unnecessary. Disabling irqs and so on is enough to ensure the
atomic copy is atomic. I don't think any of us are arguing with you
there.

Where we see the problem is with what happens after the atomic copy is
made. The problem is that the atomic copy includes struct inodes, dnodes
and such like - an in memory representation of the state of mounted
filesystems. Imagine that, post atomic copy, we don't have the freezer.
Processes can then make on-disk changes to these mounted filesystems in
the time before we complete saving the image and powering down. If, at
resume time, we then restore the atomic copy, we have a mismatch between
what the in-memory data structures say and what the on-disk data says.
This leads to corruption.

How to avoid?

Well, there are only two options as far as I can see. We either stop
those changes occurring in the first place, or we make them undoable.

Freezing processes, and/or filesystems would be the first path,
checkpointing the second.

So, as far as I can see, we're stuck with freezing processes at least
until checkpointing is implemented.

I have to admit though, that even if checkpointing was implemented, I'd
like to see freezing processes remain. The image gets written faster if
we don't have to compete for cpu and i/o. It also allows us to do a
fuller image of memory than is otherwise possible (Yes, I know some
people don't care for full images, but others of us have usage patterns
that make the system far more useable if a full image is kept, or simply
prefer to have our machines as if the power had never gone away).
Without processes freezing, I'd have to work a lot harder to find a way
to do that full image. The simplest way would probably be to carry the
freezer code myself. (Yeah, I'm reconciled to the idea of never getting
Suspend2 merged. I'd like it to happen, but won't hold my breath.
Someone needs to break your suspend-to-ram or battery so you see the use
for hibernation :>).

Hope this helps.

Nigel
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Re: [patch 00/69] -stable review, Linus Torvalds, (Tue May 22, 6:07 pm)
Re: [patch 00/69] -stable review, Romano Giannetti, (Thu May 24, 8:06 am)
Re: [patch 00/69] -stable review, Linus Torvalds, (Thu May 24, 12:16 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Thu May 24, 5:03 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Henrique de Moraes Holschuh..., (Thu May 24, 7:16 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Fri May 25, 7:44 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Sun May 27, 2:32 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Mon May 28, 4:11 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Sun May 27, 3:09 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Tue May 29, 8:00 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Michael-Luke Jones, (Tue May 29, 8:00 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Tue May 29, 8:16 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Nigel Cunningham, (Thu May 24, 9:55 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Nigel Cunningham, (Thu May 24, 10:20 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Nigel Cunningham, (Thu May 24, 11:00 pm)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Nigel Cunningham, (Fri May 25, 12:33 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Wed May 30, 10:04 am)
Re: pcmcia resume 60 second hang. Re: [patch 00/69] -stable ..., Rafael J. Wysocki, (Wed May 30, 10:08 am)
Long delay in resume from RAM (Was Re: [patch 00/69] -stable..., Romano Giannetti, (Thu May 24, 11:06 am)
Re: Long delay in resume from RAM (Was Re: [patch 00/69] -st..., Johannes Stezenbach, (Thu May 24, 6:30 pm)
Re: [patch 00/69] -stable review, Romano Giannetti, (Wed May 23, 3:09 am)
Re: [patch 00/69] -stable review, Paolo Ornati, (Wed May 23, 3:19 am)
Re: [patch 00/69] -stable review, Romano Giannetti, (Wed May 23, 3:38 am)