Re: 2.6.28-rc3-git1: spitz still won't boot

Previous thread: schedule tracing by Steven Rostedt on Friday, October 31, 2008 - 8:23 am. (3 messages)

Next thread: Measuring boot time on beagle board with PRINTK_TIME by Shivdas Gujare on Friday, October 31, 2008 - 8:33 am. (2 messages)
From: Pavel Machek
Date: Friday, October 31, 2008 - 8:06 am

Hi!

...unfortunately, 2.6.27 contains the 'reset floating' bug, and from
the point 'reset floating' is fixed, to 2.6.28-rc1, tree will not
compile, so bisect is quite hard to do. Any ideas?

									Pavel 
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Rafael J. Wysocki
Date: Friday, October 31, 2008 - 2:55 pm

Is this a regression from .27?

Rafael
--

From: Pavel Machek
Date: Monday, November 3, 2008 - 3:40 am

Yes, it is (*).
									Pavel
(*) Okay, so 2.6.27 does not boot due to the 'reset floating'
bug. 'reset floating' is fixed in 2.6.28-rc1, and compilation of spitz
is fixed in 2.6.28-rc2, but it still will not boot.

So yes, some new spitz-breaking bug got introduced. But no, 2.6.27
will not boot due to other bug. 2.6.26 boots.

-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Russell King
Date: Monday, November 3, 2008 - 4:04 am

I wouldn't say that was a bug.  That was by design of Dmitry, which
I'm far from happy with.  If an output is being used to drive a
reset line, you always want to actively drive it to the inactive

Compilation errors are only going to get worse as ARM continues to
diversify.  We're entirely reliant on sfr's linux-next project to
find build errors prior to merging, and it is unfortunate that this
time around, sfr was on vacation on the run up to the merge window.
Hence, there was very little build coverage of the changes that
went in during this last merge window.  I spent most of the merge
window fixing up all the shitty patches that people had submitted up
until that point that either broke compilation or had runtime

If 2.6.27 doesn't boot even with the reset fixes applied, there's a hell
of a lot of changes that could be the culpret (and I don't remember much
that happened before this last merge window.)  The only thing I can
suggest is to bisect with the reset fixes on top.

This is precisely where we're let down by not having a Zaurus maintainer
who knows the Zaurus issues inside out, and regularly puts effort into
building and testing kernels for these platforms.  That's a hint for
anyone who's reading this.

-- 
Russell King
 Linux kernel    2.6 ARM Linux   - http://www.arm.linux.org.uk/
 maintainer of:
--

From: Pavel Machek
Date: Monday, November 3, 2008 - 4:12 am

Well, if it breaks boot on the machine, I think it is fair to call it

No, 2.6.27 _does_ boot with reset fixes applied. (But it needs those

With the reset & compilation fixes on top... and I'll have to search
for the small patch that fixes the compilation :-((.

If you have any idea what should I try to revert between 2.6.27 and

I don't think I have enough time to really maintain Zauruses... but I
can try to help with some testing. So -next is the place that should
get more testing...?
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Pavel Machek
Date: Wednesday, November 5, 2008 - 7:23 am

It is still there in -rc3... And yes, it is a regression from 2.6.27.

									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Dmitry
Date: Wednesday, November 5, 2008 - 8:13 am

Would you be so nice to provide a boot log of 2.6.28-rc3 with
CONFIG_DEBUG_DRIVER enabled?
Thank you.

-- 
With best wishes
Dmitry
--

From: Pavel Machek
Date: Wednesday, November 5, 2008 - 10:33 am

I'd love to, but it fails with black screen :-(. (But I guess I should
mention that I'm using kexec...)

Do you have "good" config for spitz I could try? (Similar
configuration worked in 2.6.27, but maybe it is a config difference?)

									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Dmitry
Date: Wednesday, November 5, 2008 - 4:00 pm

Try disabling TOUCHSCREEN_CORGI, and enabling SPI, PXA2XX_SPI, HWMON, MAX1111.
Please post your .config anyway.

Do you have anything on the serial console, btw?

-- 
With best wishes
Dmitry
--

From: Pavel Machek
Date: Thursday, November 6, 2008 - 1:04 am

DOes not boot for me, either :-(... black screen. config attached.


I do not have serial cable :-(,
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
From: Cyril Hrubis
Date: Thursday, November 6, 2008 - 12:14 pm

Okay I have one that works for sl5500, it needs external power on
sl-cxxx but it should be easy to fix.

-- 
metan
--

From: Eric Miao
Date: Friday, November 7, 2008 - 6:23 am

Well, IIRC spitz still needs the patch to change the vmlinux.ld.S.
Did you guys ever try that?

--

From: Pavel Machek
Date: Friday, November 7, 2008 - 7:29 am

I never heard about that patch, do you have it handy?


-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Dmitry
Date: Friday, November 7, 2008 - 7:57 am

From: Russell King
Date: Friday, November 7, 2008 - 11:23 am

That doesn't look like something that should be accepted.  Take a moment
to put some thought into the question.  Why should we _allocate_ and
contain the stack in the resulting image?  Does the stack contain any
data that must be pre-initialized?

Obviously not.

-- 
Russell King
 Linux kernel    2.6 ARM Linux   - http://www.arm.linux.org.uk/
 maintainer of:
--

From: Richard Purdie
Date: Saturday, November 8, 2008 - 7:45 am

Firstly, I don't think that patch should ever make it into a mainline
kernel. I can perhaps give some clues why its required though. I think
the bootloader on the zaurus truncates the image when writing the kernel
into flash using the standard flashing process. By having that much
extra padding on the end of the kernel, nothing important is lost.

How did the original 2.4 kernels ever work? Binutils used to be buggy
and left this padding in. Nobody therefore ever noticed the bug in the
bootloader.

The above theory is guesswork based on the kernels I've seen work and
not work and I wrote the patch mentioned above as a workaround a long
time ago and more pressing issues meant I never got back to it. I'd love
to see someone work out the problem for sure!

Cheers,

Richard




--

From: Dmitry
Date: Saturday, November 8, 2008 - 4:12 pm

From what I do understand from updater.sh, the kernel is fully written to nand.
Otherwise there will be lot's more problems. Maybe w/o padding the image is
loaded too high? Can't verify ATM.

BTW: Since I'm away from my zaurii for few days, can one please send
me the whole

I did rewrote your patch in a bit cleaner way (to apply the hack to
vmlinux.lds.in
in a cleaner way and only on PXA_SHARPSL), however I'm not submitting it
till I find what's the real reason for this problem.

-- 
With best wishes
Dmitry
--

From: Russell King
Date: Saturday, November 8, 2008 - 4:21 pm

Even if the reason is found, I don't think it's something that should
concern the kernel, especially as there is a script which writes the
kernel image into the flash.  If it's a case that the boot loader needs
an extra 4K tacked on the end of the kernel image, that should be easy
enough for this 'updater.sh' shell script to do.

-- 
Russell King
 Linux kernel    2.6 ARM Linux   - http://www.arm.linux.org.uk/
 maintainer of:
--

From: Dmitry
Date: Saturday, November 8, 2008 - 4:25 pm

Actually on my tosa builds it's like 40k of zeroes (.bss is before .stack and so
is also present in the zImage). It seems that just 4k of padding isn't  enough



-- 
With best wishes
Dmitry
--

From: Russell King
Date: Saturday, November 8, 2008 - 4:28 pm

That doesn't change my view - it's something that the kernel shouldn't be
caring about.  And as I point out, this script can add the padding itself
which seems to be the right place to resolve the problem.

(and I've finally dropped that bad email address from the CC line.)

-- 
Russell King
 Linux kernel    2.6 ARM Linux   - http://www.arm.linux.org.uk/
 maintainer of:
--

From: Pavel Machek
Date: Sunday, November 9, 2008 - 4:45 am

Could you send me full .config that is known to work?

Do I need pxa-linking-bug.patch to get it to boot?
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Pavel Machek
Date: Sunday, November 9, 2008 - 4:46 am

(Note that I'm booting using kexec, not sharp bootloader).
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Dmitry
Date: Sunday, November 9, 2008 - 5:30 am

It seems that neither kexec, nor qemu require pxa-linking-bug.patch

Attached a .config that I use for testing. It contains all sharp zaurii
selected, however you can easily deselect unnecessary ones.

-- 
With best wishes
Dmitry
From: Pavel Machek
Date: Sunday, November 9, 2008 - 7:12 am

It should still boot on spitz, right? I'd rather not touch that
.config...

Thanks!
									Pavel



-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

From: Dmitry
Date: Sunday, November 9, 2008 - 7:52 am

It boots under qemu spitz emulation. No real HW of SL-CXX00 yet :(

-- 
With best wishes
Dmitry
--

From: Pavel Machek
Date: Monday, November 10, 2008 - 12:50 am

Ugh, that one took quite a while to compile. Fortunately, this one
actually boots... (up to "launching init", anyway)... so the problem
seems to be config-dependend.
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--

Previous thread: schedule tracing by Steven Rostedt on Friday, October 31, 2008 - 8:23 am. (3 messages)

Next thread: Measuring boot time on beagle board with PRINTK_TIME by Shivdas Gujare on Friday, October 31, 2008 - 8:33 am. (2 messages)