Re: [19.666764] Disabling IRQ #23

Previous thread: [PATCH 0/2] WM8400 support by Mark Brown on Wednesday, September 10, 2008 - 11:28 am. (13 messages)

Next thread: Oops/Warning report for the week of September 10th, 2008 by Arjan van de Ven on Wednesday, September 10, 2008 - 12:16 pm. (8 messages)
From: Justin Mattock
Date: Wednesday, September 10, 2008 - 12:06 pm

hello; I accidently erased the original thread
appologize for starting a new one of the same,
I've been noticing this message(below) appear
every "X" amount of boot's(maybe ten or so);
normally upon booting leaving me unable
to login.

below is dmesg of the message:

[    0.000000] Linux version 2.6.27-rc6 (root@unix) (gcc version 4.1.3
20080623 (prerelease) (Debian 4.1.2-23)) #22 SMP Tue Sep 9 18:46:38
PDT 2008
[    0.000000] BIOS-provided physical RAM map:
[    0.000000]  BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[    0.000000]  BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[    0.000000]  BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
[    0.000000]  BIOS-e820: 0000000000100000 - 000000003f0ea000 (usable)
[    0.000000]  BIOS-e820: 000000003f0ea000 - 000000003f2eb000 (ACPI NVS)
[    0.000000]  BIOS-e820: 000000003f2eb000 - 000000003febe000 (ACPI data)
[    0.000000]  BIOS-e820: 000000003febe000 - 000000003feef000 (ACPI NVS)
[    0.000000]  BIOS-e820: 000000003feef000 - 000000003ff00000 (ACPI data)
[    0.000000]  BIOS-e820: 000000003ff00000 - 0000000040000000 (reserved)
[    0.000000]  BIOS-e820: 00000000f0000000 - 00000000f4000000 (reserved)
[    0.000000]  BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
[    0.000000]  BIOS-e820: 00000000fed14000 - 00000000fed1a000 (reserved)
[    0.000000]  BIOS-e820: 00000000fed1c000 - 00000000fed20000 (reserved)
[    0.000000]  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
[    0.000000]  BIOS-e820: 00000000ffe00000 - 0000000100000000 (reserved)
[    0.000000] last_pfn = 0x3f0ea max_arch_pfn = 0x100000
[    0.000000] kernel direct mapping tables up to 38000000 @ 7000-c000
[    0.000000] DMI 2.4 present.
[    0.000000] ACPI: RSDP 000FE020, 0024 (r2 APPLE )
[    0.000000] ACPI: XSDT 3FEFD1C0, 0074 (r1 APPLE   Apple00       A5
     1000013)
[    0.000000] ACPI: FACP 3FEFB000, 00F4 (r3 APPLE   Apple00       A5
Loki       5F)
[    0.000000] ACPI: DSDT 3FEF0000, 48D1 (r1 APPLE  ...
From: Yinghai Lu
Date: Wednesday, September 10, 2008 - 12:36 pm

On Wed, Sep 10, 2008 at 12:06 PM, Justin Mattock


uhci and ehci share one irq. and irq handers for them all registered....

or there is hub under that uhci again...

YH
--

From: Justin Mattock
Date: Wednesday, September 10, 2008 - 12:39 pm

Not sure,
let me disable ehci_hcd for a while to see if this message appears


-- 
Justin P. Mattock
--

From: Bjorn Helgaas
Date: Wednesday, September 17, 2008 - 8:04 am

Can you confirm whether this is really a regression?  That is,
was it working in 2.6.26 and broken in 2.6.27-rc6?  We want to
fix all bugs, of course, but it is much more important to fix
regressions than it is to fix things that have always been
broken.

I noticed several errors related to the ehci controller in your
log.  I'm not a USB expert, so I cc'd some folks who are.

[   12.140614] ehci_hcd 0000:00:1d.7: PCI INT A -> GSI 23 (level, low) -> IRQ 23
[   12.149917] ehci_hcd 0000:00:1d.7: setting latency timer to 64
[   12.159523] ehci_hcd 0000:00:1d.7: EHCI Host Controller
[   12.168858] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned
bus number 5
[   12.182246] ehci_hcd 0000:00:1d.7: debug port 1
[   12.191574] ehci_hcd 0000:00:1d.7: cache line size of 32 is not supported
[   12.200970] ehci_hcd 0000:00:1d.7: irq 23, io mem 0x50405400
[   12.250079] usb 3-2: new full speed USB device using uhci_hcd and address 2
[   12.270062] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00,
driver 10 Dec 2004
[   12.279955] usb usb5: configuration #1 chosen from 1 choice
[   12.289791] hub 5-0:1.0: USB hub found
[   12.299531] hub 5-0:1.0: 8 ports detected

[   13.300084] usb 5-4: new high speed USB device using ehci_hcd and address 3

[   13.461784] usb 5-4: configuration #1 chosen from 1 choice

[   15.346683] drivers/hid/usbhid/hid-core.c: couldn't find an input interrupt endpoint

[   16.068174] ehci_hcd 0000:00:1d.7: HC died; cleaning up

[   16.155047] hub 5-0:1.0: hub_port_status failed (err = -19)
[   16.166001] hub 5-0:1.0: connect-debounce failed, port 4 disabled
[   16.176912] usb 5-4: USB disconnect, address 3

[   18.821797] irq 23: nobody cared (try booting with the "irqpoll" option)

[   25.276333] usb 4-1: usbfs: USBDEVFS_CONTROL failed cmd hid2hci rqt
64 rq 0 len 0 ret -84

[   25.450080] usb 4-1: USB disconnect, address 2

My guess is that because of the "HC died" situation, we unregistered
the ehci handler on IRQ 23, then got another interrupt ...
From: Justin Mattock
Date: Wednesday, September 17, 2008 - 10:13 am

I've seeing something interesting at the moment
with this. I've upgraded isight-firmware-tools to the latest,
and instead of using udev I'm using hal. I've done numerous
reboots, and am not receiving this message. I think it might have been
something to do with ift-load. If I don't see this message after a few more days
of rebooting, and cold starts I think we can close this.

regards;


-- 
Justin P. Mattock
--

From: Bjorn Helgaas
Date: Wednesday, September 17, 2008 - 2:48 pm

Maybe the problem doesn't happen with different user-space
tools, but that doesn't mean the kernel problem is fixed. 

And we still don't know whether this is really a regression
or not.

Bjorn
--

From: Justin P. Mattock
Date: Wednesday, September 17, 2008 - 3:18 pm

From what I remember this started around 2.6.27-rc1(I.g. Seeing this  
message during bootup), with having issues With ift-load probably  
around 2.6.26-rc1(should of done an upgrade then,but never got around  
to it), as for this being a regression? I guess; if this was happening  
with 2.6.26 and other previous kernels.
  As for my situation I've rebooted again a few times with
Not receiving this message. As for the issue being fixed? Not sure,  
one side of me says yes it is due to not having the kernel report any  
problems, but then there's the side that agrees with what you had  
said. maybe the problem doesn't happen with different user-space  
tools, doesn't mean the kernels problem is fixed.

justin P. Mattock



--

From: Justin Mattock
Date: Wednesday, September 17, 2008 - 5:14 pm

On Wed, Sep 17, 2008 at 3:18 PM, Justin P. Mattock

Well it sure is nice not having any issues with ift-load, but
Bjorn you're right, So I reverted back to udev were the issue seems
to be happening. FWIW here is what I see in dmesg after loading the system
without ehci_hcd and then modprobing ehci_hcd:

[  136.764390] ehci_hcd 0000:00:1d.7: PCI INT A -> GSI 23 (level, low) -> IRQ 23
[  136.764429] ehci_hcd 0000:00:1d.7: setting latency timer to 64
[  136.764436] ehci_hcd 0000:00:1d.7: EHCI Host Controller
[  136.764666] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned
bus number 5
[  136.768610] ehci_hcd 0000:00:1d.7: debug port 1
[  136.768631] ehci_hcd 0000:00:1d.7: cache line size of 32 is not supported
[  136.768650] ehci_hcd 0000:00:1d.7: irq 23, io mem 0x50405400
[  136.792024] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00,
driver 10 Dec 2004
[  136.792464] usb usb5: configuration #1 chosen from 1 choice
[  136.792613] hub 5-0:1.0: USB hub found
[  136.792638] hub 5-0:1.0: 8 ports detected
[  136.794750] uvcvideo: Non-zero status (-84) in status completion handler.
[  136.950399] usb 1-2: USB disconnect, address 2
[  136.998597] input: appletouch disconnected
[  137.180146] usb 2-2: USB disconnect, address 2
[  137.350104] usb 3-2: USB disconnect, address 2
[  137.510109] usb 4-1: USB disconnect, address 3
[  137.511079] btusb_intr_complete: hci0 urb f5574580 failed to resubmit (19)
[  137.511103] btusb_bulk_complete: hci0 urb f5574420 failed to resubmit (19)
[  137.512077] btusb_bulk_complete: hci0 urb f5574e70 failed to resubmit (19)
[  137.512618] btusb_send_frame: hci0 urb f5574e70 submission failed
[  137.513381] __set_isoc_interface: hci0 setting interface failed (19)
[  138.000112] usb 5-4: new high speed USB device using ehci_hcd and address 3
[  138.151670] usb 5-4: configuration #1 chosen from 1 choice
[  138.152237] uvcvideo: Found UVC 1.00 device Built-in iSight (05ac:8501)
[  138.158357] input: Built-in iSight as /class/input/input12
[  ...
From: Justin Mattock
Date: Wednesday, September 17, 2008 - 9:44 pm

Hello,
after a bit of
searching I located a patch for something
that might be similar to this issue:(cant seem to locate the
linux-kernel thread).
http://www.gossamer-threads.com/lists/linux/kernel/966076
I've applied the patches just to see what would happen,
As for now I have not seen the Disable irq #23 yet,
(usually after a recompile of the kernel, upon reboot this message will appear,
this time it didn't). But you never know I'll run the system for a few days
and see if it pops up.

-- 
Justin P. Mattock
--

From: David Brownell
Date: Thursday, September 18, 2008 - 2:09 am

Those two patches should be in Greg's queue for 2.6.27...

It's also suspicious that EHCI is getting loaded *last* not
first.  That's a common distro bug, although as I recall
we've always (since first driver versions) been clear that
it should be loaded first, before the companion (OHCI or UHCI)
driver.  Loading it last causes various confusions and races
at the hardware level.

- Dave
--

From: Justin Mattock
Date: Thursday, September 18, 2008 - 9:57 am

I have not tried load ehci_hcd first in /etc/modules,
but can try. As for the message itself  I've rebooted
a couple of times without disable irq #23 going off.


-- 
Justin P. Mattock
--

From: April Tsui
Date: Thursday, September 18, 2008 - 2:56 pm

Please remove this email address from your discussions.

--

From: Alan Stern
Date: Thursday, September 18, 2008 - 3:23 pm

You are concentrating on the symptom instead of the problem.  The
symptom is that "Disabling IRQ #23" message -- don't worry about it.  

The underlying problem is the "ehci_hcd 0000:00:1d.7: HC died;
cleaning up" message -- it should _never_ appear.  If you have a 
reliable way of provoking it then something needs to be fixed.

Alan Stern

--

From: Justin Mattock
Date: Thursday, September 18, 2008 - 5:32 pm

The reliable way of provoking this message is
when uvcvideo is loading(not the module, but the tools involved with
the communication of loading the firmware and module).
i.g. when using the udev approach with isight-firware-tools
I will receive this, but if I use the hal approach
this message does not appear. Now with using the udev approach
I did try this patch:(not knowing what it really was doing),
http://www.gossamer-threads.com/lists/linux/kernel/966076
And it seems this message has not appeared. From looking at the
situation after running this patch irregardless if it was intended for this
or not, I'm wondering if isight-firware-tools is the main issue with this
rather than the kernel.


-- 
Justin P. Mattock
--

From: Alan Stern
Date: Friday, September 19, 2008 - 9:08 am

Which message?  The "Disabling IRQ #23" message or the "HC died; 

It doesn't matter what tools or other userspace programs you run; the 
"HC died; cleaning up" message should _never_ appear.

Alan Stern

--

From: Justin Mattock
Date: Friday, September 19, 2008 - 9:41 am

The disable irq #23 message.

As for the HC died; cleaning up
(where did you see that, can't seem to find that?)


-- 
Justin P. Mattock
--

From: Alan Stern
Date: Friday, September 19, 2008 - 10:05 am

It's in your original message:

	http://marc.info/?l=linux-kernel&m=122107367715361&w=2

The line with timestamp 16.068174.

Alan Stern

--

From: Justin Mattock
Date: Friday, September 19, 2008 - 11:18 am

ahh there it is...
I've been so focused on making sure
uvcvideo is loading everything properly, I forgot to
see what happens when I load everything without
any of the uvcvideo stuff.


-- 
Justin P. Mattock
--

From: Justin Mattock
Date: Friday, September 19, 2008 - 2:20 pm

Hmm, I know you said none of the userspace
apps should cause this.
I not sure but, I might
be having a clash between udev and hal
(not a clean install of isight-firmware-tools);
when installing these items using the udev approach
I noticed some files still remaining(below is the path)
/usr/local/share/hal/fdi/preprobe/20thirdparty/50-isight-firmware.fdi
when removing this file the system runs without any issues with the
udev approach,
as soon as I add this file:
disable irq #23 appears.(but not all the time).


-- 
Justin P. Mattock
--

From: David Brownell
Date: Friday, September 19, 2008 - 5:56 pm

In

  http://www.kernel.org/pub/linux/kernel/people/gregkh/gregkh-2.6/gregkh-02-usb.current/

the patch

  usb-fix-ehci-periodic-transfers.patch

seems like it should resolve the root cause of this, and

 usb-ehci-fix-some-ehci-hangs-and-crashes.patch

would cover symptoms which probably caused the "Disabling..." message.
Both patches were in that LKML thread you referenced.
--

From: Justin Mattock
Date: Friday, September 19, 2008 - 6:19 pm

Cool, I did apply that patch and the periodic patch as well,
after doing so the system did not report any disable irq #23
message., but after running these two for a few days without
any issues I reverted. Then looking more into my situation,
I think ehci_hcd was getting confused with one usb device being
created by udev, and then a few seconds later another device with the
same numbers
created by hal(if thats how those two work), causing ehci_hcd to freak out.
in any case; applying the patch you had mentioned, as well
as the periodic patch did make ehci_hcd accept the two(if thats what
was happening).

regards;

-- 
Justin P. Mattock
--

Previous thread: [PATCH 0/2] WM8400 support by Mark Brown on Wednesday, September 10, 2008 - 11:28 am. (13 messages)

Next thread: Oops/Warning report for the week of September 10th, 2008 by Arjan van de Ven on Wednesday, September 10, 2008 - 12:16 pm. (8 messages)