On Jan 5, 2008 11:13 AM, Jarek Poplawski <jarkao2@gmail.com> wrote:still on the todo-list, I had no time to try this yet... I don't think ieee1394 is to blame here. See http://lkml.org/lkml/2007/11/29/372 This was the first report of these crashes. The first one is a similar crash in the ieee1394 code and my first try was to blame it. But switching to a real network card did not solve this, as the second crash in that mail shows. Also Stefan Richter said in http://lkml.org/lkml/2007/11/29/419 this: "FWIW, eth1394 and the entire rest of the 1394 stack beneath eth1394 are identical between -mm and Linus' tree." I'm still using the old ieee1394-stack and not the new firewire one, as eth1394 had not been ported at that time. It might be possible that these are two different bugs, but two bugs with same symptom's of corrupted lists at the same time seem unlikely. (Especially this last report of the oops in 1394 looks rather strange. Things can only go onto hpsbpkt_queue if they have a non NULL complete_routine. (see queue_packet_complete() in drivers/ieee1394/ieee1394_core.c). But a call to a NULL complete_routine seems to be the cause of one of the two oopses. So it looks like the hpsbpkt_queue list got mangled. But this list is only used in this file and all three places that access this list are protected by spinlocking pending_packets_lock. So my personal conclusion would be, that someone is writing to memory that he no longer owns. Most probably 0-bytes. (the complete_routine got NULLed and the warning about dst->__refcnt being 0). Use-after-free or something else? [snip] Attached. (Last one I was using with 2.6.24-rc6-mm1. For all other tests I copied this one and did a make oldconfig) Interesting. I didn't even know about this file / option. But four things make an involvement rather doubtful: a) I do not find a single line like "init_ohci1394_dma: initializing OHCI-1394" in any of the syslogs. b) I do not have the parameter ohci1394_dma=early set c) # CONFIG_PROVIDE_OHCI1394_DMA_INIT is not set d) I have seen the crash in svc_xprt_enqueue() without eth1394 and at that try there was not a single firewire device attached. I will now try broken-out-patches... Torsten
| Benjamin Herrenschmidt | Re: [PATCH] Remove process freezer from suspend to RAM pathway |
| Daniel Walker | Re: [Announce] [patch] Modular Scheduler Core and Completely Fair Scheduler [CFS] |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Andrew Morton | -mm merge plans for 2.6.23 |
git: | |
| David Miller | [GIT]: Networking |
| Hannes Eder | [PATCH 01/43] drivers/net/at1700.c: fix sparse warning: symbol shadows an earlier ... |
| Gerrit Renker | [PATCH 16/37] dccp: API to query the current TX/RX CCID |
| Herbert Xu | Re: [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
