On Thu, Jul 17, 2008 at 02:14:40PM +0100, David Vrabel wrote:Yes it is, but we don't touch that bit at this time. When we enable MSIs, we set the INTx disable bit (or at least do a write to it ... as Krzysztof Halasa pointed out, not all devices implement this bit). When we mask MSIs, we clear the MSI enable bit. So a device conforming to PCI 2.3 has both INTx and MSI disabled. Unfortunately, a device conforming to PCI 2.2 has MSI disabled and INTx implicitly re-enabled. Oops. Maybe so, but we don't control that. Here's the flow that leads to the problem you've observed (note: only x86-64 analysed here, other architectures may vary): The mask_msi_irq() function is the heart of what's going on. This is set to be the ->mask() function pointer in the MSI irq_chip struct. The device generates an MSI interrupt. Some magic happens in assembly, and the processor ends up in do_IRQ() which calls generic_handle_irq(). Because it's an MSI IRQ, handle_edge_irq() is called instead of __do_IRQ(). There are two opportunities for calling the ->mask() here, one is: if (unlikely((desc->status & (IRQ_INPROGRESS | IRQ_DISABLED)) || !desc->action)) { desc->status |= (IRQ_PENDING | IRQ_MASKED); mask_ack_irq(desc, irq); The other is: if (unlikely(!action)) { desc->chip->mask(irq); This is all in generic code which knows nothing about any device-specific method of masking interrupts from the chip. This is a good thought, let's follow it through. What if we simply make ->mask a no-op for devices which don't support mask bits? MSIs are 'edge triggered' interrupts. They're sent once and then forgotten by the hardware (as opposed to level triggered which will continue to be triggered until deasserted). The first time through (assuming ->action is non-NULL ...), we won't try to mask the irq. The second time through, IRQ_INPROGRESS will be set, so we try to mask_ack_irq(). How about we simply don't ack the irq at this point? That should prevent it being triggered again, right? Working on a patch to do this now ... -- Intel are signing my paycheques ... these opinions are still mine "Bill, look, we understand that you're interested in selling us this operating system, but compare it to ours. We can't possibly take such a retrograde step." --
| Davide Libenzi | Re: [patch 7/8] fdmap v2 - implement sys_socket2 |
| Bart Van Assche | Integration of SCST in the mainstream Linux kernel |
| Greg Kroah-Hartman | [PATCH 005/196] Chinese: add translation of SubmittingDrivers |
| Mariusz Kozlowski | [KJ PATCHES] mostly kmalloc + memset conversion to k[cz]alloc |
git: | |
| KOSAKI Motohiro | [bug?] tg3: Failed to load firmware "tigon/tg3_tso.bin" |
| Stefan Richter | Re: [GIT]: Networking |
| David Miller | Re: [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| Gerrit Renker | [PATCH 0/37] dccp: Feature negotiation - last call for comments |
