TX on r8169 causes latency and messes up real-time apps, on some boards

Previous thread: Re: System freeze on reboot - general protection fault by Eric Dumazet on Sunday, August 16, 2009 - 2:16 am. (50 messages)

Next thread: Re: [PATCH] Revert netlink ABI change to gnet_stats_basic by Eric Dumazet on Sunday, August 16, 2009 - 5:33 am. (6 messages)
From: Andrew Lutomirski
Date: Sunday, August 16, 2009 - 3:39 am

Hi-

I'm having a bit of a mystery problem.  I have a bunch of identical
machines (Intel D201GLY2A + Trendnet TEG-PCITXR) which all work
wonderfully.

I have four brand-new machines that should be basically identical,
except that they are Intel D201GLY2 boards to which I have attached
fans myself (that's supposed to be the only difference), and for which
I bought the Trendnet TEG-PCITXR cards much later.  On those four new
machines, transmitting with the Trendnet card breaks my real-time app.

These are 2.6.30.1, although I've reproduced the behavior on
2.6.29.something.  All machines in question are running bitwise
identical kernels and userspace.

I've diffed the dmesg results.  The good machines differ only in
BogoMIPS (tiny difference), MAC addresses (obviously), and RTC time.
The difference from good to bad is attached and also shows which r8169
device I have.  (The USB stuff at the bottom is the real-time device
in question.)

The working machines do have a newer BIOS, and the only difference I
see that might make a difference is that the PWM fan control changed,
which might affect SMI latency, but I don't see why this should only
matter when sending data over the network.

Any ideas?  I'm happy to test things.

Thanks,
Andy
From: Francois Romieu
Date: Sunday, August 16, 2009 - 10:19 pm

Andrew Lutomirski <luto@mit.edu> :

There is some difference. Either the kernel, the board or the bios
revision. I'd be surprized if the /proc/interrupts looked the same.

-- 
Ueimor
--

From: Andrew Lutomirski
Date: Sunday, August 16, 2009 - 10:31 pm

This might be a false alarm -- it's possible I'm just trying to exceed
available PCI bandwidth.  I'll email back if I find any more
convincing evidence it's a problem with r8169.

Thanks,
Andy

--

Previous thread: Re: System freeze on reboot - general protection fault by Eric Dumazet on Sunday, August 16, 2009 - 2:16 am. (50 messages)

Next thread: Re: [PATCH] Revert netlink ABI change to gnet_stats_basic by Eric Dumazet on Sunday, August 16, 2009 - 5:33 am. (6 messages)