Re: E1000: e1000_update_stats()

Previous thread: [PATCH] V4L1 - fix v4l_compat_translate_ioctl possible NULL deref by Cyrill Gorcunov on Wednesday, March 5, 2008 - 1:24 pm. (1 message)

Next thread: [PATCH] checkpatch: Make some text more consinstent and informative by Paolo Ciarrocchi on Wednesday, March 5, 2008 - 1:49 pm. (3 messages)
From: Todd Tomaino
Date: Wednesday, March 5, 2008 - 1:45 pm

I have been having trouble with interrupt latencies.  While profiling my system, I measured the e1000_update_stats() function disabling interrupts for roughly 100 usec.  For my application, this long of a delay results in data loss.  The e1000_update_stats() function is broken down into 3 main operations:

1. acquire spinlock
2. read registers & update adapter->stat counters
3. update OS adapter->net_stats statistics
4. update phy stats.
5. release spinlock.

On my system I measured 57 usec to perform operations #2 and #3, and 52 usec to perform operation #4.

Reading the comments in the code it appeared that updating adapter->net_stats and adapter->stat required interrupts to be turned off to prevent the e1000_adjust_tbi_stats() function from modifying these structures from interrupt context.  In order to reduce the amount of time spent with interrupts disabled I separated the register reads from updating adapter->stat counters.  The register reads take up the bulk of the 57 usec and shouldn't require protection (I could be wrong).  The new operation looks like this and reduces the amount of time spent with interrupts turned off by 1/2:

1. read registers
2. acquire spinlock
3. update adapter->stat counters
4. update OS adapter->net_stats statistics
5. update phy stats.
6. release spinlock.


It seemed like a bad idea to attempt to move the phy reads out of the spin locked code area.

These changes have greatly reduced my interrupt latency, but are there any negative side effects that could be caused??


Best regards,
Todd


System Info:

  Kernel:            2.6.18 (RHEL5.1)

  E1000 driver:      7.3.20-k2
--

From: Chris Friesen
Date: Wednesday, March 5, 2008 - 2:22 pm

One potential problem with this is that the stats can be further out of 
date.

Consider this case:

cpu0 reads some of the registers, then gets preempted/interrupted
cpu1 reads all the registers, updates the stats, then dumps the data


At this point cpu1 dumped information that doesn't include the registers 
that cpu0 was in the middle of updating.  Eventually cpu0 will run again 
and the stats will be accurate, but there is a window where the stats 
may be self-inconsistent and the various counts may not add up.

Chris

--

From: Kok, Auke
Date: Wednesday, March 5, 2008 - 2:40 pm

the update_stats function is only called every two seconds, and scheduled once. I
think that the chance that two of these functions run in contention in the way you
point out are practically zero, and if they do then we have much bigger problems
than just counters being wrong...

the lock is there to prevent an mii-tool PHY read from interfering with the
regularly scheduled PHY reads, so we might want to keep those reads locked
together as Todd suggests himself, but not the bulk MAC reads.

Auke
--

From: Chris Friesen
Date: Wednesday, March 5, 2008 - 3:00 pm

I recently did an investigation that ended up sampling a subset of those 
stats every millisecond (to get an accurate picture of how many packets 
were being arriving vs being dropped due to rx_fifo errors over short 
intervals).  Ultimately it turned out that we had very bursty traffic. 
So it's not entirely unheard-of to sample at much higher rates.

At the very least, if we're going to remove the atomicity of reading 
registers and updating stats, how about adding a comment to this effect 
in the code?

Chris
--

From: Kok, Auke
Date: Wednesday, March 5, 2008 - 2:34 pm

[ Adding netdev@vger.kernel.org !]


not per se, I think you did the right thing here. I think it might even be safe to
move the PHY reads outside the spinlock area, and this is certainly something I
would like to try.

would you care to send an RFC patch to netdev@vger.kernel.org?

Also, I notice you are using an old driver/kernel. care to retest against the
latest git trees?

I'll certainly take your suggestion and have it tested.

Auke
--

Previous thread: [PATCH] V4L1 - fix v4l_compat_translate_ioctl possible NULL deref by Cyrill Gorcunov on Wednesday, March 5, 2008 - 1:24 pm. (1 message)

Next thread: [PATCH] checkpatch: Make some text more consinstent and informative by Paolo Ciarrocchi on Wednesday, March 5, 2008 - 1:49 pm. (3 messages)