login
Login
/
Register
Search
Search this site:
Forums
News
Blogs
Features
Site
Home
»
Mailing list archives
»
linux-kernel
»
2010
»
December
»
9
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
view
thread
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
[view in full thread]
From: Borislav Petkov
Subject:
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
Date: Thursday, December 9, 2010 - 6:37 am
On Thu, Dec 09, 2010 at 06:09:38PM +1100, Nick Piggin wrote:
quoted text
> I was actually discussing this with Linus a while back, and finally > got around to testing it out now that I have a modern CPU to measure > it on! CCing linux-arch because it would be interesting to know > whether your tuned functions do better than gcc or not (I would > suspect not). > > BTW. patch and numbers are on top of my scaling series, just for > an idea of what it does, I just want to generate some interesting > discussion. > > If people are interested in running benchmarks, I'll be pushing out > a new update soon, after some more testing and debugging here. > > The standard memcmp function on a Westmere system shows up hot in > profiles in the `git diff` workload (both parallel and single threaded), > and it is likely due to the costs associated with trapping into > microcode, and little opportunity to improve memory access (dentry > name is not likely to take up more than a cacheline). > > So replace it with an open-coded byte comparison. This increases code > size by 24 bytes in the critical __d_lookup_rcu function, but the > speedup is huge, averaging 10 runs of each: > > git diff st user sys elapsed CPU > before 1.15 2.57 3.82 97.1 > after 1.14 2.35 3.61 96.8 > > git diff mt user sys elapsed CPU > before 1.27 3.85 1.46 349 > after 1.26 3.54 1.43 333 > > Elapsed time for single threaded git diff at 95.0% confidence: > -0.21 +/- 0.01 > -5.45% +/- 0.24%
Nice. [..]
quoted text
> +static inline int dentry_memcmp(const unsigned char *cs, > + const unsigned char *ct, size_t count) > +{ > + while (count) { > + int ret = (*cs != *ct); > + if (ret) > + return ret; > + cs++; > + ct++; > + count--; > + } > + return 0; > +}
we have a memcmp() in lib/string.c. Maybe reuse it from there? -- Regards/Gruss, Boris. --
unsubscribe notice
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to
majordomo@vger.kernel.org
More majordomo info at
http://vger.kernel.org/majordomo-info.html
Please read the FAQ at
http://www.tux.org/lkml/
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
Messages in current thread:
Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Thu Dec 9, 12:09 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Borislav Petkov
, (Thu Dec 9, 6:37 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Thu Dec 9, 7:38 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Thu Dec 9, 9:27 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, J. R. Okajima
, (Fri Dec 10, 7:23 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Sun Dec 12, 6:45 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, J. R. Okajima
, (Mon Dec 13, 12:29 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Mon Dec 13, 1:25 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, J. R. Okajima
, (Tue Dec 14, 12:01 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Américo Wang
, (Tue Dec 14, 9:38 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Tue Dec 14, 10:54 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, J. R. Okajima
, (Tue Dec 14, 10:57 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Linus Torvalds
, (Wed Dec 15, 12:12 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, David Miller
, (Wed Dec 15, 11:00 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Tony Luck
, (Wed Dec 15, 4:09 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Wed Dec 15, 7:34 pm)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Boaz Harrosh
, (Thu Dec 16, 2:53 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Thu Dec 16, 6:13 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Boaz Harrosh
, (Thu Dec 16, 7:03 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Nick Piggin
, (Thu Dec 16, 7:15 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, Linus Torvalds
, (Thu Dec 16, 9:51 am)
Re: Big git diff speedup by avoiding x86 "fast string" memcmp
, David Miller
, (Thu Dec 16, 10:57 am)
Navigation
Mailing list archives
Recent posts
Popular discussions
linux-kernel
:
Mel Gorman
Re: [PATCH 1/4] vmstat: remove zone->lock from walk_zones_in_node
Guenter Roeck
Re: [lm-sensors] Location for thermal drivers
David Woodhouse
Re: RFC: Moving firmware blobs out of the kernel.
Siddha, Suresh B
Re: [PATCH 2.6.21 review I] [11/25] x86: default to physical mode on hotplug CPU k...
Peter Zijlstra
Re: [patch 4/6] mm: merge populate and nopage into fault (fixes nonlinear)
git-commits-head
:
Linux Kernel Mailing List
[MIPS] Fix potential latency problem due to non-atomic cpu_wait.
Linux Kernel Mailing List
USB: rename USB_SPEED_VARIABLE to USB_SPEED_WIRELESS
Linux Kernel Mailing List
lib/vsprintf.c: fix bug omitting minus sign of numbers (module_param)
Linux Kernel Mailing List
[Bluetooth] Initiate authentication during connection establishment
Linux Kernel Mailing List
[POWERPC] 4xx: Add ppc40x_defconfig
linux-netdev
:
MERCEDES
Your mail id has won 950,000.00 in the MERCEDES Benz Online Promo.for claims send:
David Miller
Re: [PATCH] xen/netfront: do not mark packets of length < MSS as GSO
David Miller
Re: skb_segment() questions
Shan Wei
[RFC PATCH net-next 2/5]IPv6:netfilter: Send an ICMPv6 "Fragment Reassembly Timeou...
Stanislaw Gruszka
[PATCH 1/4] bnx2x: use smp_mb() to keep ordering of read write operations
git
:
Nicolas Sebrecht
git-svn died of signal 11 (was "3 failures on test t9100 (svn)")
Junio C Hamano
Re: [PATCH 2/2] Add url.<base>.pushInsteadOf: URL rewriting for push only
Martin Langhoff
Re: [PATCH] GIT commit statistics.
Alexandre Julliard
[PATCH] gitweb: Put back shortlog instead of graphiclog in the project list.
Josh Triplett
[PATCH 2/2] Add url.<base>.pushInsteadOf: URL rewriting for push only
openbsd-misc
:
Taisto Qvist XX
Re: AMD GEODE LX-800 just works with kernel from install42.iso and kernelpanics wi...
Nico Meijer
Re: gOS Develop Kit with VIA pc-1 Processor Platform VIA C7-D
Andreas Bihlmaier
Re: jetway board sensors (Fintek F71805F)
admin
Drive a 2009 car from R799p/m
Antti Harri
Re: how to create a sha256 hash
Colocation donated by:
Syndicate