On an HT 3GHz Xeon, I see a ~35% increase in time (1830ns -> 2500ns),
averaged over 1,000,000 interrupts. Because the machine has only 2 CPUs,
it always ends up using the smp_call_function_single path (reached via
the smp_call_function_mask path).
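
For reference, the arithmetic behind those figures can be checked with a
small user-space sketch. The function names here are made up for
illustration and are not kernel code; the only inputs are the two
per-interrupt timings and the interrupt count quoted above.

```c
#include <stdint.h>

/* Hypothetical helpers, user-space only; not part of any kernel API. */

/* Extra nanoseconds spent per interrupt after the change. */
static uint64_t extra_ns_per_irq(uint64_t before_ns, uint64_t after_ns)
{
	return after_ns - before_ns;
}

/* Relative increase in percent, measured against the "before" timing. */
static double percent_increase(uint64_t before_ns, uint64_t after_ns)
{
	return 100.0 * (double)(after_ns - before_ns) / (double)before_ns;
}

/* Total extra time in nanoseconds across a run of `count` interrupts. */
static uint64_t total_extra_ns(uint64_t before_ns, uint64_t after_ns,
			       uint64_t count)
{
	return extra_ns_per_irq(before_ns, after_ns) * count;
}
```

With before = 1830 and after = 2500, this gives 670ns extra per
interrupt, an increase of about 36.6%, and roughly 0.67s of added time
over the 1,000,000-interrupt run.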
I need to get a machine with more CPUs running native to do a proper
test, especially to see whether putting kmalloc on that path is very
expensive. (Is it worth using a kmem_cache?)
J
--