That's another argument for not introducing the sysctl; the number of
nodes and zones are a static property of the machine that cannot change
without a reboot (numa=fake, mem=, introducing movable zones, etc). We
don't have node hotplug that can suddenly introduce additional zones from
which to reclaim.
My point was that there doesn't appear to be any use case for tuning this
via a sysctl that isn't simply attempting to workaround some other reclaim
problem when the VM is stressed. If that's agreed upon, then deciding
between a config option that is either per-cpu or per-node should be based
on the benchmarks that you've run. At this time, it appears that per-node
is the more advantageous.
That seems to indicate that the NUMA topology is more important than lock
contention for the reclaim throttle.
David
--