So what? The cpu gives you incoherent view of data somewhere when only
processing data from a single cpu?
If you have remote data accesses (loop summing the data?) and you have to
be concerned about data coherence then you CANNOT use this_cpu_ops since
they are not guaranteed to be coherent to other processors.
--