yes but note that by caching the whole mapping on 64-bit we get
everything we want: trivially lockless, works from any CPU, can be
preempted at will, and there are no ugly INVLPG flushes anywhere.
you'll even get 2MB mappings automatically, if the BAR is aligned and
sized correctly.
32-bit we should handle as well but not design for it.
Ingo
--