We are changing definitions. The original patch by Andrea calls
invalidate_page for each pte that is cleared. So strictly you would not
need an invalidate_range.
The refcount is not necessary if we adopt Andrea's approach of a callback
on the clearing of each pte. At that point the page is still guaranteed to
exist. If we do the range_invalidate later (as in V3) then the page may
have been released (see sys_remap_file_pages() f.e.) before we zap the GRU
ptes. So there will be a time when the GRU may write to a page that has
been freed and used for another purpose.
Taking a refcount on the page defers the free until the range_invalidate
runs.
I would prefer a solution that does not require taking refcounts (pins)
for establishing an external pte and for release (like what the GRU does).
If we could effectively determine that there are no external ptes in a
range then the invalidate_page() call may return immediately. Maybe it is
then effective to do these gazillions of invalidate_page() calls when a
process terminates or an remap is performed.
--