> Having the infrastructure to automatically off-line pages
It's already there with a modern mcelog in daemon mode
and a recent kernel that supports soft offlining.
The current default in mcelog is 10 corrected errors per 24h
per 4k page or 1 uncorrected error on the page (if your CPU
supports recovering from that). It is on by default.
You can configure it to be different if you want.
-Andi
--