Sure, I read what you wrote before writing (although, frankly, didn't
get the idea). But I don't think that overall it would be as efficient
as full hardware offload. See my reply to Jeff Garzik about that.
The test used Myricom Myri-10G cards (myri10ge driver), which support
LRO. And from ethtool -S output I conclude it was enabled. Just in case,
I attached it, so you can recheck me.
Thus, apparently, LRO doesn't make a fundamental difference. Maybe this
particular implementation isn't too efficient, I don't know. I don't
have enough information for that.
Vlad