ah, in retrospect i realized that this test had one flaw: some of the
systems i the build cluster already ran a newer kernel and hence were
targets for this bug.
so i turned off CONFIG_TCP_CONG_CUBIC on all the testboxes and rebooted
the cluster boxes into 2.6.25, and the hung sockets are now gone. (about
150 successful iterations)
i did another change as well: i removed the localhost distcc component.
I'll reinstate that now to make sure it's really related to
TCP_CONG_CUBIC and not to localhost networking.
Ingo
--