* Ingo Molnar <mingo@elte.hu> wrote:hm, the distcc TCP hangs are back: Distcc client box (quad, 10.0.1.16) running v2.6.24: dione:~> netstat -nt | grep -vw TIME_WAIT | grep 3632 tcp 0 250455 10.0.1.16:55559 10.0.1.19:3632 ESTABLISHED tcp 0 254743 10.0.1.16:56096 10.0.1.19:3632 ESTABLISHED tcp 0 219617 10.0.1.16:55674 10.0.1.19:3632 ESTABLISHED [ ^--- note the stuck send-queue ] Distcc server box (16-way, 10.0.1.19) running very-latest: phoenix:~> netstat -nt | grep 10.0.1.16 | grep 3632 tcp 0 0 10.0.1.19:3632 10.0.1.16:55559 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:56096 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:55674 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:34411 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:51094 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:60787 ESTABLISHED tcp 0 0 10.0.1.19:3632 10.0.1.16:50874 ESTABLISHED I.e. the client side send-queue is stuck in established state, server side thinks it's a proper established connection. Nobody makes any progress. Also note the final 4 connections on the server side - those are not present on the client box. The hung condition seemed permanent (i waited a couple of minutes). Then i shut down the distccd on the server side, which propagated to the client: distcc[18496] (dcc_pump_sendfile) ERROR: sendfile failed: Broken pipe distcc[18496] (dcc_readx) ERROR: unexpected eof on fd4 distcc[18496] (dcc_r_token_int) ERROR: read failed while waiting for token "DONE" distcc[18496] Warning: failed to distribute kernel/futex.c to ph/20, running locally instead Server side lingered in FIN_WAIT2 a bit: Proto Recv-Q Send-Q Local Address Foreign Address State tcp 0 0 10.0.1.19:3632 10.0.1.16:56096 FIN_WAIT2 tcp 0 0 10.0.1.19:3632 10.0.1.16:55559 FIN_WAIT2 I retried the same build 10 times and it would not reproduce - so this again is a hard to reproduce condition. (and there's no chance to get a proper tcpdump either, at these traffic levels) Ingo --
| Greg Kroah-Hartman | [PATCH 001/196] Chinese: Add the known_regression URI to the HOWTO |
| Tarkan Erimer | Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 |
| Christoph Lameter | [00/41] Large Blocksize Support V7 (adds memmap support) |
| Linus Torvalds | Linux 2.6.27-rc5 |
git: | |
| David Miller | Re: [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| David Miller | [GIT]: Networking |
| Gerrit Renker | [PATCH 15/37] dccp: Set per-connection CCIDs via socket options |
| Nick Piggin | Re: Mainline kernel OLTP performance update |
