Sunkyoung Shin wrote:I understand that you want the QP timeout/retries to be smaller, and not the CM timeout/retries and hence there might be some confusion here which the following rdma-cm code snip from cma_connect_ib() might help resolving: ... ... The user is in total control on the QP retry count through the rdma-cm connection param structure, the req.max_cm_retries has nothing to do with the QP timeout. The RC QP timeout is derived by the IB CM internally (on ofed through module param which you have changed) and the rdma-cm nor its consumer have direct control on it. This follows the IB spec spirit that the SM/SA is the one to calculate and return to the host a param named "this path packet life time" so the IB CM combines the packet life time and something called the "hca ack delay". Currently the IB CM just 2 * path.packet_life_time as an estimation for the timeout which is the packet life time plus the hca ack delay, see cm_init_av_by_path() in core/cm.c . Note that the actual timeout T = 4.096us * 2^t where t is the value plugged into the QP. Hence doing t = path.packet_life_time + 1 does what I described above. In examination I did on the past I think that the openSM always returns path.packet_life_time = 18 and same for some vendor SMs. This means that the timeout is 2^(2+18+1) = 2^21us = 2 seconds The # retries set by the iser initiator are seven (see iser_route_handler()) so seven times two give 14 seconds, which makes your report on the 100 seconds it took the initiator to reconnect to possibly point on the different problem. Or. _______________________________________________ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
| Bart Van Assche | Integration of SCST in the mainstream Linux kernel |
| Andrew Morton | 2.6.23-mm1 |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Justin Piszcz | Re: 2.6.23.1: mdadm/raid5 hung/d-state |
| Gerrit Renker | [PATCH 27/37] dccp: Integration of dynamic feature activation - part 2 (server side) |
| Kenny Chang | Multicast packet loss |
| Stephen Hemminger | Re: HTB accuracy for high speed |
| David Miller | [GIT]: Networking |
git: | |
| Sander | 'struct task_struct' has no member named 'mems_allowed' (was: Re: 2.6.20-rc4-mm1) |
