Hello, Recently my X terminals showed annoying behaviour where the application in the terminal was not resized properly to the actual size of the X terminal emulator window, resulting in a lot of misaligned text on the screen. Hunting the issue down from the windowmanager and the terminal emulator program, I suspect the problem might lie in the kernel. I'm running 2.6.26 on a dual core i386. What I see is this: the userspace application receives a SIGWINCH signal and acquires the terminal size usign the TIOCGWINSZ ioctl. It seems that in some cases the old instead of the new terminal size is returned. A small delay before the ioctl seems to 'fix' this behaviour. I noticed some changes involving locking in the the pty code in the last kernel verions, could one of these changes cause the above behaviour ? If so, wouldn't this affect much more users ? Ico ^X^Cy^K^X^C^C^C^C --
hm, that code is pretty simple and although it does the SIGWINCH and the window-size setting in a peculiar order, it looks to be race-free. Approximately what proportion of the time does it go wrong? --
I guess about 10 to 20% of the resizes. I happen to be using a tiling window manager which causes resizing more often and more agressive then 'normal' window managers, I guess this helps triggering the problem. I temporary worked around this issue this by changing the order of the signal and the updating of the pty size in tty_io.c's tiocswinsz(), but this is not much of a real fix. Ico --
Well damn. Are you sure? The code looks solid to me.
At least, it does after
Author: Alan Cox <alan@redhat.com> 2008-08-15 02:39:38
Committer: Linus Torvalds <torvalds@linux-foundation.org> 2008-08-15 10:34:07
Parent: 000b9151d7851cc1e490b2a76d0206e524f43cca (Fix race/oops in tty layer after BKL pushdown)
Branches: git-cifs, git-ia64, git-nfs, git-powerpc-merge, linux-next, remotes/origin/master
Follows: v2.6.27-rc3
Precedes: next-20080818
tty: remove resize window special case
perhaps you're still running a kernel which is earlier than that?
--
I reported the behaviour on 2.6.26.2, but I was not aware this issue was adressed already in the 2.6.27 tree. I will update to the latest 2.6.27 rc and report if the problem still persists. --
I am not able to reproduce the problem with git 2.6.27-rc3-next-20080819. --
On Tue, 19 Aug 2008 19:56:39 +0200
That's linux-next, yes?
linux-next has additional locking in there:
commit 2283faa9cec083b6ddc1fa02a974ce1c797e847f
Author: Alan Cox <alan@redhat.com>
Date: Thu Aug 14 09:53:22 2008 +1000
tty-fix-pty-termios-race
Kanru Chen posted a patch versus the old code which deals with the case
where you resize the pty side of a pty/tty pair. In that situation the
termios data is updated for both pty and tty but the locks are not held
on both sides.
This reimplements the fix against the updated tty code. Patch by self but
the hard bit (noticing and fixing the bug) is thanks to Kanru Chen.
Signed-off-by: Alan Cox <alan@redhat.com>
diff --git a/drivers/char/tty_io.c b/drivers/char/tty_io.c
index a8ddcba..779c6b5 100644
--- a/drivers/char/tty_io.c
+++ b/drivers/char/tty_io.c
@@ -2068,7 +2068,7 @@ static int tiocgwinsz(struct tty_struct *tty, struct winsize __user *arg)
/**
* tty_do_resize - resize event
* @tty: tty being resized
- * @real_tty: real tty (if using a pty/tty pair)
+ * @real_tty: real tty (not the same as tty if using a pty/tty pair)
* @rows: rows (character)
* @cols: cols (character)
*
@@ -2085,6 +2085,14 @@ int tty_do_resize(struct tty_struct *tty, struct tty_struct *real_tty,
mutex_lock(&tty->termios_mutex);
if (!memcmp(ws, &tty->winsize, sizeof(*ws)))
goto done;
+
+ /* If a pty/tty pair is updated we will have a real_tty defined
+ which doesn't match the tty. In this case as we will update
+ both of the tty termios sets. We can lock both mutex safely here
+ as in this case real_tty is the tty, tty is the pty side and we
+ have lock ordering */
+ if (real_tty != tty)
+ mutex_lock(&real_tty->termios_mutex);
/* Get the PID values and reference them so we can
avoid holding the tty ctrl lock while sending signals */
spin_lock_irqsave(&tty->ctrl_lock, flags);
@@ -2102,6 +2110,8 @@ int tty_do_resize(struct tty_struct *tty, struct ...2.6.25.9: good 2.6.26-rc1 bad ( +/- 10% of the resizes) 2.6.26.2: still bad ( +/- 10% of the resizes ) 2.6.27-rc3: ugly bad ( > 75% of the resizes ) linux-next good Any other versions to test ? ^X^Cy^K^X^C^C^C^C --
On Tue, 19 Aug 2008 22:13:11 +0200
Current Linus mainline has
commit 8c9a9dd0fa3a269d380eaae2dc1bee39e865fae1
Author: Alan Cox <alan@redhat.com>
Date: Fri Aug 15 10:39:38 2008 +0100
tty: remove resize window special case
which might have fixed this after 2.6.27-rc3, so testing
ftp://ftp.kernel.org/pub/linux/kernel/v2.6/snapshots/patch-2.6.27-rc3-git6.gz
would be most interesting.
We should hunt down the problem and get 2.6.26.x fixed up too.
Thanks.
--
Still very broken. ^X^Cy^K^X^C^C^C^C --
