Re: SATA problems

Previous thread: Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 by Nicolas Mailhot on Thursday, June 14, 2007 - 8:40 am. (2 messages)

Next thread: heartbeat problems on linux 2.6.21.[34] by Vitez Gabor on Thursday, June 14, 2007 - 8:52 am. (2 messages)
To: <linux-kernel@...>
Subject: SATA problems
Date: Thursday, June 14, 2007 - 8:36 am

I'm stumped trying to track down the below intermittent problem.....

I've confirmed this problem on 2.6.19, 2.6.20 and 2.6.21.

Any help greatly appreciated!

Regards
Nigel

Jun 14 07:55:52 nigel-m2v kernel: ata2.00: exception Emask 0x0 SAct 0x0
SErr 0x0 action 0x2 frozen
Jun 14 07:55:52 nigel-m2v kernel: ata2.00: cmd
ca/00:18:87:e7:00/00:00:00:00:00/e0 tag 0 cdb 0x0 data 12288 out
Jun 14 07:55:52 nigel-m2v kernel: res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 14 07:55:52 nigel-m2v kernel: ata2: soft resetting port
Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:56:22 nigel-m2v kernel: ata2.00: qc timeout (cmd 0xef)
Jun 14 07:56:22 nigel-m2v kernel: ata2.00: failed to set xfermode
(err_mask=3D0x4)
Jun 14 07:56:22 nigel-m2v kernel: ata2: failed to recover some devices,
retrying in 5 secs
Jun 14 07:56:27 nigel-m2v kernel: ata2: soft resetting port
Jun 14 07:56:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:56:27 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:56:57 nigel-m2v kernel: ata2.00: qc timeout (cmd 0xef)
Jun 14 07:56:57 nigel-m2v kernel: ata2.00: failed to set xfermode
(err_mask=3D0x4)
Jun 14 07:56:57 nigel-m2v kernel: ata2.00: limiting speed to UDMA/133:PIO=
3
Jun 14 07:56:57 nigel-m2v kernel: ata2: failed to recover some devices,
retrying in 5 secs
Jun 14 07:57:02 nigel-m2v kernel: ata2: soft resetting port
Jun 14 07:57:02 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:57:02 nigel-m2v kernel: rtc: lost 2 interrupts
Jun 14 07:57:02 nigel-m2v kernel: ATA: abnormal status 0x7F on port
0x0001c807
Jun 14 07:57:32 nigel-m2v kernel: ata2.00: qc timeout (cmd 0xef)
Jun 14 07:57:32 nigel-m2v kernel: ata2.00: failed to set xfermode
(err_mask=3D0x4)
Jun 14 07:57:32 nigel-m2v kernel: ata2.00: disabled
Jun 14 07:57:33 nigel-m2v kernel: ata2: EH c...

To: Nigel Kukard <nkukard@...>
Cc: <linux-kernel@...>
Date: Thursday, June 14, 2007 - 12:21 pm

Try 2.6.22-rc4-gitX...

Jeff

-

To: Jeff Garzik <jeff@...>
Cc: Nigel Kukard <nkukard@...>, <linux-kernel@...>
Date: Thursday, June 14, 2007 - 2:28 pm

On Thu, Jun 14, 2007 at 12:21:49PM -0400, Jeff Garzik wrote:

> > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> > 0x0001c807
> > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> > 0x0001c807

Unrelated to the other error, but I've been meaning to ask for a while..
If this is 'abnormal', why does every SATA box I've seen do it?

Dave

--
http://www.codemonkey.org.uk
-

To: Jeff Garzik <jeff@...>, Nigel Kukard <nkukard@...>, <linux-kernel@...>
Date: Monday, June 18, 2007 - 4:07 pm

On Thu, Jun 14, 2007 at 02:28:54PM -0400, Dave Jones wrote:
> On Thu, Jun 14, 2007 at 12:21:49PM -0400, Jeff Garzik wrote:
>
> > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> > > 0x0001c807
> > > Jun 14 07:55:52 nigel-m2v kernel: ATA: abnormal status 0x7F on port
> > > 0x0001c807
>
> Unrelated to the other error, but I've been meaning to ask for a while..
> If this is 'abnormal', why does every SATA box I've seen do it?

*crickets*

Should we check for this case explicitly, and not print this?

Dave

--
http://www.codemonkey.org.uk
-

To: Dave Jones <davej@...>, Jeff Garzik <jeff@...>, <linux-kernel@...>
Date: Tuesday, June 19, 2007 - 12:57 am

After I get the above errors, my entire SATA bus crashes and I need to
hard reset the box ... not sure we can just ignore the errors?

To: Dave Jones <davej@...>, Jeff Garzik <jeff@...>, <linux-kernel@...>
Date: Thursday, August 30, 2007 - 5:24 am

Appears even with the patch provided a few months ago I'm getting
freezes. Replaced the HDD & all cables, same errors ... especially
whilst doing heavy IO.

Can anyone shed some light?

ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
131072 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata2: soft resetting port
ATA: abnormal status 0x7F on port 0x0001c807
ATA: abnormal status 0x7F on port 0x0001c807
ata2.00: configured for UDMA/133
ata2: EH complete
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
131072 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata2: soft resetting port
ATA: abnormal status 0x7F on port 0x0001c807
ATA: abnormal status 0x7F on port 0x0001c807
ata2.00: configured for UDMA/133
ata2: EH complete
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
131072 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata2: soft resetting port
ATA: abnormal status 0x7F on port 0x0001c807
ATA: abnormal status 0x7F on port 0x0001c807
ata2.00: configured for UDMA/133
ata2: EH complete
ata2.00: limiting speed to UDMA/100:PIO4
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
131072 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata2: soft resetting port
ATA: abnormal status 0x7F on port 0x0001c807
ATA: abnormal status 0x7F on port 0x0001c807
ata2.00: configured for UDMA/100
ata2: EH complete
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: cmd c8/00:00:9f:c9:cb/00:00:00:00:00/e1 tag 0 cdb 0x0 data
131072 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata2: soft resetting port
ATA: abnorm...

To: Nigel Kukard <nkukard@...>
Cc: Dave Jones <davej@...>, Jeff Garzik <jeff@...>, <linux-kernel@...>, <linux-ide@...>
Date: Monday, September 10, 2007 - 5:02 am

I think I was told last week that copying the appropriate mailing list will
at least prevent chirping, so let's try that.

-

To: Andrew Morton <akpm@...>
Cc: Nigel Kukard <nkukard@...>, Dave Jones <davej@...>, Jeff Garzik <jeff@...>, <linux-kernel@...>, <linux-ide@...>
Date: Thursday, September 13, 2007 - 4:55 am

It's removed (finally). :-)

--
tejun
-

Previous thread: Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 by Nicolas Mailhot on Thursday, June 14, 2007 - 8:40 am. (2 messages)

Next thread: heartbeat problems on linux 2.6.21.[34] by Vitez Gabor on Thursday, June 14, 2007 - 8:52 am. (2 messages)