Re: What do these SATA errors mean / kernel 2.6.25.6 (DRDY ERR/ICRC ABRT)

Previous thread: [PATCH] Clean up thermal API by Matthew Garrett on Wednesday, June 11, 2008 - 3:06 am. (14 messages)

Next thread: Oops: 0000 [#1] PREEMPT SMP by Walter Franzini on Wednesday, June 11, 2008 - 3:21 am. (2 messages)
From: Justin Piszcz
Date: Wednesday, June 11, 2008 - 3:14 am

Never had a single error so far, powered down my host, powered it back up,
and now with kernel 2.6.25.6:

Jun 11 05:23:24 p34 kernel: [   67.118632] mtrr: no more MTRRs available
Jun 11 05:46:23 p34 kernel: [ 1445.288619] ata12.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2
Jun 11 05:46:23 p34 kernel: [ 1445.288626] ata12.00: irq_stat 0x00060002, device error via D2H FIS
Jun 11 05:46:23 p34 kernel: [ 1445.288632] ata12.00: cmd 35/00:f8:47:dc:35/00:03:02:00:00/e0 tag 0 dma 520192 out
Jun 11 05:46:23 p34 kernel: [ 1445.288634]          res 51/84:f8:47:dc:35/00:03:02:00:00/e0 Emask 0x10 (ATA bus error)
Jun 11 05:46:23 p34 kernel: [ 1445.288637] ata12.00: status: { DRDY ERR }
Jun 11 05:46:23 p34 kernel: [ 1445.288639] ata12.00: error: { ICRC ABRT }
Jun 11 05:46:23 p34 kernel: [ 1445.288649] ata12: hard resetting link
Jun 11 05:46:25 p34 kernel: [ 1447.419983] ata12: SATA link up 3.0 Gbps (SStatus 123 SControl 0)
Jun 11 05:46:25 p34 kernel: [ 1447.429612] ata12.00: configured for UDMA/100
Jun 11 05:46:25 p34 kernel: [ 1447.429628] ata12: EH complete
Jun 11 05:46:25 p34 kernel: [ 1447.813910] sd 11:0:0:0: [sdl] Write Protect is off
Jun 11 05:46:25 p34 kernel: [ 1447.813912] sd 11:0:0:0: [sdl] Mode Sense: 00 3a 00 00
Jun 11 05:46:25 p34 kernel: [ 1447.813928] sd 11:0:0:0: [sdl] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jun 11 06:00:32 p34 kernel: [ 2293.491350] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
Jun 11 06:00:32 p34 kernel: [ 2293.491360] ata1.00: cmd 35/00:02:43:90:7d/00:00:12:00:00/e0 tag 0 dma 1024 out
Jun 11 06:00:32 p34 kernel: [ 2293.491362]          res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 11 06:00:32 p34 kernel: [ 2293.491365] ata1.00: status: { DRDY }
Jun 11 06:00:32 p34 kernel: [ 2293.794295] ata1: soft resetting link
Jun 11 06:00:32 p34 kernel: [ 2293.947277] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Jun 11 06:00:32 p34 kernel: [ 2294.614206] ata1.00: configured for UDMA/133
Jun 11 ...
From: Justin Piszcz
Date: Wednesday, June 11, 2008 - 4:33 am

Will replace/re-connect/check cables/connectors, a long test on each disk
just passed fine as well but there was a single (1) CRC error, could be the
cables/connectors/will verify later today.

Justin.

--

From: Jeff Garzik
Date: Wednesday, June 11, 2008 - 7:50 pm

http://ata.wiki.kernel.org/index.php/Libata_error_messages

In particular, timeouts may be solved by acpi=off or 'noapic' or

Yes.  ATA is always back-compatible.

	Jeff





--

From: Tejun Heo
Date: Sunday, June 15, 2008 - 8:52 pm

And a write command timed out which is also often caused by transmission

No, according to the log, there was no slow down.  Transmission speed is

For SATA drives, occasional transmission problems are expected even on
otherwise pretty healthy systems.  No need to worry about it too much
unless the problem repeats itself a lot.

-- 
tejun
--

Previous thread: [PATCH] Clean up thermal API by Matthew Garrett on Wednesday, June 11, 2008 - 3:06 am. (14 messages)

Next thread: Oops: 0000 [#1] PREEMPT SMP by Walter Franzini on Wednesday, June 11, 2008 - 3:21 am. (2 messages)