[Sysadmins] Отказ HDD

Nikolay =?iso-8859-1?q?sn=5Fkirovograd=5Fsub_=CE=C1_rambler=2Eru?=
Чт Дек 20 18:33:10 MSK 2007


Вот  приключилось плохое 

Dec 20 17:04:34 hq kernel: ata1.00: speed down requested but no transfer mode 
left
Dec 20 17:04:34 hq kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 
action 0x2 frozen
Dec 20 17:04:34 hq kernel: ata1.00: tag 0 cmd 0xc4 Emask 0x4 stat 0x40 err 0x0 
(timeout)
Dec 20 17:04:34 hq kernel: ata1: soft resetting port
Dec 20 17:04:34 hq kernel: ata1: softreset failed (port busy but CLO 
unavailable)
Dec 20 17:04:34 hq kernel: ata1: softreset failed, retrying in 5 secs
Dec 20 17:04:39 hq kernel: ata1: hard resetting port
Dec 20 17:04:47 hq kernel: ata1: port is slow to respond, please be patient 
(Status 0x80)
Dec 20 17:05:10 hq kernel: ata1: port failed to respond (30 secs, Status 0x80)
Dec 20 17:05:10 hq kernel: ata1: COMRESET failed (device not ready)
Dec 20 17:05:10 hq kernel: ata1: hardreset failed, retrying in 5 secs
Dec 20 17:05:15 hq kernel: ata1: hard resetting port
Dec 20 17:05:15 hq kernel: ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 
300)
Dec 20 17:05:15 hq kernel: ata1.00: configured for PIO0
Dec 20 17:05:15 hq kernel: ata1: EH complete
------------------------------------------------------

Dec 20 17:06:56 hq kernel: ata1.00: configured for PIO0
Dec 20 17:06:56 hq kernel: sd 0:0:0:0: SCSI error: return code = 0x08000002
Dec 20 17:06:56 hq kernel: sda: Current: sense key: Aborted Command
Dec 20 17:06:56 hq kernel:     Additional sense: No additional sense 
information
Dec 20 17:06:56 hq kernel: Info fld=0x4697415
Dec 20 17:06:56 hq kernel: end_request: I/O error, dev sda, sector 20527255
Dec 20 17:06:56 hq kernel: ata1: EH complete
+
В messages было до установки ядра 2.6.18-std-smp-alt9 
ata2: spurious interrupt (irq_stat 0x8 active_tag -84148995 sactive 0x0)
ata2: spurious interrupt (irq_stat 0x8 active_tag -84148995 sactive 0x0)
ata4: spurious interrupt (irq_stat 0x8 active_tag -84148995 sactive 0xff3)

Сейчас при нагрузке на hdd приличной 
Просто валит ошибки и пытается ресетить порт
Когда стояло 32бит pae ядро такого ненаблюдалось
Именно pae..


Система ALTLinux Master 4
Ядро Linux hq. 2.6.18-std-smp-alt9 #1 SMP Mon Nov 26 00:53:11 MSK 2007 x86_64 
GNU/Linux

Очень похоже на 
http://ussg.iu.edu/hypermail/linux/kernel/0612.3/0452.html

lspi говорит 
00:00.0 Host bridge: Intel Corporation E7220/E7221 Memory Controller Hub 
00:01.0 PCI bridge: Intel Corporation E7220/E7221 PCI Express Root Port 
00:02.0 VGA compatible controller: Intel Corporation E7221 Integrated Graphics 
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3)
00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface 
00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) 
00:1f.2 SATA controller: Intel Corporation 82801FR/FRW (ICH6R/ICH6RW) SATA 
00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus 
01:00.0 PCI bridge: Intel Corporation 6702PXH PCI Express-to-PCI Bridge A 
01:00.1 PIC: Intel Corporation 6700/6702PXH I/OxAPIC Interrupt Controller A 

Не подскажете как быть в данной ситуации ?
Самому что либо патчить - да ещё на рабочем роутере нехватает знаний :(


-- 
С уважением, Николай


Подробная информация о списке рассылки Sysadmins