OVH Community, your new community space.

SATA issues


cyclo
17-03-2013, 16:52
I managed to solve the problem by disabling the write cache.
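
The post doesn't say how the write cache was disabled. One common way on Linux is `hdparm`, so here is a minimal sketch under that assumption. The helper name and the device path are hypothetical; the thread never names the disk.

```shell
# Hypothetical helper: turn off a drive's volatile write cache with hdparm.
# The device path is passed in because the thread does not say which disk it is.
disable_write_cache() {
  dev="$1"
  hdparm -W 0 "$dev"   # -W 0 disables write caching (needs root)
  hdparm -W "$dev"     # query the setting again to confirm the change
}

# Example, as root (device name is an assumption):
#   disable_write_cache /dev/sda
```

The setting may not survive a power cycle on every drive, so it may need to be reapplied at boot. Expect slower writes with the cache off.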

cyclo
14-03-2013, 10:15
I hope it doesn't come to that. In the meantime I've updated the kernel to 3.7.10 and just hope that fixes it.

Kode
14-03-2013, 09:16
I know it's not ideal, but as a worst case you could order a new SP32G a couple of days before yours is due to expire, copy everything across, and let the old one expire.

cyclo
14-03-2013, 08:13
lspci -k reports:

00:1f.2 SATA controller: Intel Corporation 6 Series/C200 Series Chipset Family SATA AHCI Controller (rev 05)
Subsystem: Intel Corporation Device 2002
Kernel driver in use: ahci

That seems to be the correct one. smartctl doesn't report any errors that I can see, although for some reason I can't run a long test; it just does a short one every time.

I've been googling extensively for solutions and can't really find anything. Some people suggest disabling NCQ, others suggest disabling ASPM, which I've tried. I don't want to disable NCQ, as that would cost performance.
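
For reference, both workarounds can be expressed as kernel boot parameters. The fragment below is a sketch that assumes the server boots its own kernel through GRUB 2 and that the settings live in `/etc/default/grub`; neither is stated in the thread. `ata2.00` is the device named in the kern.log excerpt from the first post.

```shell
# Fragment for /etc/default/grub (file location and GRUB 2 are assumptions).
# libata.force=2.00:noncq turns NCQ off for ata2.00 only;
# pcie_aspm=off disables PCIe Active State Power Management.
GRUB_CMDLINE_LINUX="libata.force=2.00:noncq pcie_aspm=off"
```

After editing, the GRUB configuration has to be regenerated (for example with `update-grub` on Debian-based systems) and the machine rebooted. `cat /proc/cmdline` shows whether the parameters were picked up.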

It just feels like I've wasted money upgrading to a better server (had an SP 16G before and that was fine). Customer support told me to run a system test in rescue mode, did that, nothing showed up. Although the HDD tests were very short. I just feel at a loss right now.

I understand that OVH won't do anything unless it's a hardware problem, but at the same time I've been having this issue since I first rented it, so it's not something I've done to make it like that

Myatu
13-03-2013, 20:40
Make sure the appropriate driver for the HD controller is loaded (check with `lspci -k` and google if that's the right driver).

You can also check the HDD with `smartctl -x`, and use smartctl to tell the HDD to perform a self-test.
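
A rough sketch of those smartctl steps. The helper names are hypothetical and the device path is an assumption, since the thread does not name the disk.

```shell
# Hypothetical wrappers around smartctl; pass the disk as the first argument.
smart_report()    { smartctl -x "$1"; }           # print all SMART and device information
smart_long_test() { smartctl -t long "$1"; }      # start an extended self-test on the drive
smart_test_log()  { smartctl -l selftest "$1"; }  # show the self-test log with results

# Example, as root (device name is an assumption):
#   smart_long_test /dev/sda
#   smart_test_log /dev/sda    # check again once the test has had time to finish
```

The extended test runs on the drive itself in the background and can take hours, so the result only shows up in the self-test log later.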

cyclo
13-03-2013, 10:36
I ordered an SP 32G server less than a day ago and I'm noticing the errors below in kern.log.

The entire system freezes when this happens, dropping every network connection in the process, which is causing serious problems.

Mar 12 13:03:02 ns363974 kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Mar 12 13:03:02 ns363974 kernel: ata2.00: failed command: FLUSH CACHE
Mar 12 13:03:02 ns363974 kernel: ata2.00: cmd e7/00:00:00:00:00/00:00:00:00:00/a0 tag 0
Mar 12 13:03:02 ns363974 kernel: res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Mar 12 13:03:02 ns363974 kernel: ata2.00: status: { DRDY }
Mar 12 13:03:02 ns363974 kernel: ata2: hard resetting link
Mar 12 13:03:02 ns363974 kernel: ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Mar 12 13:03:02 ns363974 kernel: ata2.00: configured for UDMA/133
Mar 12 13:03:02 ns363974 kernel: ata2.00: retrying FLUSH 0xe7 Emask 0x4
Mar 12 13:03:02 ns363974 kernel: ata2.00: device reported invalid CHS sector 0
Mar 12 13:03:02 ns363974 kernel: ata2: EH complete

Any suggestions would be appreciated.
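
To see how often this happens and which command fails, the "failed command" lines in a log like the excerpt above can be counted per device. This is a minimal sketch: the function name is hypothetical, and the log path in the example is an assumption about where the distribution keeps kern.log.

```shell
# Hypothetical helper: count "failed command" entries per ATA device and command.
# Prints one line per combination: "<count> <device> <command>".
ata_failed_commands() {
  awk '
    match($0, /ata[0-9]+(\.[0-9]+)?: failed command: /) {
      dev = substr($0, RSTART, RLENGTH)       # e.g. "ata2.00: failed command: "
      sub(/: failed command: $/, "", dev)     # keep only the device name
      cmd = substr($0, RSTART + RLENGTH)      # rest of the line is the command
      count[dev " " cmd]++
    }
    END { for (k in count) print count[k], k }
  ' "$1" | sort
}

# Example (log path is an assumption):
#   ata_failed_commands /var/log/kern.log
```

If only FLUSH CACHE shows up for a single device, that points at the write path of that one drive rather than at the controller as a whole.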