Hi,sorry if this has been discussed already, all i could find are topics about upgrading the firmware on these drives.I've got a NV+v2 with 4 3TB ST3000DM001-9YN166 disks in it, firmware CCH4 (the latest).System has been running for a couple of months now, and all was working smoothly until i got a warning 2 weeks ago:Detected increasing command timeouts[65537] on disk 1 [ST3000DM001-9YN166, W1F083HJ]. This often indicates an impending failure. Please be prepared to replace this disk to maintain data redundancy.Looking into the SMART status, it indeed gives some ridiculously high number (4295032833) on command time out for disk 1. On the other 3 disks it's 0.Another thing i noticed in the SMART status for all disks though, is that the load count cycle is quite high: around 43000 after 1800 power on hours.After a bit of googling however, all i could find was this problem cropping up on WD green drives. The solution seems to be to set the idle time a bit higher.I did find this old topic (http://www.readynas.com/forum/viewtopic.php?f=36&t=51536) explaining how to do it on some other seagate model, but i wanted to get some feedback before i attempt to change things via ssh. Don't wanna corrupt anything.So basically my questions are:1) Is the command time out problem in anyway related to the lcc problem, or is this drive just about to die no matter what?2) Does anybody else have these high lcc numbers with the ST3000DM001 drives?3) Is it worthwhile changing the idle time setting with hdparm?3b) If yes, are the instructions in that old topic still valid?On a related note, i did install the ssh add-on and looked around a little, however when i try to get drive status with hdparm i get this:root@nas:~# hdparm -i /dev/sda/dev/sda: HDIO_GET_IDENTITY failed: Inappropriate ioctl for deviceSame for sdb, sdc, and sddAm i using the right identifiers here? Sorry for the ignorance :oops: Hope this isn't all too confusing, i tried to cram a lot of information in this post :-)Thanks in advance for any help!EDIT: apparently i do not have permission to use the url tag..

Hi, I have the exact same setup (NV+v2 with 4 3TB ST3000DM001-9YN166 with firmware CCH4) and im getting the same warning for disk 1.Detected increasing command timeouts[65537] on disk 1 [ST3000DM001-9YN166, W1F0NNDC]. This often indicates an impending failure. Please be prepared to replace this disk to maintain data redundancy

Command Timeout: Indicates a number of aborted operations due to hard disk timeoutthis is a critical parameterit would be a good idea to connect the drive to your computer and test with sea tools > http://www.seagate.com/support/downloads/seatools/

This is the list of known S.M.A.R.T. attributes supported by IDE and Serial ATA hard disks.Note: some manufacturers may use the attributes for different purposes also.Attributes not listed here are "vendor specific" attributes (their purpose is not known)Raw Read Error Rate - Errors occured while reading raw data from a diskIndicate problem with the disk surface or the read/write heads.Critical attributeThroughput Performance - General throughput performance of the hard diskIndicate problem with motor, servo or bearings.Spin Up Time - Time needed by spindle to spin-up to full RPMIndicate problem with motor or bearings.Critical attributeStart/Stop Count - Count of start/stop cycles of spindleThis value does not directly affect the condition of the drive.Reallocated Sector Count (Reallocated Sectors Count) - Count of sectors moved to the spare areaIndicate problem with the disk surface or the read/write heads.Critical attributeCommand Timeout - Indicates a number of aborted operations due to hard disk timeoutCritical attributeRead Channel Margin - Margin of a channel while reading dataThe exact function of this attribute is not specified.Seek Error Rate - Rate of positioning errors of the read/write headsIndicate problem with servo, head. High temperature can also cause this problem.Critical attributeSeek Time Performance - Average time of seek operations of the headsIndicate problem with servo.Critical attributePower-On Time Count - Total time the drive is powered onThe unit of the measure depends on the manufacturer.Spin Retry Count - Retry count of spin start attemptsIndicate problem with motor, bearings or power supply.Critical attributeDrive Calibration Retry Count - Number of attempts to calibrate a driveIndicate problem with motor, bearings or power supply.Drive Power Cycle Count - Number of complete power on/off cyclesThis value does not directly affect the condition of the drive.Soft Read Error Rate - Number of software read errorsThe number of uncorreactable read errors.Airflow Temperature - Airflow temperatureThe temperature of the air inside the hard disk housing.Mechanical Shock - Count of problems caused by mechanical shockAcceleration (for example falling) can cause mechanical shock.Power off Retract Cycle - Count of power off cyclesThis value does not directly affect the condition of the drive.Load/Unload Cycle Count - Count of load/unload cyclesNumber of cycles the head moved into landing zone position.HDD Temperature - Disk temperatureThe temperature inside the hard disk housing.Hardware ECC Recovered - Count of correctable errorsNumber of errors corrected by the internal error correcting mechanism.Reallocation Event Count - Count of sector remap operationsNumber of all (successful and failed) remap operations.Critical attributeCurrent Pending Sector Count - Count of unstable sectorsThese pending sectors may be remapped to the spare area.Critical attributeOff-Line Uncorrectable Sector Count - Count of uncorrectable errors when reading/writingIndicate problem with the disk surface or the read/write heads.Critical attributeUltra ATA CRC Error Count - Count of errors during data transfer between disk and hostIndicate problem with the power supply or data cable.Write Error Rate - Errors occured while writing raw data from a diskIndicate problem with the disk surface or the read/write heads.Soft Read Error Rate - Number of software read errorsThe number of uncorreactable read errors.Data Address Mark Errors - Number of data address mark errorsNumber of incorrect or invalid address marks.Run Out Cancel - Number of data correction errorsInvalid error correction checksum found during error correction.Soft ECC Correction - Number of corrected data errorsErrors corrected by the internal error correction mechanism.Thermal Asperity Rate - Number of thermal problemsTotal number of problems caused by high temperature.Flying Height - Head flying heightThe height of the disk heads above the disk surface.Spin High Current - Current value during spin upThe current needed to spin up the drive.Spin Buzz - Number of cycles needed to spin upThe number of retries during spin up because of low current available.Offline Seek Performance - Drive performance during offline operationsThe seek performance of the drive during internal self tests.Disk Shift - Distance of the disk has shifted relative to the spindleIncorrect disk spin can be cause by mechanical shock or high temperature.G-Sense Error Rate - Number of mechanical errorsNumber of errors resulting from shock or vibration.Loaded Hours - Number of powered on hoursThis value is constantly increasing (once per every hour).Load/Unload Retry Count - Number of load/unload operationsThe number of drive head enters/leaves the data zone.Load Friction - Mechanical friction rateThe rate of friction between mechanical parts. Indicate problem with the mechanical subsystem of the drive.Load-in Time - Total time the heads are loadedThe time while the read/write heads are in the data zone.Torque Amplification Count - Rate of torque increaseTorque increase during the spin up operation of the hard disk.Power-off Retract Count - Number of power off cyclesThe number of times the head was retracted as a result of power loss.GMR Head Amplitude - Head positioning amplitudeHead moving distances between operations.Hard Disk Temperature - Disk temperatureThe temperature inside the hard disk housing.Head Flying Hours - Number of head positioning hoursTime spent during the positioning of the drive heads.Read Error Retry Rate - Number of retries during read operationsNumber of errors found during reading a sector from disk surface.

I have the same configuration, a NV+v2 with 4 3TB ST3000DM001-9YN166 disks upgraded firmware CCH4 (the latest).And the identical message error:Detected increasing command timeouts[524296] on disk 1 [ST3000DM001-9YN166, S1F03ANB]. This often indicates an impending failure. Please be prepared to replace this disk to maintain data redundancy.I just checked (two days ago, after the upgrade to new firmware) all the disks with seatools and there were no problems..

what command timeout count do you see in the smart stats?

[Solved] Seagate ST3000DM001-9YN166: Load Cycle Count

50 Replies

Replies have been turned off for this discussion

StephenB

Guru - Experienced User

Oct 23, 2012

intothevoid wrote:
The problem appears isolated to the disk test. So that would seem to rule out the supply and the contacts - they wouldn't mysteriously heal when the test completes.

It is conceivable that the disk test includes some commands that provoke the drive to fail. For instance, malformed/illegal commands, or perhaps legal but unusual commands that the drive firmware doesn't handle properly.

I'd suggest a support case.

In the meantime I would simply not run that test! And record/track the current counts and confirm that they never increase in normal operation.

Is that test run automatically at boot, or on a schedule? I've never manually triggered it.
Both times my drive 'failed' (and had a higher command timeout value afterwards) the NAS was in normal operation.

EDIT: Included quote for top of page clarity.

I believe he means the "full disk test" function on the boot menu, which is described in the V2 hardware manual. If so, that test is only done upon request.

It looks to me like there is more than one issue here related to command timeouts for this disk model. For some, the command timeout errors in the alerts/log appear to be false (at least the count is reported as 0 when looking directly at the SMART stats). Others have some real command timeout errors, but the count in the alert is wrong. In spett's case, the timeout counts (though huge) are confirmed when looking directly at the SMART stats, and these counts only go up when he manually runs a disk test on the ReadyNAS.

I'm not sure how to untangle this now, since we have several folks seeing different things posting on the same thread (whose title is not about command timeouts at all!).

spett

Aspirant

Oct 23, 2012

intothevoid wrote:
The problem appears isolated to the disk test. So that would seem to rule out the supply and the contacts - they wouldn't mysteriously heal when the test completes.

It is conceivable that the disk test includes some commands that provoke the drive to fail. For instance, malformed/illegal commands, or perhaps legal but unusual commands that the drive firmware doesn't handle properly.

I'd suggest a support case.

In the meantime I would simply not run that test! And record/track the current counts and confirm that they never increase in normal operation.

Is that test run automatically at boot, or on a schedule? I've never manually triggered it.
Both times my drive 'failed' (and had a higher command timeout value afterwards) the NAS was in normal operation.

EDIT: Included quote for top of page clarity.

No, I have run this test manually trough the boot menu. That is a offline hard drive test.
Haven't noticed anything unusual in normal operation without the nas completely freezing when using sabnzbd from readynas extras. Could be a problem with sabnzbd, just strange that no one is mention any problems with it..

toomanybarts
Aspirant
Oct 24, 2012
I am running a different seagate drive st32000641as, but again in Drive 1 and have just rcvd error with high command timeouts of 4295032833.
I took out drive and ran sea tools and the drive passed every test.
The other 2 drives (I'm only running 3 drives at present in the NV+ v2) are fine at zero command timeouts.

I did have disk spindown set to 15 minutes and have since turned that off.
intothevoid
Aspirant
Oct 24, 2012
Hmm, seems to me like we're looking at two different paths leading to the same problem.

1) running the disk test leads to an extremely high command timeout on disk1.
2) the NAS drops disk1 saying it has failed, after reboot it resyncs and the command time out has increased (to a ridiculous value).

The high load cycle count form the thread title was my original guess for the cause of the problem, but after that was fixed with the HDPARM tweak, the command timeout problem (via scenario 2) persisted.

So either there's something wrong with all our drives, and it's a coincidence they all happen to be installed in the first slot on the NAS, or there is some weird compatibility issue with these seagate ST's and the NV+ v2, which causes the drive installed as disk1 to fail.

It's still strange why for some the command time out value seems to be real and for some not though. To complicate matters, I cannot test my drives in a normal computer as i only have a laptop...
StephenB
Guru - Experienced User
Oct 24, 2012
intothevoid wrote:
To complicate matters, I cannot test my drives in a normal computer as i only have a laptop...
Seatools will find and test them even if they are in a USB enclosure or USB->Sata adapter. My laptop has eSata, and that of course works as well (though you need an eSata->Sata cable, and a power supply).

Though you can test it (and verify the SMART stats), upgrading the firmware when you are using USB is another matter.

intothevoid wrote:
To complicate matters, I cannot test my drives in a normal computer as i only have a laptop...

spett

Aspirant

Oct 24, 2012

intothevoid wrote:

So either there's something wrong with all our drives, and it's a coincidence they all happen to be installed in the first slot on the NAS, or there is some weird compatibility issue with these seagate ST's and the NV+ v2, which causes the drive installed as disk1 to fail.

intothevoid wrote:
So either there's something wrong with all our drives, and it's a coincidence they all happen to be installed in the first slot on the NAS, or there is some weird compatibility issue with these seagate ST's and the NV+ v2, which causes the drive installed as disk1 to fail.

I think I have ruled out faulty disks by resyncing a new drive in slot 1 and then sync the disc that where in slot 1 in slot 2.
Same result after a disk test, command timeout only increased on disk 1.

intothevoid

Aspirant

Oct 24, 2012

StephenB wrote:
Seatools will find and test them even if they are in a USB enclosure or USB->Sata adapter. My laptop has eSata, and that of course works as well (though you need an eSata->Sata cable, and a power supply).

Though you can test it (and verify the SMART stats), upgrading the firmware when you are using USB is another matter.

StephenB wrote:
Seatools will find and test them even if they are in a USB enclosure or USB->Sata adapter. My laptop has eSata, and that of course works as well (though you need an eSata->Sata cable, and a power supply). Though you can test it (and verify the SMART stats), upgrading the firmware when you are using USB is another matter.

Ah, good to know. Will test it this weekend when i have some more time.

Spett wrote:
I think I have ruled out faulty disks by resyncing a new drive in slot 1 and then sync the disc that where in slot 1 in slot 2.
Same result after a disk test, command timeout only increased on disk 1.

Spett wrote:
I think I have ruled out faulty disks by resyncing a new drive in slot 1 and then sync the disc that where in slot 1 in slot 2. Same result after a disk test, command timeout only increased on disk 1.

Yeah seems like it's not the drives. What do you guys think, time to open a support ticket? And if yes, can I refer to this thread if I make one?

toomanybarts
Aspirant
Oct 25, 2012
My 3 ST drives were donated from a friend who had them running in a DROBO enclosure error free for 6 months.
Now I know a drive can fail at any time - but coincidence that after a week in my brand new readynas nv+ v2, whilst sitting idle overnight, I suddenly get 4295032833 errors??
I dont think so.
I agree - sounds very much like Firmware issues on Readynas to me.
cynan
Aspirant
Oct 25, 2012
Just thought I'd add that I'm experiencing the same issue with my NV+ v2 and Seagate ST2000DM001. I've had the NV+ v2 and 3x ST2000DM001 working fine for just over 3 months now. All drives were updated by me the CC4H firmware when new, prior to being installed in the NAS. Also, prior to installing, I checked each disk with all Seatools tests (boot disk) and all were fine.

Suddenly, yesterday, I got an email from the NAS telling me that all 3 disks had failed. I ended up having to unplug the NAS from the wall to get it to power off (it was unresponsive). When I rebooted, my volume was back to normal and everything seems fine.

However, looking at the SMART info I see a command timeout value for Disk 1 of 4295032833. Both Drives 2 and 3 have a command timeout of "0". I have no idea what this value was before yesterday's incident or whether the two are related...

Seems like there might be a compatibility issue between these Seagate drives and the NV+ V2. Strange that it's always Drive 1 that has these super high command timeouts and that the high numbers are always about the same, if not identical.

I have never run any full disk check in the NAS (that you access through the boot menu) only the shorter volume check on reboot. My spin-down time is set for 60 min and has been so for months. have been on 5.3.5 the whole time.
cynan
Aspirant
Oct 26, 2012
FYI, I was reading on the Seagate forums that a SMART Command Timeout reading of 4295032833 is actually a hexadecimal value.

In hexadecimal, 4295032833 = 0x0001 0001 0001, which the poster thought represented a value of "1"

Therefore, the reading of 4295032833 probably doesn't indicate that a whole bunch of Command Timeouts has occurred, but rather only a single one.

Forum Discussion

[Solved] Seagate ST3000DM001-9YN166: Load Cycle Count

50 Replies

Related Content

Seagate ST3000DM001 9YN166 (fw CC82) compatible?

ReadyNAS RN104 fresh setup to replace Seagate NAS

Netgear Nighthawk RAX50V2 - Access Denied Bug SOLVED SOLUTION

SOLVED: ReadyNAS Backups of new Windows 11 PC

ST3000DM001-9YN166 causing problems

NETGEAR Academy

ProSupport for Business