NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
mtkeane
Jun 20, 2014Aspirant
ReadyNAS NV+ Hanging...think it's a bad disk
3 times over the past 2 weeks my ReadyNAS NV+ has hung...I can't access shares or FrontView. I have to do a hard boot by pulling the power cable. Once it comes back up it does a resync which takes a long time. I suspect it is possibly a bad disk b/c I think I heard some clicking coming from it the other day, but it isn't consistent and only heard it once. This has been working fine for about 6 months. My problem is I can't find any useful info in Frontview to tell me which disk to test....normally it tells me which disk is the culprit. I don't want to pull each disk one-by-one b/c I am afraid if it hangs or is the wrong disk, the RAID will fail and I will lose the data. I do see SMART info, just not sure what to make of it...what is considered good and what is bad. Does anyone know of a good way to test disks online still in the NAS?
Any help would be GrEaTlY appreciated!
Mike
My config looks like so:
I am running RAIDiator 4.1.13
4x 2TB Disks in an X-RAID
Ch 1 : Seagate ST32000542AS [1862 GB]
Ch 2 : Seagate ST2000DL004 HD204UI [1862 GB]
Ch 3 : Seagate ST2000DM001-9YN164 [1862 GB]
Ch 4 : Seagate ST32000542AS [1862 GB]
SMART Information for Disk 1
Model: ST32000542AS
Serial: <REMOVED BY MODERATOR>
Firmware: CC34
SMART Attribute
Spin Up Time 0
Start Stop Count 82
Reallocated Sector Count 0
Power On Hours 20131
Spin Retry Count 0
Power Cycle Count 78
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 7
High Fly Writes 54
Airflow Temperature Cel 39
Temperature Celsius 39
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 3
Head Flying Hours 88098369195495
Total LBAs Written 4256263024
Total LBAs Read 4178922060
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 1201
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 2
Model: ST2000DL004 HD204UI
Serial: <REMOVED BY MODERATOR>
Firmware: 1AQ10001
SMART Attribute
Throughput Performance 0
Spin Up Time 7783
Start Stop Count 18
Reallocated Sector Count 0
Seek Time Performance 0
Power On Hours 14930
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 17
Program Fail Cnt Total 6
G-Sense Error Rate 1
Power-Off Retract Count 0
Temperature Celsius 39
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Load Retry Count 0
Load Cycle Count 18
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 561
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 3
Model: ST2000DM001-9YN164
Serial: <REMOVED BY MODERATOR>
Firmware: CC4C
SMART Attribute
Spin Up Time 0
Start Stop Count 18
Reallocated Sector Count 0
Power On Hours 16822
Spin Retry Count 0
Power Cycle Count 18
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 3
Command Timeout 1
High Fly Writes 35
Airflow Temperature Cel 44
G-Sense Error Rate 0
Power-Off Retract Count 17
Load Cycle Count 165
Temperature Celsius 44
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Head Flying Hours 92586610016692
Total LBAs Written 7112055710370
Total LBAs Read 14895927521725
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 577
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 4
Model: ST32000542AS
Serial: <REMOVED BY MODERATOR>
Firmware: CC34
SMART Attribute
Spin Up Time 0
Start Stop Count 1189
Reallocated Sector Count 0
Power On Hours 15731
Spin Retry Count 0
Power Cycle Count 62
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 1
High Fly Writes 0
Airflow Temperature Cel 39
Temperature Celsius 39
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 4
Head Flying Hours 136034499180397
Total LBAs Written 1129722558
Total LBAs Read 2693633828
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
Any help would be GrEaTlY appreciated!
Mike
My config looks like so:
I am running RAIDiator 4.1.13
4x 2TB Disks in an X-RAID
Ch 1 : Seagate ST32000542AS [1862 GB]
Ch 2 : Seagate ST2000DL004 HD204UI [1862 GB]
Ch 3 : Seagate ST2000DM001-9YN164 [1862 GB]
Ch 4 : Seagate ST32000542AS [1862 GB]
SMART Information for Disk 1
Model: ST32000542AS
Serial: <REMOVED BY MODERATOR>
Firmware: CC34
SMART Attribute
Spin Up Time 0
Start Stop Count 82
Reallocated Sector Count 0
Power On Hours 20131
Spin Retry Count 0
Power Cycle Count 78
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 7
High Fly Writes 54
Airflow Temperature Cel 39
Temperature Celsius 39
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 3
Head Flying Hours 88098369195495
Total LBAs Written 4256263024
Total LBAs Read 4178922060
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 1201
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 2
Model: ST2000DL004 HD204UI
Serial: <REMOVED BY MODERATOR>
Firmware: 1AQ10001
SMART Attribute
Throughput Performance 0
Spin Up Time 7783
Start Stop Count 18
Reallocated Sector Count 0
Seek Time Performance 0
Power On Hours 14930
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 17
Program Fail Cnt Total 6
G-Sense Error Rate 1
Power-Off Retract Count 0
Temperature Celsius 39
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Load Retry Count 0
Load Cycle Count 18
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 561
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 3
Model: ST2000DM001-9YN164
Serial: <REMOVED BY MODERATOR>
Firmware: CC4C
SMART Attribute
Spin Up Time 0
Start Stop Count 18
Reallocated Sector Count 0
Power On Hours 16822
Spin Retry Count 0
Power Cycle Count 18
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 3
Command Timeout 1
High Fly Writes 35
Airflow Temperature Cel 44
G-Sense Error Rate 0
Power-Off Retract Count 17
Load Cycle Count 165
Temperature Celsius 44
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Head Flying Hours 92586610016692
Total LBAs Written 7112055710370
Total LBAs Read 14895927521725
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 577
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
SMART Information for Disk 4
Model: ST32000542AS
Serial: <REMOVED BY MODERATOR>
Firmware: CC34
SMART Attribute
Spin Up Time 0
Start Stop Count 1189
Reallocated Sector Count 0
Power On Hours 15731
Spin Retry Count 0
Power Cycle Count 62
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 1
High Fly Writes 0
Airflow Temperature Cel 39
Temperature Celsius 39
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 4
Head Flying Hours 136034499180397
Total LBAs Written 1129722558
Total LBAs Read 2693633828
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
8 Replies
Replies have been turned off for this discussion
- mdgm-ntgrNETGEAR Employee RetiredPower down, remove disks (label order), hook them up to your PC and check them using SeaTools
- ReadySECUREApprenticeHi.
Disk 1 has command timeouts and UDMA CRC errors.
Disk 2 has Program Fail count errors.
Disk 3 has reported Uncorrectables and command timeout.
Disk 4 has command timeout and UDMA CRC errors.
Your disks have varying ages. It looks like they are almost at least 2 years old (some are older than others). Most individuals are lucky to have their drives last so long in a RAID device. - StephenBGuru - Experienced User
They should last longer than that. Backblaze uses consumer drives, and 80% of their drives are still running after 4 years. They expect half them will still be running after 6 years. http://blog.backblaze.com/2013/11/12/ho ... ives-last/readysecure1985 wrote: ...Your disks have varying ages. It looks like they are almost at least 2 years old (some are older than others). Most individuals are lucky to have their drives last so long in a RAID device...
All four drives with errors, ages from 10 months to 27 months is certainly not lucky - even the relatively unreliable Seagates should do better than that. http://blog.backblaze.com/2014/01/21/wh ... uld-i-buy/ - mdgm-ntgrNETGEAR Employee RetiredYou could still see if SeaTools thinks some drives are in worse condition than others.
You could end up needing to try cloning some disks onto good new ones e.g. using dd_rescue.
And another thing, please don't post the serial numbers for drives. We don't need that information. - StephenBGuru - Experienced UserThese stats look pretty reasonable to me.
The main thing that jumps out at me is the "program fail" count for drive 2. Though the description of that stat isn't all that clear, it sounds like it refers to a program failure within the drive (e.g. the drive firmware). Though seagates don't usually report that particular parameter, it is reported on that drive because it was developed by Samsung.
And of course there are the uncorrectable errors on drive 3.
So maybe start with an extended test of drive 2 and 3.
CRC errors and command timeouts could be a NAS problem, they aren't necessarily the fault of the drives. - mtkeaneAspirantHey all,
Thanks so much for the great suggestions and information.
Stephen B. I really appreciate the extra analysis and giving me a way to go on which drives to try first...this is what I was really looking for in terms of all the drives have errors, but what seems the most drastic. I will start with drive 2 & 3.
On a separate note, since it looks like I will be replacing some drives, does anyone have any good suggestions or methods they use for buying drives for their NAS's? I hear allot about red drives over black drives, Seagate over Hitachi, but I am mostly looking for dependability. In the past, I always just went with the best bang for my buck, but maybe that doesn't get me as far down the road as I thought it would.
Thoughts? - StephenBGuru - Experienced UserI posted a link earlier to the BackBlaze recommendations (based on their failure rates): http://blog.backblaze.com/2014/01/21/wh ... uld-i-buy/
They seem to take your strategy btw, as they continue to buy Seagates based on price, even though they expect them to be less reliable. However, they find Hitachi to be the most reliable, followed closely by Western Digital. Most people seem to feel that enterprise drives are not worth the extra money, and don't perform differently from consumer grade drives. I've never purchased enterprise drives, so I have no personal experience with them.
Personally I favor the WDC Reds. I have 14 drives in 4 NAS at the moment (which is admittedly crazy. I will consolidate to 2 in the future, but that is a separate subject...).
In terms of drives:
4 are WDC Green drives - which have been working for 4 years with no issues. But I don't really recommend them.
8 are WDC Red drives. I do recommend them, since they are on the HCL, have 3 year warranties,are tuned for NAS and are as "green" as the WDC Green drives. They are also acoustically quiet and run cool. I haven't had any fail. Generally they are only slightly more expensive than WDC consumer drives. So I think they are a good deal overall.
2 are Seagates. These have been in place for quite a while. There used to be more, but I have gradually replaced them with WDC as they have failed.
My reason for switching away from Seagates was that I had a bunch of failures a couple of years ago - both internal drives (mainly 1.5 TB) and USB drives of various sizes. I've had better luck with the 2.5" seagates than the 3.5. I've had much lower failure rates with Western Digital.
Keep in mind that all information is retrospective - generally based on older drive models no longer in production. - mtkeaneAspirantHey Stephen B.,
Thanks so much for all the great personal recommendations....this is exactly the type of information that I was looking for!
I did read the article(s) that you posted from BackBlaze, but was looking for something of experience as well...you nailed it with your what you use and why experiences. I appreciate the objective offerings you provided and hopefully this will help others as well. I've decided on the WDC Red drives as well, so I am off to buy some at a local place that has them in stock and plan my migration.
Thanks again!
Related Content
- Aug 06, 2024Retired_Member
NETGEAR Academy
Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!