
High Raw Read Error Rate: is it time to replace my drive?

fearless_fool
Aspirant

The story:

The last few times I powered up my ReadyNAS (following a shutdown), it found and corrected errors. But the last two times I've tried, it got stuck at 98% during File System Check.

This morning, I forced skipping of FSCK (http://www.readynas.com/kb/faq/boot/how ... lume_check) so I could read the log file. The log file showed a SATA reset (see below), so I have initiated a RAID resync.

While that was going on, I decided to look at the SMART status of the four drives. One of them (Disk 3) shows Raw Read Error Rate of 1764; for the other three drives, that value is 0.

One other factor: I have enabled fast CIFS writes. I have a UPS, but its battery has (finally) gone bad. The system has powered itself off at least once without an "orderly" shutdown.

My question:

Would a marginal drive (with raw read errors) account for the problems I'm seeing (stuck on FSCK, SATA reset)? Or could the unscheduled shutdown be responsible for that?

Most importantly, is it time to purchase a replacement drive? (I've already ordered a replacement battery for the UPS.)

The log file:


Sun Jan 19 06:33:23 PST 2014 RAID sync started on volume C.
Sun Jan 19 06:27:24 PST 2014 System is up.
Sun Jan 19 06:05:21 PST 2014 System powering off...
Wed Jan 8 21:18:54 PST 2014 A SATA reset has been performed on one or more of your disks that may have affected the RAID parity integrity. It is recommended that you perform a RAID volume resync from the RAID Settings tab ( accessible in the Volumes page => Volume tab in FrontView ). The resync process will run in the background, and you can continue to use the ReadyNAS in the meantime.
Wed Jan 8 21:16:13 PST 2014 System is up.
Wed Jan 8 20:52:33 PST 2014 Volume scan found and corrected errors.
Sun Jan 5 18:01:41 PST 2014 System powering off...
Tue Dec 31 00:53:42 PST 2013 System is up.
Tue Dec 31 00:53:45 PST 2013 System Update Status (chimborazo) : Your ReadyNAS device has been updated with a new firmware image. (RAIDiator 4.1.13)
Tue Dec 31 00:48:52 PST 2013 Volume scan found and corrected errors.
Sat Dec 21 20:31:59 PST 2013 System powering off...
Tue Dec 17 13:35:36 PST 2013 UPS is on battery power.
Fri Dec 13 04:19:41 PST 2013 Please reboot your ReadyNAS device to continue with the update process.
Sat Nov 30 14:03:51 PST 2013 System is up.


The SMART logs:

Disk 1:
SMART Information for Disk 1
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9750
Start Stop Count 107626
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36606
Spin Retry Count 14
Calibration Retry Count 0
Power Cycle Count 56
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 0
Airflow Temperature Cel 35
Temperature Celsius 36
Hardware ECC Recovered 1265083
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0

Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0


Disk 2:
SMART Information for Disk 2
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9910
Start Stop Count 121920
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36499
Spin Retry Count 21
Calibration Retry Count 0
Power Cycle Count 57
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 1
Command Timeout 0
Airflow Temperature Cel 40
Temperature Celsius 40
Hardware ECC Recovered 459297
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0

Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0


Disk 3:
SMART Information for Disk 3
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 1764
Spin Up Time 9910
Start Stop Count 121842
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36479
Spin Retry Count 17
Calibration Retry Count 1
Power Cycle Count 58
Read Soft Error Rate 1764
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 1764
Command Timeout 0
Airflow Temperature Cel 38
Temperature Celsius 38
Hardware ECC Recovered 794889
Reallocated Event Count 0
Current Pending Sector 6
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0

Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0


Disk 4:
SMART Information for Disk 4
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 10280
Start Stop Count 122283
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36539
Spin Retry Count 16
Calibration Retry Count 0
Power Cycle Count 54
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 0
Airflow Temperature Cel 35
Temperature Celsius 35
Hardware ECC Recovered 5038
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0

Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
-- ff
~.~.~.~.~.~.~
MacBook Pro 2.66 GHz Intel Core i7, Mac OS 10.8.x
ReadyNAS NV, X-RAID, RAIDiator 4.1.13
APC 550G UPS
Message 1 of 6
StephenB
Guru

Re: High Raw Read Error Rate: is it time to replace my drive

Disk 3 also shows "Current Pending Sector 6". Current pending sectors are sectors the drive failed to read. They are similar to reallocated sectors, except that nothing gets reallocated on a read failure; the sector stays pending until it is rewritten.

I suggest powering down the NAS and testing the drive in a PC (perhaps with SeaTools, since Samsung sold their drive business to Seagate).
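
To make the comparison across the four disks less error-prone, here is a minimal Python sketch that scans a SMART dump in the format pasted above and reports which of a few attributes are non-zero. The attribute names are copied from the FrontView output in this thread; the choice of which ones to watch (the `WATCHED` list) and the helper names `parse_smart` and `flag_disk` are assumptions for illustration, not anything the ReadyNAS itself provides. Raw values are vendor-specific, so treat a non-zero result as a reason to test the drive, not as a verdict.

```python
# Attribute names as they appear in the FrontView SMART dump in this thread.
# Treating these particular attributes as warning signs is an assumption.
WATCHED = (
    "Raw Read Error Rate",
    "Reallocated Sector Count",
    "Reported Uncorrect",
    "Current Pending Sector",
    "Offline Uncorrectable",
)


def parse_smart(text):
    """Turn lines of the form 'Attribute Name 123' into a {name: int} dict."""
    attrs = {}
    for line in text.splitlines():
        # Split on the last space: everything before it is the name.
        name, _, value = line.strip().rpartition(" ")
        if name and value.isdigit():
            attrs[name] = int(value)
    return attrs


def flag_disk(text):
    """Return the watched attributes whose raw value is greater than zero."""
    attrs = parse_smart(text)
    return {name: attrs[name] for name in WATCHED if attrs.get(name, 0) > 0}


# Excerpt of the Disk 3 values posted above.
DISK3 = """\
Raw Read Error Rate 1764
Reallocated Sector Count 0
Read Soft Error Rate 1764
Reported Uncorrect 1764
Current Pending Sector 6
Offline Uncorrectable 0
"""

if __name__ == "__main__":
    print(flag_disk(DISK3))
```

Run against the full dumps, this would flag Disk 3 on three attributes and Disk 2 on "Reported Uncorrect 1" only; Disks 1 and 4 come back empty.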
Message 2 of 6
fearless_fool
Aspirant

Re: High Raw Read Error Rate: is it time to replace my drive

Thanks for catching the Current Pending Sector stats.

But what is this "PC" of which you speak? 🙂

I have a linux box and a bunch of Macs -- any options for testing there?
Message 3 of 6
mdgm-ntgr
NETGEAR Employee Retired

Re: High Raw Read Error Rate: is it time to replace my drive

If the Linux box is an ordinary x86 machine, you could boot it from, say, the Ultimate Boot CD and run a diagnostic such as SeaTools.
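
As an aside, if the drive is attached to a running Linux system that has smartmontools installed (an assumption; it is not part of this thread's setup), `smartctl` can read the attributes and start the drive's built-in extended self-test. The Python sketch below only assembles and optionally runs those commands. The device path `/dev/sdX` is a placeholder, the helper names are hypothetical, and `smartctl` usually needs root.

```python
import shutil
import subprocess


def smartctl_commands(device):
    """Build the smartctl invocations for one drive (device is e.g. /dev/sdX)."""
    return {
        # Print the vendor SMART attribute table.
        "attributes": ["smartctl", "-A", device],
        # Start the drive's extended (long) self-test; it runs on the drive itself.
        "start_long_test": ["smartctl", "-t", "long", device],
        # Show the self-test log once the test has had time to finish.
        "selftest_log": ["smartctl", "-l", "selftest", device],
    }


def run(step, device):
    """Run one step and return its text output. Assumes smartmontools is installed."""
    if shutil.which("smartctl") is None:
        raise RuntimeError("smartctl not found; install smartmontools first")
    cmd = smartctl_commands(device)[step]
    return subprocess.run(cmd, capture_output=True, text=True).stdout


if __name__ == "__main__":
    # Placeholder device path: replace with the real one before calling run().
    for step, cmd in smartctl_commands("/dev/sdX").items():
        print(step, "->", " ".join(cmd))
```

The long test takes a while on a 1 TB drive, so start it with `start_long_test` and check `selftest_log` later rather than waiting on the command.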
Message 4 of 6
fearless_fool
Aspirant

Re: High Raw Read Error Rate: is it time to replace my drive

Very good. FWIW, according to http://www.ultimatebootcd.com/, Ultimate Boot CD ships with SeaTools already installed. (You may have meant this in your note, but it wasn't clear that this was the case.) Thanks for the tip.
Message 5 of 6
fearless_fool
Aspirant

Re: High Raw Read Error Rate: is it time to replace my drive

Even better, it appears that you can boot an Intel-based Mac from UBCD, according to http://www.ultimatebootcd.com/forums/viewtopic.php?f=7&t=2622 and http://supermegaultragroovy.com/2010/05/12/booting-the-ultimate-boot-cd-in-a-mac-pro/. (The latter post was written by a ReadyNAS owner, so it is particularly relevant.)
Message 6 of 6