Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
High Raw Read Error Rate: is it time to replace my drive?
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-19
07:32 AM
2014-01-19
07:32 AM
High Raw Read Error Rate: is it time to replace my drive?
The story:
The last few times I powered up my ReadyNAS (following a shutdown), it found and corrected errors. But the last two times I've tried, it got stuck at 98% during File System Check.
This morning, I forced skipping of FSCK (http://www.readynas.com/kb/faq/boot/how ... lume_check) so I could read the log file. The log file showed a SATA reset (see below), so I have initiated a RAID resync.
While that was going on, I decided to look at the SMART status of the four drives. One of them (Disk 3) shows Raw Read Error Rate of 1764; for the other three drives, that value is 0.
One other factor: I have enabled fast CIFS writes. I have a UPS, but its battery has (finally) gone bad. The system has powered itself off at least once without an "orderly" shutdown.
My question:
Would a marginal drive (with raw read errors) account for the problems I'm seeing (stuck on FSCK, SATA reset)? Or could the unscheduled shutdown be responsible for that?
Most importantly, is it time to purchase a replacement drive? (I've already ordered a replacement battery for the UPS.)
The log file:
The SMART logs:
Disk 1:
Drive 2
Disk 3
Disk 4
The last few times I powered up my ReadyNAS (following a shutdown), it found and corrected errors. But the last two times I've tried, it got stuck at 98% during File System Check.
This morning, I forced skipping of FSCK (http://www.readynas.com/kb/faq/boot/how ... lume_check) so I could read the log file. The log file showed a SATA reset (see below), so I have initiated a RAID resync.
While that was going on, I decided to look at the SMART status of the four drives. One of them (Disk 3) shows Raw Read Error Rate of 1764; for the other three drives, that value is 0.
One other factor: I have enabled fast CIFS writes. I have a UPS, but its battery has (finally) gone bad. The system has powered itself off at least once without an "orderly" shutdown.
My question:
Would a marginal drive (with raw read errors) account for the problems I'm seeing (stuck on FSCK, SATA reset)? Or could the unscheduled shutdown be responsible for that?
Most importantly, is it time to purchase a replacement drive? (I've already ordered a replacement battery for the UPS.)
The log file:
Sun Jan 19 06:33:23 PST 2014 RAID sync started on volume C.
Sun Jan 19 06:27:24 PST 2014 System is up.
Sun Jan 19 06:05:21 PST 2014 System powering off...
Wed Jan 8 21:18:54 PST 2014 A SATA reset has been performed on one or more of your disks that may have affected the RAID parity integrity. It is recommended that you perform a RAID volume resync from the RAID Settings tab ( accessible in the Volumes page => Volume tab in FrontView ). The resync process will run in the background, and you can continue to use the ReadyNAS in the meantime.
Wed Jan 8 21:16:13 PST 2014 System is up.
Wed Jan 8 20:52:33 PST 2014 Volume scan found and corrected errors.
Sun Jan 5 18:01:41 PST 2014 System powering off...
Tue Dec 31 00:53:42 PST 2013 System is up.
Tue Dec 31 00:53:45 PST 2013 System Update Status (chimborazo) : Your ReadyNAS device has been updated with a new firmware image. (RAIDiator 4.1.13)
Tue Dec 31 00:48:52 PST 2013 Volume scan found and corrected errors.
Sat Dec 21 20:31:59 PST 2013 System powering off...
Tue Dec 17 13:35:36 PST 2013 UPS is on battery power.
Fri Dec 13 04:19:41 PST 2013 Please reboot your ReadyNAS device to continue with the update process.
Sat Nov 30 14:03:51 PST 2013 System is up.
The SMART logs:
Disk 1:
SMART Information for Disk 1
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9750
Start Stop Count 107626
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36606
Spin Retry Count 14
Calibration Retry Count 0
Power Cycle Count 56
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 0
Airflow Temperature Cel 35
Temperature Celsius 36
Hardware ECC Recovered 1265083
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
Drive 2
SMART Information for Disk 2
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9910
Start Stop Count 121920
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36499
Spin Retry Count 21
Calibration Retry Count 0
Power Cycle Count 57
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 1
Command Timeout 0
Airflow Temperature Cel 40
Temperature Celsius 40
Hardware ECC Recovered 459297
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
Disk 3
SMART Information for Disk 3
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 1764
Spin Up Time 9910
Start Stop Count 121842
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36479
Spin Retry Count 17
Calibration Retry Count 1
Power Cycle Count 58
Read Soft Error Rate 1764
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 1764
Command Timeout 0
Airflow Temperature Cel 38
Temperature Celsius 38
Hardware ECC Recovered 794889
Reallocated Event Count 0
Current Pending Sector 6
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
Disk 4
SMART Information for Disk 4
Model: SAMSUNG HD103UJ
Firmware: 1AA01113
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 10280
Start Stop Count 122283
Reallocated Sector Count 0
Seek Error Rate 0
Seek Time Performance 0
Power On Hours 36539
Spin Retry Count 16
Calibration Retry Count 0
Power Cycle Count 54
Read Soft Error Rate 0
Runtime Bad Block 0
End-to-End Error 0
Reported Uncorrect 0
Command Timeout 0
Airflow Temperature Cel 35
Temperature Celsius 35
Hardware ECC Recovered 5038
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
Soft Read Error Rate 0
ATA Error Count 0
Extended Attribute
Hot-add events 0
Hot-remove events 0
Lp stat events 0
Power glitches 0
Hard disk resets 0
Retries 0
Repaired sectors 0
Message 1 of 6
Labels:
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-19
12:53 PM
2014-01-19
12:53 PM
Re: High Raw Read Error Rate: is it time to replace my drive
Drive 3 also shows "Current Pending Sector 6" Current Pending Sectors are read failures - similar to reallocated sectors (except there is nothing to reallocate on a read failure).
I suggest powering down the NAS and testing the drive in a PC (perhaps with SeaTools, since Samsung sold their drive business to Seagate).
I suggest powering down the NAS and testing the drive in a PC (perhaps with SeaTools, since Samsung sold their drive business to Seagate).
Message 2 of 6
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-19
04:17 PM
2014-01-19
04:17 PM
Re: High Raw Read Error Rate: is it time to replace my drive
Thanks for catching the Current Pending Sector stats.
But what is this "PC" of which you speak? 🙂
I have a linux box and a bunch of Macs -- any options for testing there?
But what is this "PC" of which you speak? 🙂
I have a linux box and a bunch of Macs -- any options for testing there?
Message 3 of 6
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-19
05:42 PM
2014-01-19
05:42 PM
Re: High Raw Read Error Rate: is it time to replace my drive
If the Linux box is an ordinary x86 box you could boot say the Ultimate Boot CD and run e.g. SeaTools.
Message 4 of 6
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-20
12:49 AM
2014-01-20
12:49 AM
Re: High Raw Read Error Rate: is it time to replace my drive
Very good. FWIW, according to http://www.ultimatebootcd.com/, Ultimate Boot CD ships with SeaTools already installed. (You may have meant this in your note, but it wasn't clear that this was the case.) Thanks for the tip.
Message 5 of 6
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
2014-01-20
10:42 AM
2014-01-20
10:42 AM
Re: High Raw Read Error Rate: is it time to replace my drive
And even better, it appears that you can boot an intel-based Mac from UBCD, according to http://www.ultimatebootcd.com/forums/viewtopic.php?f=7&t=2622 and http://supermegaultragroovy.com/2010/05/12/booting-the-ultimate-boot-cd-in-a-mac-pro/. (The latter post was written by a ReadyNAS owner, so it is particularly relevant...)
Message 6 of 6