Forum Discussion

Aspirant

Mar 09, 2019

Solved

ReadyNAS Ultra-6 data degraded after firmware update

I have an old ReadyNAS Ultra-6 that I upgraded to Firmware 6.9.5. After the upgrade it shows data degraded. However the data is not degraded, all drives are healthy, I can still access the shares...

Hopchen
Mar 11, 2019
Hi again

Thanks for posting the disk info. As suspected, disk sda has seen better days. From what I can see in the logs, the disk should be located in bay 1 (the first disk in the NAS). We can see that the disk has some Current Pending Sector errors.

Device: sda Controller: 0 Channel: 0 <<<=== Bay 1 Model: WDC WD60EFRX-68MYMN0 Serial: Firmware: 82.00A82 Class: SATA RPM: 5700 Sectors: 11721045168 Pool: data PoolType: RAID 5 PoolState: 3 PoolHostId: 33eac74a Health data ATA Error Count: 0 Reallocated Sectors: 0 Reallocation Events: 0 Spin Retry Count: 0 Current Pending Sector Count: 11 <<<=== Bad sectors on the disk Uncorrectable Sector Count: 0 Temperature: 32 Start/Stop Count: 136 Power-On Hours: 35247 Power Cycle Count: 77 Load Cycle Count: 15929

Pending sectors typically indicates imminent failure of the disk. An Acronis KB, describes the issue particularly well.

Current Pending Sector Count S.M.A.R.T. parameter is a critical parameter and indicates the current count of unstable sectors (waiting for remapping). The raw value of this attribute indicates the total number of sectors waiting for remapping. Later, when some of these sectors are read successfully, the value is decreased. If errors still occur when reading some sector, the hard drive will try to restore the data, transfer it to the reserved disk area (spare area) and mark this sector as remapped.

Please also consult your machines's or hard disks documentation.
Recommendations

This is a critical parameter. Degradation of this parameter may indicate imminent drive failure. Urgent data backup and hardware replacement is recommended.

https://kb.acronis.com/content/9133

It is quite likely that these bad sectors caused the NAS to kick the disk from the one of the data raids. The fact that those sectors appear stuck in "pending" is an indication that the sectors will probably never recover. Without further examination of the logs I'd say you need to replace that disk asap. Note: you must replace with a disk of same size or larger.

Your other disks appear to be healthy, which is good!

Cheers

StephenB

Guru - Experienced User

Mar 12, 2019

It's the entire volume that is degraded, not a disk.

But I agree that Netgear needs to a better job of identifying which disk(s) have dropped out of the array in the web UI. Forcing users to delve into mdstat.log, etc isn't great.

Also, they should implement user-defined thresholds for disk health (pending sectors, etc). 11 pending sectors are well below their hard-wired thresholds, so you will just see a "green" healthy disk in the web ui. IMO the hard-wired thresholds are too high (and based on posts here, Netgear support agrees that is the case.).

Hopchen

Prodigy

Mar 12, 2019

StephenB wrote:

It's the entire volume that is degraded, not a disk.

But I agree that Netgear needs to a better job of identifying which disk(s) have dropped out of the array in the web UI. Forcing users to delve into mdstat.log, etc isn't great.

Also, they should implement user-defined thresholds for disk health (pending sectors, etc). 11 pending sectors are well below their hard-wired thresholds, so you will just see a "green" healthy disk in the web ui. IMO the hard-wired thresholds are too high (and based on posts here, Netgear support agrees that is the case.).

I completely agree with this. It falls in the same category as the incredibly confusing message: "Remove Inactive Volumes". The challenge in this case was that sda dropped from md126 but it was still active in md127. Regardless, there should be an improved way of notifying the user other than the display blinking and showing: "Degraded". I think that user notification is an area where NETGEAR has a long way to go. It is fine if you know the system and can pull stats from logs, etc. However, for the average user, the front-end notifications are sub-par.

The disk error threshold is an interesting topic. NETGEAR has set a threshold for when to sent email notifications to the user: https://kb.netgear.com/30046/ReadyNAS-OS-6-Disk-Failure-Alerting

IMO, you cannot generalise disks errors to some arbitrary count as in, "it is only critical if you hit X mount of errors". Disks errors can hit different areas of the disk (some more important than others) and ATA errors can affect different aspects in how the disk operate.

They do need to re-think how they do this.

NETGEAR Academy

Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!

Join Us!

ProSupport for Business

Comprehensive support plans for maximum network uptime and business peace of mind.

Learn More

Forum Discussion

ReadyNAS Ultra-6 data degraded after firmware update

Related Content

ReadyNAS RN104 Drive degraded, ATA error

ReadyNas 102 Status: Degraded

Degraded Resync???

RBR50 performance degradation after 2.5.0.38 firmware update

ReadyNAS 516 Degraded Volume

NETGEAR Academy

ProSupport for Business