NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.

Forum Discussion

gregb_pro's avatar
gregb_pro
Aspirant
Aug 21, 2015

Replacement procedure for suspect drive in ReadyNAS Pro BE

BACKGROUND:
ReadyNAS Pro Business Edition with six WD 750 GB enterprise drives (WDC WD7502ABYS-01A6B0), all drives with 03.00C05 firmware. The drives show about 57,000 runtime hours (about 6.5 years). The NAS has never had a drive failure.
The ReadyNAS is running RAIDiator 4.2.27, and drives are configured with X-RAID2.
Operating temperatures have been consistent over the life of the system: SYS: 50 C (122 F), Temp CPU: 20.5 C (68 F). All six drives consistently run at 39-42 C (102-107 F).
I have several spare equivalent drives with zero runtime hours.


ISSUE:

Drive #2 (top middle) has a pronounced bearing noise with off-nominal (also pronounced) hand-sensed vibration.
SMART data shows no issues for any disk.

QUESTION:
What is the recommended replacement procedure for suspect drive in ReadyNAS Pro BE?

i.e. Wait for drive to fail and force system shutdown; or
replace hot (while running); or
shutdown and replace cold; or
pull suspect drive to verify pending failure, and shutdown or
...

Obviously backup first.

Thanks, Greg

 

6 Replies

Replies have been turned off for this discussion
  • StephenB's avatar
    StephenB
    Guru - Experienced User

    I always recommend a hot-swap - because it ensures that the NAS will detect the removal and insertion.  However, a cold insertion of an unformatted disk should also work.

     

    I'd also replace the drive now, I see no reason to wait.  

     

    You might want to run Western Digital's Lifeguard diag on the replacement first (that is a windows application, available on the WDC web site).  That's just to make sure nothing happened while the drive was on the shelf.

    • gregb_pro's avatar
      gregb_pro
      Aspirant

      Suspect drive #2 was removed as suggested (system up). Unfortunately, the "noise" did not subside with removal of the drive. Tested the drive with WD Lifeguard Diagnostic for Windows (v1.24) and wrote zeros to drive (full test, full zero). Re-installed drive back into ReadyNAS, and the system rebuilt back to X-RAID2 as expected. ReadyNAS log:

       

      Sun Aug 23 08:40:34 MDT 2015    Disk removal detected. [Disk 2]
      Sun Aug 23 08:40:35 MDT 2015    A disk was removed from the ReadyNAS. For [...]
      Sun Aug 23 08:40:42 MDT 2015    Disk failure detected.
      Sun Aug 23 08:40:42 MDT 2015    If the failed disk is used in a RAID level [...]
      Sun Aug 23 13:42:02 MDT 2015    New disk detected. If multiple disks have [...]
      Sun Aug 23 13:44:13 MDT 2015    Data volume will be rebuilt with disk 2.
      Sun Aug 23 13:44:35 MDT 2015    RAID sync started on volume C.
      Sun Aug 23 17:47:19 MDT 2015    RAID sync finished on volume C.

      System appeared to run well for 5 days.

       

      Still trying to diagnose noise source. Remove suspect drive from bay 1 while system up:

      Fri Aug 28 07:36:55 MDT 2015    Disk removal detected. [Disk 1]
      Fri Aug 28 07:36:55 MDT 2015    A disk was removed from the ReadyNAS.
      Fri Aug 28 07:37:21 MDT 2015    Disk failure detected.
      Fri Aug 28 07:37:21 MDT 2015    If the failed disk is used in a RAID level [...]

       

      Test and zero new zero-hour drive with WD Lifeguard Diagnostic for Windows (v1.24). Passed.
      Test and zero old drive #1 with WD Lifeguard Diagnostic for Windows (v1.24). Passed.
      Insert *new* drive into bay 1; surprisingly it FAILED SMART test: Arrgh!

       

      Fri Aug 28 14:42:28 MDT 2015    New disk detected. If multiple disks have been [...]
      Fri Aug 28 14:42:41 MDT 2015    Newly added disk has failed SMART test. Please check disk 1.


      Remove new drive from bay 1, then
      Insert old drive into bay 1; it also FAILED SMART test:

       

      Fri Aug 28 14:55:12 MDT 2015    Disk removal detected. [Disk 1]
      Fri Aug 28 14:55:12 MDT 2015    A disk was removed from the ReadyNAS. For full [...]
      Fri Aug 28 14:59:51 MDT 2015    New disk detected. If multiple disks have been [...]
      Fri Aug 28 15:00:04 MDT 2015    Newly added disk has failed SMART test. Please check disk 1.


      Remove old drive from bay 1; then
      Test and quick zero old drive with WD Lifeguard Diagnostic (Passed);
      Shutdown and reboot; then
      Insert old drive into bay 1; it also FAILED SMART test:

       

      Fri Aug 28 15:06:55 MDT 2015    Disk removal detected. [Disk 1]
      Fri Aug 28 15:06:55 MDT 2015    A disk was removed from the ReadyNAS. For full [...]
      Fri Aug 28 15:08:30 MDT 2015    Please close this browser session and use RAIDar [...]
      Fri Aug 28 15:09:47 MDT 2015    System is up.
      Fri Aug 28 15:19:16 MDT 2015    New disk detected. If multiple disks have been [...]
      Fri Aug 28 15:19:27 MDT 2015    Newly added disk has failed SMART test. Please check disk 1.

      So, bay 1 failed two successive SMART tests (two different drives), and failed again after a reboot.

      Both drives pass WD Lifeguard Diagnostic (one is zero-hour drive).

      Looking for recommendations.

      Thanks, gregb

       

      • gregb_pro's avatar
        gregb_pro
        Aspirant

        Tested drives again with WD Lifeguard Diagnostic for Windows; both show PASS.

        Both drives show no SMART issues (all zero's in expected warning indicators).

        Does RAIDiator 4.2.27 keep an internal database of drives and serial numbers with capability to reject drives based on database history? (although neither of these drives are actually failed).

        Thanks, gregb

NETGEAR Academy

Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology! 

Join Us!

ProSupport for Business

Comprehensive support plans for maximum network uptime and business peace of mind.

 

Learn More