NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.

Forum Discussion

MTeixeira's avatar
MTeixeira
Aspirant
Jun 16, 2021
Solved

Two failed disks at same time

Hi all,

Last monday this NAS in a client reported at the exact same time the failure of two of the three drives - thus loosing the volume

No prior warnings existed and this beeing a backup NAS the disks were probably "sleeping"

Tried to reboot remotely but as I found out today it was stuck at reboot

I had to power it down and upon resuming all disks seem to be recognized however now i have the message "Remove inactive volumes to use the disk."

I have tried readonly mode with same resutls and the NAS is now testing the disks.

Is there anyway to recover the volume ?

If not, I Have two iSCSI targets and also the NAS is connected to a domain - can I make a new volume keeping the existing configuration or do I need to make a factory default.

Thank you

 

 

 

 

 

  • I Rebuilt the RAID5

    Took the exepcted time. No errors whatsoever

     

    Of course i dont trust this device anymore - it can be the disks but two failures at at the same instant with no smart errors and no errors on the test and on the rebuild points to some glitch on the NAS itself...

     

    As it is 6+ years old I bought a new NAS for main backup and this will be secondary while it works

    Thanks all

9 Replies

Replies have been turned off for this discussion
  • I would be very surprised if there were not SMART errors in smart_history.log in the logs zip for at least one of the disks. You can easily check this.

     

    I think NTGR would probably require a data recovery contract to look into this, and if this NAS is only holding backups of data stored elsewhere it doesn't sound like that would be worth it.

    • MTeixeira's avatar
      MTeixeira
      Aspirant

      Thank you for your reply

       

      I am rather new to netgear, where can i find this smart_history.log via ssh ?

       

      As to the configuration, is there a way to install new disks and create the volumes keeping the configuration or is the configuration in the disks themselves ?

       

      Thanks

       

       

      • MTeixeira's avatar
        MTeixeira
        Aspirant

        OK, found it via donwload logs

        No errors are reported.

         

        Couldn't find any logs that report an error...

        simply at 23h00 last monday: (23h00 is the time a backup starts to this device)

        [21/06/08 07:59:00 GMT] info:system:LOGMSG_START_READYNASD ReadyNASOS background service started.
        [21/06/14 23:00:02 GMT] warning:volume:LOGMSG_HEALTH_VOLUME Volume 'RAID5' health changed from 'Redundant' to 'Degraded'.
        [21/06/14 23:00:03 GMT] notice:disk:LOGMSG_ZFS_DISK_STATUS_CHANGED Disk in channel '2' (Internal) changed state from ONLINE to FAILED.
        [21/06/14 23:00:04 GMT] notice:disk:LOGMSG_ZFS_DISK_STATUS_CHANGED Disk in channel '3' (Internal) changed state from ONLINE to FAILED.
        [21/06/15 01:00:02 GMT] warning:volume:LOGMSG_HEALTH_VOLUME_WARN Volume 'RAID5' is 'Degraded'.

         

        I made a disk test today via boot menu, is the result somewhere ? Can't find it.

         

        I am concerned this may be some malfunction in the NAS itself. any pointers on how to diagnose this ?

         

        Thanks

         

  • Your second failed disk has probably a minor problem, maybe a block failure. This is the cause, why the bad sync tool of your bad raid5 firmware crashed on it.

    You could easily make a sector-level copy with a lowlevel disk cloning tool (for example, gddrescue is probably very useful), or sometime health status and use this disk as your new disk3. In this case, your array survived with a minor data corruption.

    I am sorry, probably it is too late, because the essence of the orthodox answer in this case: "multiple failure in a raid5, here is the apocalypse!"

    If you want very good, redundant raid, use software raid in linux. For example, its raid superblock data layout is public and documented... I am really sorry, for my this another heretic opinion.

     

    • StephenB's avatar
      StephenB
      Guru - Experienced User

      EthelDStill wrote:

       

      If you want very good, redundant raid, use software raid in linux. 

       


      Which is exactly what the ReadyNAS uses.

       

      Disk cloning could be needed, but I don't think we know enough yet.  If the array is out-of-sync, then cloning wouldn't be enough (and might not be needed at all). 

      • MTeixeira's avatar
        MTeixeira
        Aspirant

        I Rebuilt the RAID5

        Took the exepcted time. No errors whatsoever

         

        Of course i dont trust this device anymore - it can be the disks but two failures at at the same instant with no smart errors and no errors on the test and on the rebuild points to some glitch on the NAS itself...

         

        As it is 6+ years old I bought a new NAS for main backup and this will be secondary while it works

        Thanks all

NETGEAR Academy

Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology! 

Join Us!

ProSupport for Business

Comprehensive support plans for maximum network uptime and business peace of mind.

 

Learn More