NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.

Forum Discussion

Joufflu's avatar
Joufflu
Aspirant
Oct 11, 2015

btrfs-cleaner always stuck at 100% CPU after 6.4.0 upgrade

Hello,

 

After 6.4.0 upgrade btrfs-cleaner process always uses 100% cpu even after many uptime hours.

SMB shares are ok (but slow)

Frontview times out after authentication.

 

Top command (ssh is ok) :

 

top - 16:33:54 up  2:51,  1 user,  load average: 14.25, 14.00, 11.14
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 2524 root 20 0 0 0 0 R 96.4 0.0 135:53.96 btrfs-cleaner

Any ideas ?

Thank you

39 Replies

  • I wish I could help but I am in the same situation, my ReadyNAS is now useless due to this upgrade.

  •  

    I'm also unable to use the NAS, although it appears to be fully booted. I just ssh'd in to check it out. Indeed, btrfs-cleaner is (apparently) hard at work.

     

    In my case, if I wait for about 90 minutes after boot, the NAS becomes usable again. I suspect the btrfs-cleaner process will not be active at that point. I will confirm in about an hour (hopefully).

     

    UPDATE: As suspected, when btrfs-cleaner stopped, the NAS became fully usable. Also, I did an immediate reboot to see if btrfs-cleaner would go wild again - it didn't. The NAS seems to be working normally for now.

     

    • TonyKL's avatar
      TonyKL
      Guide

      Thanks for the info, I have had the box with the cleaner stopped but as soon as I touch the filesystem it's off again.  It must think I'm really dirty to need to spend all day cleaning.  I have about 6TB of data.

       

      I'm trying to clean up and delete old snapshots but ever one I delete freezes my NAS for a good 10 minutes +

      • spotcatbug's avatar
        spotcatbug
        Apprentice

         

        I'm not seeing that same symptom, where touching the filesystem sets off btrfs-cleaner again. I've been using the NAS all morning (since btrfs-cleaner stopped) without incident. I'm keeping an eye on a running top command, just to see if it pops in there again, but so far it's OK.

         

        This wouldn't be the first time I had the NAS go from unusable to usable since 6.4.0. I've had it do this a few times now. This time is different, however; it's remained usable through reboot. I did try the USB recovery and that kept it from happening through one reboot. I want to reboot again, to see if it continues to be usable, but I have work to do, so that will have to wait until later today.

         

  • btrfs-cleaner finally calm down on my NAS after hours of power on

    I now have access to Frontend : 1TB data, 165GB snapshots

     

     

    USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
    root      2524 88.6  0.0      0     0 ?        S    Oct11 1618:06 [btrfs-cleaner]

     

  • I think that automatic snapshot prune does not work on my RN102.

     

    According to this KB :

    http://kb.netgear.com/app/answers/detail/a_id/23353/~/what-are-basic-snapshot-concepts-i-need-to-understand-before-operating-my

    "Daily snapshots are kept for four weeks."

     

    But I see many snapshots older than one month on shares configured with daily snapshots.

    It seem to keep a monthly backup and never delete it.

    I never made manual snapshots.

     

    I'm cleaning old snapshots but it's really long on large shares.

    • StephenB's avatar
      StephenB
      Guru

      Joufflu wrote:
       

      According to this KB :

      http://kb.netgear.com/app/answers/detail/a_id/23353/~/what-are-basic-snapshot-concepts-i-need-to-understand-before-operating-my

      "Daily snapshots are kept for four weeks."

       

      But I see many snapshots older than one month on shares configured with daily snapshots.

      It seem to keep a monthly backup and never delete it.

       


      That is the way it is designed to work.  Snapshots are "thinned" down to 1/month, but that monthly one is not deleted.

    • AlexPe's avatar
      AlexPe
      NETGEAR Expert

      Dear Joufflu,

       

      • Hourly snapshots are kept for 48 hours.
      • Daily snapshots are kept for four weeks.
      • Weekly snapshots are kept for eight weeks.

      Do you have any that are older than eight weeks? So the way snapshots work is it still takes weekly snapshots if you have daily snapshots selected and if you have hourly, it will take all three at selected intervals.

       

      So you might have weekly snapshots that are still present. The key to checking is to see when the oldest snapshots were taken and see if the dates are synonymous with an 8 week interval.

       

      Thanks

       

      Alex

      • StephenB's avatar
        StephenB
        Guru

        AlexPe wrote:
         
        • Hourly snapshots are kept for 48 hours.
        • Daily snapshots are kept for four weeks.
        • Weekly snapshots are kept for eight weeks.

         


        And monthly snapshots are kept forever (unless manually deleted).

  • I am also having these problems after update to 6.4

    btrfs-cleaner is taking up 100% cpu.

    Every time it removes a snapshot, the NAS is inaccessible for 10 minutes every hour. 

    I have set Snapshot Schedule on Never but still this happens.

    This is an unworkable situation for my company.

    A solution please!

    • JasperA's avatar
      JasperA
      Apprentice

      My NAS is inaccessible for over 7 days in a row now, constantly 100% busy doing whatever btrfs-cleaner is doing. The /data volume does not appear on my admin page anymore.

       

      I am running out of options and out of hope for this NAS.

  • My problems are 'solved', this is what I did may it will help:

     

    Started with an OS reinstall, my Nas is a 312.

    http://kb.netgear.com/app/answers/detail/a_id/22892/~/how-do-i-access-the-boot-menu-on-my-readynas-102,-202-or-312%3F

    This will leave your data intact. Check if problems are solved.

    In my case they were not.

     

    When I was finally able to access the admin page the first thing I did was disable snapshots.

    This stopped the creation of new shapshots but dit not stop the removal of snapshots once an hour. 

    This kept the NAS 100% occupied for aprox 10 minutes every hour, but it gave me time to create a backup of all the data.

     

    Backup done (and checked!)  I did a Factory reset, this WILL destroy all data on the disk.

    After this I recreated the shares making sure to disable the snapshot-function.

    Then I had to copy back all the data.

     

    Overall this process took me 3 days!

     

    But now I have a NAS that is once again up and running.

     

    Ariane

     

    • AlexPe's avatar
      AlexPe
      NETGEAR Expert

      ArianeS wrote:

      My problems are 'solved', this is what I did may it will help:

       

      Started with an OS reinstall, my Nas is a 312.

      http://kb.netgear.com/app/answers/detail/a_id/22892/~/how-do-i-access-the-boot-menu-on-my-readynas-102,-202-or-312%3F

      This will leave your data intact. Check if problems are solved.

      In my case they were not.

       

      When I was finally able to access the admin page the first thing I did was disable snapshots.

      This stopped the creation of new shapshots but dit not stop the removal of snapshots once an hour. 

      This kept the NAS 100% occupied for aprox 10 minutes every hour, but it gave me time to create a backup of all the data.

       

      "Backup done (and checked!)  I did a Factory reset, this WILL destroy all data on the disk.

      After this I recreated the shares making sure to disable the snapshot-function.

      Then I had to copy back all the data.

       

      Overall this process took me 3 days!

       

      But now I have a NAS that is once again up and running.

       

      Ariane"

       


      While I would not wish this to be the solution, it would work to resolve. The cleaner service is likely stuck on a rather large file or large quantity of files. We've been really trying to connect to a system that is having this issue to diagnose the issue directly. The log files we have obtained, point to systems that have been unmaintained. Since you have started the system from factory defaults, I'd suggest running scheduled maintenance now so that the file system stays maintained for the future.  Volume Maintenance Guidance

       

      regards,

       

      Alex

      • JasperA's avatar
        JasperA
        Apprentice

        I am deleting all snapshots by hand, one by one, from the web interface. That is... in the rare time I get it to respond at all.

        When that's done, I think I have a system that should work again, more or less. I expect btrfs-cleaner to be busy as hell for a day or 10 after I'm finished. I won't mention that it's stupid I'm not able to select multiple snapshots and delete them all.

         

        All this misery caused by beta software, adding features I don't need and enabling it by default. I'm not a guinea pig for testing this software, I had a business depending on a rock solid NAS. Which my ReadyNAS 516 obviously is not (anymore).

         

        I'll keep you posted. The solution Ariane is giving us won't work for me, because I can't backup all my data to another device. It's a terrible shame that the only option to get a somehow working NAS again is to disable the snapshot feature.
        Too bad Netgear is giving up on this problem so easily. My problem exists for almost a month now with no real solution to look forward to. I owned three ReadNAS'ses, but the 516 is definately the last one I've bought.

NETGEAR Academy

Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology! 

Join Us!

ProSupport for Business

Comprehensive support plans for maximum network uptime and business peace of mind.

 

Learn More