NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
hang
6 TopicsRN102 fails to boot when one mirrored drive fails or is missing
Hello everyboby, I have been looking for a solution or help but was not able to find any hints regarding the following problem. I own a RN102 device equiped with two disk forming a X-RAID mirror. So far, so good. The RAID is healthy and looks fine. The last few days the device complained that disk2 is likely to fail. So I tried to replace the device by pulling the disk accordingto the handbook. At this moment the RN102 stucks an is not reachable anymore. When I reset the device, the poer LED keeps on blinking. RAIDAR is able to find the device but indicating the the is no configuration on it. The first disk is show as already used with data. When I put the second disk back an reboot the RN, it comes up fine and operates normally. It seems that the OS an it's config is missing on the first disk. Any idea how to get the corrected? Thanks to everybody. Greetings ChrisSolved3.1KViews0likes19CommentsRN3312 BTRFS operations are completely hung
We have owned the RN3312 for a bit over 6 months, and all was seemingly fine. However, things went downhill recently and now pretty much the entire BTRFS partition is completely unusable at this point. Even leaving the NAS offline and just trying to do whatever internal metadata cleanup by itself in a reasonable time is not enough to recover. What has happened is a combination of the Bit Rot Protection / COW + Compression + Snapshots being turned on, on a partition used for file backups, and image backups (Veeam) for a single, large, fileserver. BTRFS is NOT production ready for such a setup, I firmly believe this option should be removed from the UI, or a huge warning displayed. Everything was going great until the first snapshots needed to be deleted, where I ran into the problem of btrfs-cleaner taking up 100% CPU. Symptoms: the admin UI would lock up on any file operation in certain directories. Directory accesses would hang forever, even over SMB. Of course all the backups to the NAS were timing out. I eventually was able to delete the snapshots by hard rebooting the system and removing them before btrfs-cleaner got too bad. But now, I have the problem where btrfs-transacti is taking up 100% CPU. I have left the system sitting offline for a week just spinning at 100% CPU (!), and there is no visible improvement - EVERY BTRFS operation still hangs, no matter what I try. There is little disk activity, it is not thrashing - makes me think there is something wrong in the internals of BTRFS, or that the CPU is too underpowered to handle the amount of storage metadata operations. top - 12:30:29 up 1:39, 2 users, load average: 115.21, 112.48, 99.52 Tasks: 334 total, 2 running, 332 sleeping, 0 stopped, 0 zombie %Cpu(s): 0.6 us, 23.0 sy, 0.0 ni, 72.1 id, 1.6 wa, 0.0 hi, 2.7 si, 0.0 st KiB Mem: 8113792 total, 2673896 used, 5439896 free, 4404 buffers KiB Swap: 2093052 total, 0 used, 2093052 free. 1980036 cached Mem PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 3740 root 20 0 0 0 0 R 100.0 0.0 93:52.62 btrfs-tran+ 1 root 20 0 136632 6868 5144 S 0.0 0.1 0:02.45 systemd 2 root 20 0 0 0 0 S 0.0 0.0 0:00.00 kthreadd admin@archive:/data$ iostat Linux 4.4.68.x86_64.1 (archive) 07/11/2017 _x86_64_ (4 CPU) avg-cpu: %user %nice %system %iowait %steal %idle 0.55 0.00 25.73 1.56 0.00 72.15 Device: tps kB_read/s kB_wrtn/s kB_read kB_wrtn sda 3.26 55.82 57.19 337578 345828 sdb 3.25 55.07 57.08 333044 345196 sdc 3.45 74.32 57.02 449448 344808 sdd 3.28 53.95 57.21 326224 345936 sde 3.26 57.73 57.09 349104 345252 sdf 3.28 54.89 56.87 331951 343908 md0 1.68 27.57 39.28 166740 237520 md1 0.02 0.19 0.00 1172 0 md127 9.38 243.50 69.42 1472516 419788 < not a lot of activity... I have tried starting a balance to fix fragmentation, I believe there are operations blocking it inside the kernel, but even at -dusage=0 I gave up after giving it the weekend to do its thing. Trying to look for evidence is fragmented files is horrendously slow. But it is very bad now: admin@archive:/data$ ls *** hangs forever *** My hope at this point is to try and mount the system read-only and recover data onto a USB drive, the share with data is around 8 TB which might just fit after a couple of days/weeks? of copying... Then figuring out some way to drop the share? and rebuild it without selecting the 'Bit Rot Protection' or 'Compression' options. Hopefully I don't have to resort to copying the NAS to something else and wiping it - there is about 14 TB of data on it currently, and I don't have that much capacity available anywhere else... After going through this and after lots of research, I see lots of horror stories showing that BTRFS is extremely fragile and not ready for prime time. I believe it is reckless for Netgear to base a NAS on such an unproven FS. The features are not worth it if they explode in spectacular fashion after a couple of months. Symptoms include btrfs-transacti and btrfs-endio-wri taking up a lot of CPU time (in spikes, possibly triggered by syncs). You can use filefrag to locate heavily fragmented files (may not work correctly with compression). ... "a balance on 2TB of data that was heavily snapshotted - it took 3 months" "when I have to do balances ... I delete all the snapshots and allow a few months for the balance to finish" https://btrfs.wiki.kernel.org/index.php/Gotchas We are running version 6.7.4. We currently have 6 x 8 TB in X-RAID (certified drives.) I struggle to think what would happen if we filled up all 12 slots... Are there any other operations anyone from support wants to try before I start wiping it? Unfortunately our 90-day free support has expired before any of this happened, so I am left venting in public...4.8KViews0likes4CommentsSRX5308 non responsive 2-12 seconds LAN/WAN sporadically through day, drops all VPN connections too
I saw a simlar posting from 6/2015 that was not answered and closed "due to inactivity". I have (5) SRX5308 and they all exibit the same issue. In some cases I have RIP protocol enabled and in others I am using the SRX5308 as a standard firewall with a cable modem uplink. Sporatically and completely random and apparently the higher the firmweare version the more often it happens, the router becomes completely non-responsive for 2-12 seconds, and in most cases VPN connections if any are dropped. Weirdly enough, earlier firmware versions may have had entries in the logs about an exception with register values but newer firmware has absolutely nothing in the logs. I replaced the router with a Cisco 1841 router and the problem goes away compltely but obviously my netgear clients cant VPN in. Does not appear to be volume related either as it happens when the traffic is very low as well as when its averaging 20-30 Mbps. I opened a case with netgear but so far they havent any ideas and suggested it could be a device on the network causing a problem. I agree, its the netgear on the network thats causing the porblem. I like the firewall, especially its VPN thougthput but the constant hang even with its short duration prevents me from keeping this device on the network. Any suggestions? and since I suspect many of you will immediately start asking see the notes below: Currently running firmware: 4.3.4-2, also tried 4.3.3-6, 4.3.3-5, and I beleive an earlier one that came on the router when I bought it. 3 routers have VPN configured between them and one is completely stand alone (the one running RIP is stand alone at anothe location) The 3 with VPN are setup with NAT and the RIP is setup "Classical routing" All are configured for IPv4 only One of the 3 with VPN and NAT has a cable modem on WAN 1 and is configured for failover to DSL on WAN 2, the non-resposiveness still impacts WAN and LAN ports The NAT routers have public WAN IP's and private LAN IP's, the RIP one has public WAN and public LAN ip's. None have DMZ's configured The NAT has firewall rules for specific ports from WAN to LAN, no restrictions on outbound, the RIP router has no rules, all in and out permited, working as a router not a firewall. All have "respond to pings on internet ports" enabled All have "enable stealth mode" None have any blocking enabled (UDP or TCP flood) They all have VPN pasthrough checked None have session limits or throughput/bandwidth limits set None have content filtering enabled None have DHCP server enabled Hope that eliminates most initial questions...6.2KViews0likes12CommentsRN104 inaccessible and displaying btrfs_biomerge_bio_hook+a4 message
Model: RN10400 Firmware: 6.5.1 OS: Windows 10 Environment: Small Office Home Network Installed Storage: 4 x 4TB Hard Drives - 1 volume Issue: NAS unavailable. Hangs during bootup and displays btrfs_biomerge_bio_hook+a4 message in the display window with power light flashing. Does not appear in RAIDar. Performed USB Recovery and same error still occurs during bootup. Looking for next steps...3.8KViews0likes6CommentsReadyNAS 104 system hanging frequently after 6.4.0 upgrade
Hi All, I have a pretty new ReadyNAS 104 system (August, 2015). Currently populated with 3 x 4TB WD RED drives, and a single Segate 3TB drive (to be upgraded once funds allow). Since upgrading to FW 6.4.0 I have found that the system will hang whenever a maintenance process is performed - Scub, Balance or Defrag, which I had scheduled previously. I have now disabled all the schedules and left the system alone, but overnight it has hung again, after approximately 72 hours of normal operation. It responds to pings, but the GUI will not load, it has dropped off the network via all protocols, fails to allow you to SSH into it and pressing the power button (Once, twice and three times) has absolutely no effect. The only option is to pull the plug. Also, after every hang, forcing a power pull, the system starts a complete resync of the volume, which always starts at 25.10% and takes about 4 days to complete over the 15TB of raw storage. Obviously this is putting undue strain on the disks everytime it is happening. Happy to provide logs to any friendly Netgear Admins, and any advice from any of you lovely people would be gratefully received. Cheers, Tipster10KViews2likes26CommentsReadyNAS NV+ syslogd taking 100% CPU
Hi - I've got a Sparc-based ReadyNAS NV+ that I've had for years. Currently running RAIDiator 4.1.14; 4 x 2TB drives in Raid-X configuration, with about 300G/5.5T free. Lately, the NAS box will become non-responsive during transfers and I have to pull the power, and re-install the OS in order to make it happy. Unfortunately, it's not happy for more than a few hours at a time lately, and I've had to do this once a day or so. By non-responsive, I mean that I can ping it, but not telnet/ssh/frontview, and the front-panel power-button seems to do nothing with a brief press. I've bought its replacement, and am transferring ~ 6TB off it via CIFS, and it is still locking up. This last time, I left top running in a terminal window, which was still updating, though slowly. top - 18:24:58 up 1 day, 1:18, 1 user, load average: 9.04, 9.07, 9.02 Tasks: 67 total, 5 running, 62 sleeping, 0 stopped, 0 zombie Cpu(s): 1.6% us, 98.4% sy, 0.0% ni, 0.0% id, 0.0% wa, 0.0% hi, 0.0% si Mem: 226352k total, 221632k used, 4720k free, 8976k buffers Swap: 767904k total, 1856k used, 766048k free, 188352k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 1571 root 25 0 2128 960 800 R 97.7 0.4 1000:39 syslogd 1066 root 16 0 2992 1168 896 R 2.3 0.5 33:38.83 top 1 root 15 0 2000 592 496 S 0.0 0.3 0:05.09 init 2 root 34 19 0 0 0 S 0.0 0.0 0:00.03 ksoftirqd/0 3 root 10 -5 0 0 0 S 0.0 0.0 0:00.04 events/0 4 root 10 -5 0 0 0 D 0.0 0.0 0:00.03 khelper 5 root 10 -5 0 0 0 S 0.0 0.0 0:00.01 kthread 10 root 10 -5 0 0 0 S 0.0 0.0 0:09.36 kblockd/0 13 root 10 -5 0 0 0 S 0.0 0.0 0:00.01 khubd Syslogd with 1000 minutes of runtime, and 98.4% sys CPU seems very likely to be the cause of my problem. The rootfs wasn't full just prior to the CIFS transfer. nasgul:~# df / -i Filesystem Inodes IUsed IFree IUse% Mounted on /dev/hdi1 128000 11330 116670 9% / nasgul:~# df / -h Filesystem Size Used Avail Use% Mounted on /dev/hdi1 1.9G 642M 1.3G 33% / So, what's my next step here? A factory reset would be more palatable after I'm convinced that my data has been successfully backed up to the new NAS box. What can I do to keep the box up while I do my transfers and verifications? Thanks.3.1KViews0likes8Comments