Forum Discussion
Helin0x
Mar 25, 2019 · Aspirant
ReadyNAS 2100 Drops from Network
I have 4 of these devices, and one of them keeps dropping off the network. It's physically on and has green power lights, but the blue disk lights are off, the web interface is unreachable, and it does not respond to ping. The only way to get it back online is to hold the power button down and power-cycle it.
I've looked through the logs, corrected SMART disk errors, etc., but I can't see anything in the logs that indicates something happening.
I have no sleep/spin-down settings set.
The latest drop-off occurred sometime between the 19th and the 25th of March.
From the web console it appears to have all the same settings as the others which function perfectly.
Tried to attach my logs but it errors as they are not jpg...
- The file system_log-hd-nas1-20190325-110914.zip does not have a valid extension for an attachment and has been removed. jpg,gif,png,pdf are the valid extensions.
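Since nothing obvious shows in the logs, one option is to timestamp the next drop-off from another machine on the LAN, so it can be lined up with the NAS logs afterwards. A watchdog sketch (the function names, IP, and log path here are placeholders of mine, not anything from the ReadyNAS software):

```shell
#!/bin/sh
# Hypothetical one-shot reachability check, meant to be run from cron
# every minute on a second machine. Logs only state *transitions*
# ("up" -> "DOWN" and back) so the file stays small.

log_event() {
    # append "<timestamp> <state>" to the log file given as $2
    printf '%s %s\n' "$(date '+%Y-%m-%d %H:%M:%S')" "$1" >> "$2"
}

check_host() {
    # one ping; succeeds if the host answered
    ping -c 1 "$1" > /dev/null 2>&1
}

watch_once() {
    # compare current state against the last logged state
    host="$1"; log="$2"
    if check_host "$host"; then state=up; else state=DOWN; fi
    last="$(awk 'END { print $NF }' "$log" 2>/dev/null)"
    [ "$state" != "$last" ] && log_event "$state" "$log"
    return 0
}
```

A crontab entry like `* * * * * /root/nas_watch.sh` calling `watch_once 192.168.1.50 /var/log/nas_watch.log` (IP and paths are placeholders) would then record roughly when the unit went dark.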
6 Replies
Replies have been turned off for this discussion
- Hopchen - Prodigy
Hi Helin0x
I think the issue is either the disks or the chassis. It could also be the fact that the volume is 100% full. Anyway, here are some things that stand out.
You are using two "Green" drives, and they have recorded plenty of command timeouts, which is bad. One disk is also constantly recovering from ECC errors. I would advise against using Green drives in a NAS at any time. They spin down on their own and have aggressive power management - all of which is bad in a RAID and will cause things like command timeouts.
Model Family: Seagate Barracuda Green (Adv. Format)
  Command_Timeout: 26
  Hardware_ECC_Recovered: 750873864
Model Family: Western Digital Caviar Green (Adv. Format)
  Command_Timeout: 55
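If you want to keep an eye on those counters yourself, a small sketch for pulling them out of a saved `smartctl -A` dump (the dump file name is a placeholder; in smartctl's attribute table, column 2 is the attribute name and column 10 the raw value):

```shell
# Print "attribute raw_value" for the attributes worth watching here.
# $1: path to a file containing "smartctl -A /dev/sdX" output.
check_smart_dump() {
    awk '/Command_Timeout|Hardware_ECC_Recovered|Reallocated_Sector_Ct/ { print $2, $10 }' "$1"
}
```

On the NAS itself you would first save a dump with something like `smartctl -A /dev/sda > smart_sda.txt`; a raw Command_Timeout value that keeps climbing between dumps is the thing to watch.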
We can also see that the NAS struggles to communicate with those drives, at times.
Sep 28 06:38:51 kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Sep 28 06:38:51 kernel: ata4.00: failed command: SMART
Sep 28 06:38:51 kernel: ata4.00: cmd b0/d0:01:00:4f:c2/00:00:00:00:00/40 tag 0 pio 512 in
Sep 28 06:38:51 kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Sep 28 06:38:51 kernel: ata4.00: status: { DRDY }
Sep 28 06:38:51 kernel: ata4: hard resetting link
Sep 28 06:38:51 kernel: ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Sep 28 06:38:51 kernel: ata4.00: configured for UDMA/100
Sep 28 06:38:51 kernel: ata4: EH complete

Your volume is 100% full, and that is a problem on its own.
Filesystem  Size  Used  Avail  Use%  Mounted on
/dev/md0    4.0G  405M  3.4G   11%   /
tmpfs       16K   0     16K    0%    /USB
/dev/c/c    5.5T  5.4T  11G    100%  /c    <<<=== Data volume
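A quick way to keep an eye on that Use% figure going forward, as a sketch (the mount point and threshold below are placeholders):

```shell
# Extract the Use% number for a given mount point from "df -P"-style
# output on stdin; prints e.g. "100" for a full volume.
usage_pct() {
    awk -v m="$1" '$NF == m { sub(/%/, "", $(NF-1)); print $(NF-1) }'
}
```

Something like `[ "$(df -P | usage_pct /c)" -ge 90 ] && echo "volume nearly full"` in a cron job would give early warning before the data volume fills again.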
It can lead to filesystem corruption and your filesystem certainly seems to have regular issues. Some examples:
Feb 24 00:00:05 kernel: EXT4-fs (dm-3): INFO: recovery required on readonly filesystem Feb 24 00:00:06 kernel: EXT4-fs (dm-3): write access will be enabled during recovery Feb 24 00:00:09 kernel: EXT4-fs (dm-3): recovery complete Mar 3 00:00:07 kernel: EXT4-fs (dm-3): INFO: recovery required on readonly filesystem Mar 3 00:00:07 kernel: EXT4-fs (dm-3): write access will be enabled during recovery Mar 3 00:00:11 kernel: EXT4-fs (dm-3): recovery complete
I would recommend the following:
- Take a backup of your data if you don't have one already.
- Take some data off the unit. Try not to go above 90% or so.
- Replace those Green drives with proper NAS drives. Remember to replace with disks of the same size or larger, and replace one at a time --> let the RAID finish syncing --> replace the next one.
- See how it does after that. It could also be chassis-related, but rule out the obvious suspects first.
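On the replace-one-at-a-time point: before pulling the next disk, it's worth confirming the previous sync has actually finished. A small helper along these lines, assuming shell access to the NAS (the file argument is only there so it can be tried against a saved copy of `/proc/mdstat`):

```shell
# Succeed only when no md array reports an active resync/recovery.
# $1: mdstat file to read; defaults to the live /proc/mdstat.
resync_done() {
    ! grep -Eq 'resync|recovery' "${1:-/proc/mdstat}"
}
```

Usage would be `resync_done && echo "safe to swap the next disk"`; while a rebuild is running, mdstat shows a progress line containing "recovery" or "resync".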
Cheers
- Helin0x - Aspirant
Thanks for checking over my logs
The capacity is due to it being an iSCSI LUN using all but 10GB. As this is a set size, the volume will never grow. This is also a common setup among the other ReadyNAS 2100s I have, which don't exhibit this behaviour. In this scenario, do you believe it would still cause a problem?
The others I have are not using Green disks; I had Greens lying around, so I used them to replace failed HGSTs to save on costs (these hold only old archive data at this point). I think you can tweak them with wdidle3 to stop the automatic spin-down, but I'll just replace them with more HGSTs and see how it goes.
I think the filesystem corruption was down to a failed disk in slot 1; it had a few hundred reallocated sectors. I replaced it on the 19th of March, and it subsequently passed the volume check:

Tue Mar 19 14:23:19 WET 2019  Data volume will be rebuilt with disk 1.
Sun Mar 24 00:08:23 WET 2019  The on-line filesystem consistency check completed without errors for Volume C.

I was hoping that replacing it would put this issue to bed, but the device still went offline, so here we are.
- Hopchen - Prodigy
Hey
It does make it slightly better that it is an iSCSI LUN. 100% is probably still a bit high; you can't shrink it anyway, so the only choice is to leave it as is. For the future, I would leave 2-3% free space when using an iSCSI LUN. It just gives a little wriggle room. For comparison, the new line of NAS units will not allow you to create a LUN larger than 90% of the total volume size.
Yeah, the Green drives are a pain - all those command timeouts, etc. They are just always problematic in a NAS scenario. On the other hand, I do understand the desire to keep costs down, as it is essentially an archive solution at this point.
The FS issue could easily have been down to that disk. I guess only time will tell; just keep an eye on it. Other than that, I didn't see much wrong with the NAS. As said, it could be the chassis as well, or a slightly dodgy power supply. But let's hope not :)