NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
tony359
Mar 30, 2023Apprentice
ReadyNAS Pro 6 crashed again
Hello all, My ReadyNAS Pro6 periodically stops responding to the network. When that happens I can push the button to shutdown it but it will sit on "shutting down" forever and then I'll have to p...
tony359
Jun 06, 2023Apprentice
Thank you for your input, really much appreciated.
I cannot remember if I tried accessing via the second port - it's been many months of tests! I'll set it up and keep it disconnected and try accessing only when it hangs.
My Linux skills are limited - but I learn fast π
is this https://www.debian.org/CD/live/ ok to make a live USB install? Do I boot it by changing the boot order in the BIOS?
Once booted does it take me to terminal or GUI?
So I'd imagine I boot into Debian CD, then I issue
btrfs-check --readonly /dev/sda (or whatever drives I want to check).
and I repeat for all the other drives?
I guess this should not affect the NAS but as usual nobody can guarantee that π
This is the only reference of CSUM I have in my logs
D:\Downloads\System_log-Enterprise-NAS-20230531-191137\btrfs.log (21 hits)
Line 130: csum_type 0 (crc32c)
Line 131: csum_size 4
Line 132: csum 0xa36bf3b8 [match]
Line 195: backup_csum_root: 34406400 gen: 621032 level: 1
Line 206: backup_csum_root: 39092224 gen: 621033 level: 1
Line 217: backup_csum_root: 32161792 gen: 621029 level: 1
Line 228: backup_csum_root: 32538624 gen: 621030 level: 1
Line 236: csum_type 0 (crc32c)
Line 237: csum_size 4
Line 238: csum 0x881ca9bc [match]
Line 301: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 312: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 323: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 334: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 342: csum_type 0 (crc32c)
Line 343: csum_size 4
Line 344: csum 0xf5e3c214 [match]
Line 407: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 418: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 429: backup_csum_root: 17009663148032 gen: 166700 level: 1
Line 440: backup_csum_root: 17009663148032 gen: 166700 level: 1tony359
Jun 06, 2023Apprentice
Ok good news. I can access the NAS via the second NIC when it goes down. It was very slow authenticating but then it sped up.
Here it shows up my lack of linux knowledge to be honest! I wanted to reset the network (or maybe just one of the NICs) but ifdown and ifup are not available?
Also traceroute is not available?
fdisk -l correctly lists all the drives.
What would you suggest I checked once I've got access?
- tony359Jun 06, 2023Apprentice
Update.
I found
ifconfig ETH1 up|down
Disabling ETH1 and bringing it back up, made the NAS appear again on my other network. At that point, logging in via SSH did not show any delay as before and everything seems to be working normally.
So it looks like it could be a NIC issue - even though I know that sometimes when I plug a keyboard, the CAPS LOCK button does not respond, showing a complete unresponsive system.
Two scenarios I guess
1. Faulty NIC HW
2. Software issue involving the NIC and when the NIC is in that weird state, everything which is using the network hangs
Any ideas?
- StephenBJun 06, 2023Guru - Experienced User
tony359 wrote:
Two scenarios I guess
1. Faulty NIC HW
2. Software issue involving the NIC and when the NIC is in that weird state, everything which is using the network hangs
Are both NICs always connected to the switch? Or did you move the cable to the second NIC???
- tony359Jun 06, 2023Apprentice
The second NAS NIC would directly connect to the second NIC on my desktop. NIC1 and NIC2 are on a different network range of course.
For this test, the secondary NIC was physically unplugged - but configured and ready to go. When the NAS disappeared from the main network, I plugged the secondary cable to the NAS and I could see the NAS from my desktop.
I thought of keeping the second NAS NIC disconnected rather than always connected. I guess I'll leave them both on for a test - but one NIC is going through the switch and to the main network, the other is only directed connected to the desktop.
Under normal circumstances, I would only use one NIC on the NAS.
- tony359Jun 07, 2023Apprentice
Just for the Hardware Geeks, I decided to replace the capacitors of the PSU anyways. I know, waste of time. And massive one as those large traces with wired made the process very difficult.
HOWEVER, 90% of the caps were bulged and were reading either nothing or a fraction of the original capacitance!! But the NAS was working, I wonder how.
And I wonder how is it even possible that the replacement (temporary) PSU hasn't fixed the issue!
I think I'll replace the Seasonic anyways but a small PSU is always handy so I'll fix it anyways.
The NAS didn't disappear over the past couple of days.
- KDSJun 09, 2023Guide
Just on an off chance you haven't tried yet.
CPU & RAM swap out?
- tony359Jun 09, 2023ApprenticeConsidering itβs happening very rarely itβs not such a bad idea.
Iβve run overnights of ram tests but maybe it didnβt catch it because it happens very rarely.
I still have the original CPU so I could try that too.
That said, the fact that just the network went down last time is suspicious. A ram or cpu issue would have much bigger impact I reckon. I might want to put a switch in between the nas and the main switch. Itβs always been that switch and maybe itβs faulty. After all the nas stopped crashing when I took it off the main network - which takes the main switch out of the equation.
And it worked for a while while connected to my main desktop, again no main switch involved.
Uhmβ¦ I like this idea π - KDSJun 09, 2023Guide
I have my router dishing out DHCP addresses>>>Unmanaged 2.5G switch>>>both NICs into switch.
Static IP's on both Netgear NIC settings (IPV4) and router address.
Router set to static IP addresses for both NICs.
Since doing that both NICs are very stable.
Ram is 2 x 2GB PC800.
CPU is now E7600, Just upgraded from E5300, find this much faster than the Q6600, though my NAS is mainly used for backup, and file server, not really serving any Apps. E7600 runs faster and much cooler than Q6600.
- SandsharkJun 09, 2023Sensei
Is it a "green" switch? I've had a couple issues with ReadyNAS and green switches, though I've believed the problem units already had partly damaged LAN ports. My main switch has a "green" on/off selection. Try turning off power saving mode if yours does. Otherwise, a non-green switch in between might be the answer.
- tony359Jun 09, 2023Apprentice
It's a Netgear! π
GS108Ev2. "partly" managed. V1.00.12 (latest). DHCP disabled. DHCP is handled by the router (Fritzbox) which issues the same IP to the NAS MAC address. All settings are default to be honest.
I have tried a static IP in the past with no change - though, I'm confident those swollen capacitors might have contributed to SOME of the issues I was having.
Today's new issue is... the NAS is online, I can see the files. I can SSH into it. But web interface shows an "500 - internal server error". This is on both ports. Sigh π
Before I just reboot the box, how would I restart the web interface from SSH?
I'll install a dumb switch between the NAS and the main switch - with new cables.
The 7600 seems to be a good option. It's only 2 cores but it's faster than the cores in the 6600. I wonder how much a NAS used as a "file system" is actually using a multi-core CPU. And the 7600 as you say is cooler.
I think I'll fix this issue first then I might try the 7600 as well, thanks for the hint!
- tony359Jun 09, 2023Apprentice
I feel that the below is relevant with my issue. Again, the NAS is accessible, I can write a file on the data folder via nano. I just lost the web interface.
These weird failures are incredibly annoying. I'd like to test what itachi2 recommended, can someone possibly point me to the right direction? See https://community.netgear.com/t5/Using-your-ReadyNAS-in-Business/ReadyNAS-Pro-6-crashed-again/m-p/2316638/highlight/true#M199640
Thanks π
root@Enterprise-NAS:/# systemctl status apache2 Failed to get properties: Activation of org.freedesktop.systemd1 timed out root@Enterprise-NAS:/# root@Enterprise-NAS:/# root@Enterprise-NAS:/# root@Enterprise-NAS:/# root@Enterprise-NAS:/# systemctl restart apache2 Failed to restart apache2.service: Activation of org.freedesktop.systemd1 timed out See system logs and 'systemctl status apache2.service' for details. root@Enterprise-NAS:/# sudo systemctl status apache2 -bash: sudo: command not found root@Enterprise-NAS:/# su root@Enterprise-NAS:/# systemctl status apache2 Failed to get properties: Activation of org.freedesktop.systemd1 timed out root@Enterprise-NAS:/# systemctl status readynasd Failed to get properties: Activation of org.freedesktop.systemd1 timed out root@Enterprise-NAS:/# ps aux | grep readynasd root 26625 0.0 0.0 17836 1008 pts/2 S+ 19:58 0:00 grep readynasd root@Enterprise-NAS:/# service ctscand stop Failed to stop ctscand.service: Connection timed out See system logs and 'systemctl status ctscand.service' for details. Failed to get load state of ctscand.service: Connection timed out root@Enterprise-NAS:/# systemctl restart readynasd Failed to restart readynasd.service: Activation of org.freedesktop.systemd1 timed out See system logs and 'systemctl status readynasd.service' for details. root@Enterprise-NAS:/# systemctl status readynasd.service Failed to get properties: Activation of org.freedesktop.systemd1 timed out - KDSJun 09, 2023Guide
Just another hardware thing that has probably already happened.
1. After good PSU installed was CMOS cleared?
2. Has CMOS battery been checked?
3. Are you keeping it simple with just 1 HDD, possibly 2 (raid 1), with HDDs especially raid arrays cleaned and cleared on another PC prior to installing. Granted you may have data on your system, though remove those HDDs and start fresh, with known clean and good drives? I tested with some old 320GB junk drives I had kicking about. I also encountered NIC, web access, and HDD problems prior to replacing the PSU. My original 7200 WD HDDs were only seen as 5900, then when I added a newer 7200 WD HDD it was seen as 7200, it did not like the mismatch in HDD speed that it saw.
Though finally did clean HDDs. I think Web interface may be associated with what is already on the HDDs.
My HDD and hardware issues were resolved when I replaced PSU. Both types drives 3 x 7200 seen as 5900, and 3 x 7200 seen as 7200, and running together fine.
4. BTW are you using RAIDar 6.5.0.
- tony359Jun 09, 2023Apprentice
Just another hardware thing that has probably already happened.
1. After good PSU installed was CMOS cleared?
--
No, I did not update the BIOS so I didn't think of clearing the CMOS. I can try.
2. Has CMOS battery been checked?
--
No. Good point.
3. Are you keeping it simple with just 1 HDD, possibly 2 (raid 1), with HDDs especially raid arrays cleaned and cleared
--
No. Reason is: last time the system behaved, it lasted for 2 months. I cannot stay without my data for 2 months.
The only two options here are
a. Fix it with the current setup
b. try a factory default and migrate a backup
Testing with 2 random HDDs is likely not gaining any evidence I'm afraid.
I also encountered NIC, web access, and HDD problems prior to replacing the PSU. My original 7200 WD HDDs were only seen as 5900, then when I added a newer 7200 WD HDD it was seen as 7200, it did not like the mismatch in HDD speed that it saw.
--
Unfortunately the replacement PSU did not solve all the problems. I'm confident some of the issues I experienced were caused by the bad PSU but the NAS is still misbehaving I'm afraid.
All my HDDs are WD RED, 5400-ish (4TB are a bit slower than the 6TB).
4. BTW are you using RAIDar 6.5.0.
--
No. I am on OS6.
I appreciate a factory reset would be a good idea but I have 13TB on that NAS and I don't know where to store them for a backup. Yes, the NAS is more or less fully backed up (locally and online) but it would take me forever to restore those backups so I'd consider that as an emergency option only.
I could see if I could hire another NAS, transfer the data, reset and restore. But somehow I am not confident my problems would go away π
Thanks for your input!
- tony359Jun 10, 2023Apprentice
Little update.
I checked the battery, it's ok, 3.1V. I replaced it some time ago when I serviced the box.
I re-reset the BIOS (only thing I change is the default fan speed!)
I swapped position of HDD0 with HDD4. I sprayed dry contact cleaner on the backplane and on the HDDs, cleaned with a small q-tip.
Once the NAS was powered up again, HDD0 failed to show up on the BIOS splash page straight away. So it's not the HDD and, to be honest, I feel that that might be a red herring. I never had issues with HDD0 so maybe it's a BIOS bug which then does not affect the software. No idea. But I now know it's not the drive.
I've added a TP-Link switch between the main switch and the NAS.
Next: throwing the NAS out of the window.
- tony359Jun 11, 2023Apprentice
And no, the NAS disappeared again.
Solution: SSH into other port and ifconfig the other port DOWN and then UP again.
I could try swapping the config but I think I tried that in the past already.
If someone could give me some directions for checking the HDDs offline as mentioned above, that would be great! π
Thanks
- tony359Jun 11, 2023Apprentice
That's what I meant with "swapping the config" sorry. As in swap the IP addresses between ports.
I'll try but I think I tried that in the past already. 100% worth a try.
- tony359Jun 12, 2023Apprentice
The NICs are on two different IP range - one main network, one PC only.
What used to be on main network is now directly connected to the PC and what used to be connected to the PC is now connected to the main network and I've swapped the IP addresses accordingly.
I did that yesterday and I've just checked: NAS has disappeared. Sigh!
I SSH'd through the other NIC, restarted it and it worked as usual.
So
- It's not the specific NIC
- It's not the switch
It's curious that it's always the NIC on the main network failing and not the other.
Help π
- schumakuJun 12, 2023Guru - Experienced User
As you are in the lucky situation having an alternate LAN interface (and IP subnet) available. what does the kernel output show when the device "disappeared", ...?
# dmesg
The risk that a network adapter does become flakey is very small. More typical, the adapters resp. the data connectivity does completely disappear completely, and the UPnP OS does no longer detect the adapter.
Most problems on such NASes are caused by RAID becoming inoperable, due to aged or breaking storage blocks.
Do you have a known working, reliable SATA storage block at hand to set-up the NAS with one single device volume, or two on a RAID 1 volume? Remove the potentially unhealthy storage blocks, and restart a test from scratch.
- tony359Jun 12, 2023Apprentice
Thanks, I'll test next time.
Many have (rightly) recommended a test with a couple of random HDDs. I have plenty so that wouldn't be an issue.
My concern is that sometimes the NAS stays online for weeks without issues and I really cannot keep my data offline for so long.
Is there a way to do an offline test of my drives? Someone recommended booting from a Debian Live-USB but I would need some minor guidance on that. I know how to make the USB, I'm just making sure (as much as possible) I don't do anything that can destroy my data.
Thanks! π
- schumakuJun 12, 2023Guru - Experienced User
Start with retrieving the SMART data from the storage block (aka. disk). Next trigger a full SMART check (rapid, then full) of the storage block. Then retrieve the SMART data again.
You can do this on any platform, without erasing or re-partition or re-format the storage block - if done carefully of course.
- tony359Jun 12, 2023ApprenticeThanks.
Iβll Google how to do that. π
Just to double check: do you mean doing those checks on the NAS itself while itβs online? - StephenBJun 13, 2023Guru - Experienced User
tony359 wrote:
Is there a way to do an offline test of my drives? π
There is an on-line test in the maintenance menu you can use. That runs the full built-in smart test on all the drives in the volume.
You can also use smartctl -x /dev/sda from ssh to see more errors (UNCs in particular) on sda (or whatever disk you wish),
As far as off-line goes, the simplest way is to connect the drive to a Windows PC and run the vendor diag - Dashboard for WDC, and Seatools for Seagate. Unfortunately they don't run on MacOS.
But it seems to me that your symptoms are pointing either to the switch or perhaps the cable going from the NAS to the switch. It's always the NIC port connected to that switch that fails, and the other NIC always continues to work fine.
- tony359Jun 13, 2023Apprentice
Hi Stephen,
No, the ports were swapped last time - also the switch and the cable. So it's not a NIC or Network issue. Well. It ALWAYS fails on that NETWORK so it could be something on my main network. But on this occasion the NAS was wired to the main switch on another port and through an additional switch. So if it's something with that network, it's not a HW issue.
The online maintenance runs periodically. The logs show an "offline" test though. How should I read that? The drive is now 51888hrs.
SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Interrupted (host reset) 90% 50227 - # 2 Extended offline Completed without error 00% 48081 - # 3 Extended offline Completed without error 00% 45875 - # 4 Extended offline Completed without error 00% 43691 - # 5 Extended offline Completed without error 00% 41536 - # 6 Extended offline Completed without error 00% 39834 - # 7 Extended offline Completed without error 00% 37636 - # 8 Extended offline Completed without error 00% 35455 - # 9 Extended offline Completed without error 00% 33273 - #10 Extended offline Completed without error 00% 31118 - #11 Extended offline Completed without error 00% 28912 - #12 Extended offline Completed without error 00% 26707 - #13 Extended offline Completed without error 00% 24525 - #14 Extended offline Completed without error 00% 22554 - #15 Extended offline Completed without error 00% 20712 - #16 Extended offline Completed without error 00% 19182 - #17 Short offline Completed without error 00% 82 - #18 Short offline Completed without error 00% 63 -I ran smartctl -x in the past and posted the output here earlier on this thread. I didn't spot anything but I am not an expert. There are UNC errors on SDA (which I now moved to SDE) but at 7872 hours, a few years ago! π
Error 159 [14] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: WP at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 04 00 00 08 00 00 4b 2b c8 40 40 08 14:59:14.849 WRITE FPDMA QUEUED 60 04 00 00 00 00 00 4b 2b cc 40 40 08 14:59:14.849 READ FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:59:14.849 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:59:14.849 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:59:14.849 IDENTIFY DEVICE Error 158 [13] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: UNC at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 04 00 00 08 00 00 4b 2b cc 40 40 08 14:59:11.031 READ FPDMA QUEUED 61 04 00 00 00 00 00 4b 2b c8 40 40 08 14:59:11.031 WRITE FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:59:11.031 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:59:11.031 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:59:11.030 IDENTIFY DEVICE Error 157 [12] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: WP at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 04 00 00 08 00 00 4b 2b c8 40 40 08 14:59:07.223 WRITE FPDMA QUEUED 60 04 00 00 00 00 00 4b 2b cc 40 40 08 14:59:07.223 READ FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:59:07.223 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:59:07.223 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:59:07.223 IDENTIFY DEVICE Error 156 [11] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: UNC at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 04 00 00 08 00 00 4b 2b cc 40 40 08 14:59:03.405 READ FPDMA QUEUED 61 04 00 00 00 00 00 4b 2b c8 40 40 08 14:59:03.405 WRITE FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:59:03.405 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:59:03.405 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:59:03.405 IDENTIFY DEVICE Error 155 [10] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: WP at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 04 00 00 08 00 00 4b 2b c8 40 40 08 14:58:59.720 WRITE FPDMA QUEUED 60 04 00 00 00 00 00 4b 2b cc 40 40 08 14:58:59.720 READ FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:58:59.720 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:58:59.720 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:58:59.719 IDENTIFY DEVICE Error 154 [9] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b cc 40 40 00 Error: UNC at LBA = 0x4b2bcc40 = 1261161536 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 04 00 00 08 00 00 4b 2b cc 40 40 08 14:58:55.900 READ FPDMA QUEUED 61 04 00 00 00 00 00 4b 2b c8 40 40 08 14:58:55.900 WRITE FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 e0 08 14:58:55.873 FLUSH CACHE EXT 60 00 08 00 08 00 00 00 7f 22 18 40 08 14:58:55.838 READ FPDMA QUEUED 61 00 02 00 00 00 00 00 00 00 48 40 08 14:58:55.838 WRITE FPDMA QUEUED Error 153 [8] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b c8 40 40 00 Error: UNC at LBA = 0x4b2bc840 = 1261160512 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 04 00 00 00 00 00 4b 2b c8 40 40 08 14:58:52.283 READ FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:58:52.283 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:58:52.283 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:58:52.282 IDENTIFY DEVICE ef 00 03 00 46 00 00 00 00 00 00 a0 08 14:58:52.282 SET FEATURES [Set transfer mode] Error 152 [7] occurred at disk power-on lifetime: 7872 hours (328 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 4b 2b c8 40 40 00 Error: UNC at LBA = 0x4b2bc840 = 1261160512 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 04 00 00 00 00 00 4b 2b c8 40 40 08 14:58:48.786 READ FPDMA QUEUED ef 00 10 00 02 00 00 00 00 00 00 a0 08 14:58:48.786 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 00 00 00 00 00 e0 08 14:58:48.786 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 00 00 00 00 00 a0 08 14:58:48.786 IDENTIFY DEVICE ef 00 03 00 46 00 00 00 00 00 00 a0 08 14:58:48.786 SET FEATURES [Set transfer mode]I am Windows so that's fine but wouldn't be better to run the tests on a Linux system so the file system can be checked as well? Also I think I think I'd prefer the disks to stay unmounted so I know I have less chances of damaging the RAID.
Can I start the NAS from a Debian live-USB? I could run the checks from there, assuming VGA works there. And what do you think of that suggestion of running btrfs-check on the drives? I don't dislike the idea of checking the file system.
The NAS disappeared again so I've run dmseg and it's attached (this forum lacks the ability to attach text files!).
Do I see lots of network going down messages after what seems to be a gap? And both ETH0 and ETH1.
Disabling and re-enabling ETH0 worked as usual.
And yes, I've now disabled IPv6 (it got re-enabled when I swapped the IPs I think)
Related Content
NETGEAR Academy
Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!