RN316 disk randomly failed without warning

Question

Just got the email message that my relatively new replacement disk in my 6 disk array failed.. Without warning or errors.&nbsp; Is there a way to see why it failed?&nbsp; Email notifification at 2:01pm and all I can see from logs is:
&nbsp;
STATUS
[18/07/03 14:01:42 AEST] warning:volume:LOGMSG_HEALTH_VOLUME Volume Raid-6 health changed from Redundant to Degraded.[18/07/03 14:01:49 AEST] err:disk:LOGMSG_ZFS_DISK_STATUS_CHANGED Disk in channel 3 (Internal) changed state from ONLINE to FAILED.
&nbsp;
DISKINFO
Device: sdfController: 0Channel: 2Model: Serial: Firmware: Class: SATASectors: 5860533168Pool: Raid-6PoolType: RAID 6PoolState: 3PoolHostId: 540eddc2Health data  ATA Error Count: 0
&nbsp;
VOLUME
Disk sdf: HostID: 2fe73ef2 Flags: 0x0 Size: 5860533168 (2794 GB) Free: 14 Controller 0 Channel: 2 Model:  Serial:  Firmware:  Class: SATA (2) SMART data  Latest Self Test: Passed
&nbsp;
DMESG (note errors only happen 7 minutes after the disk has been taken offline)
[Sun Jul 1 06:13:34 2018] usb 4-2: DVB: adapter 0 frontend 0 frequency 0 out of range (45000000..860000000)[Tue Jul 3 14:08:18 2018] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen[Tue Jul 3 14:08:18 2018] ata3.00: failed command: FLUSH CACHE EXT[Tue Jul 3 14:08:18 2018] ata3.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 5 res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)[Tue Jul 3 14:08:18 2018] ata3.00: status: { DRDY }[Tue Jul 3 14:08:18 2018] ata3: hard resetting link[Tue Jul 3 14:08:24 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:24 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:28 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:28 2018] ata3: softreset failed (1st FIS failed)[Tue Jul 3 14:08:28 2018] ata3: hard resetting link[Tue Jul 3 14:08:34 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:34 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:38 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:38 2018] ata3: softreset failed (1st FIS failed)[Tue Jul 3 14:08:38 2018] ata3: hard resetting link[Tue Jul 3 14:08:44 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:08:44 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:09:13 2018] do_marvell_9170_recover: ignoring PCI device (8086:3a22) at PCI#0[Tue Jul 3 14:09:13 2018] ata3: softreset failed (1st FIS failed)[Tue Jul 3 14:09:13 2018] ata3: limiting SATA link speed to 1.5 Gbps[Tue Jul 3 14:09:13 2018] ata3: hard resetting link
&nbsp;
&nbsp;
&nbsp;

mdgm-ntgr · Answer

Do you see any errors for the disk in smart_history.log ?

Jophus · Answer

2018-06-04 20:23:14 ST3000DM007-1WY10G ZFQ02SC4 0 0 0 0 0 0 0 0 2018-06-05 14:32:54 ST3000DM007-1WY10G ZFQ02SC4 0 0 0 0 1 0 0 0
&nbsp;
Nope... this is the drive and these are the entries.&nbsp; 1 CMD_TIMEOUT 05/06/2018 2:32PM - one month ago.

mdgm-ntgr · Answer

Have you checked e.g. kernel.log or systemd-journal.log?&nbsp;Have you checked the disk using SeaTools?

Jophus · Answer

kernel shows nothing from 3PM on 30 June.
Nothing in systemd-journal since 00:13 (midnight) 1 July.
&nbsp;
Anywhere else?

Jophus · Answer

Seatools.. no - i haven't pulled it out of the NAS yet.

Forum Discussion

RN316 disk randomly failed without warning

8 Replies

Related Content

RAXE500 no longer get attack warnings

R7000 dropping 5Ghz randomly

Router randomly blocking websites

Weak security warning for EX3700 range extender

Insecure connection password warning in Firefox

NETGEAR Academy

ProSupport for Business