tedder
Aspirant
Dec 29, 2012

NV+ had boot issues, do I have a bad drive? #20230163

I have an NV+ (version 1, firmware 4.1.7) with three 2 TB drives.

I had trouble with my NV+ not booting. It would stick on "quota chk" after a given percentage (33.6%, I think?).

Anyhow, it's up now, and RAIDar and Frontview both indicate it is resyncing. The logs in Frontview give this:

Access to the disk on channel (??) is producing I/O errors. Although the array is still redundant, please replace this drive as soon as possible, as it is likely to fail soon.


I downloaded the logs and notice some interesting bits.


Here's the disk_smart.log. I don't see anything interesting in it, but I didn't see it on this thread either: https://www.readynas.com/forum/viewtopic.php?f=64&t=61710


***** smartctl output for hdc *****

smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WCAZA0761499
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:00 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (38580) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 177 168 021 Pre-fail Always - 6108
4 Start_Stop_Count 0x0032 089 089 000 Old_age Always - 11184
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 074 074 000 Old_age Always - 19532
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 24
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 23
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 738384
194 Temperature_Celsius 0x0022 122 102 000 Old_age Always - 28
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 46

SMART Error Log Version: 1
ATA Error Count: 36 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 36 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.766 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.608 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG

Error 35 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.608 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG

Error 34 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE

Error 33 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE

Error 32 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.





***** smartctl output for hde *****

smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WMAZA1000356
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:01 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (37680) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 198 198 051 Pre-fail Always - 1081
3 Spin_Up_Time 0x0027 176 171 021 Pre-fail Always - 6175
4 Start_Stop_Count 0x0032 090 090 000 Old_age Always - 10984
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 075 075 000 Old_age Always - 18930
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 25
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 22
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 734949
194 Temperature_Celsius 0x0022 122 101 000 Old_age Always - 28
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 14
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 11
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 152

SMART Error Log Version: 1
ATA Error Count: 24 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 24 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.494 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.488 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG

Error 23 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.488 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG

Error 22 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE

Error 21 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE

Error 20 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.





***** smartctl output for hdg *****

smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WMAZA0999773
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:01 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (37560) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 180 169 021 Pre-fail Always - 5975
4 Start_Stop_Count 0x0032 089 089 000 Old_age Always - 11005
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 075 075 000 Old_age Always - 18946
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 19
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 17
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 754110
194 Temperature_Celsius 0x0022 123 103 000 Old_age Always - 27
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0

SMART Error Log Version: 1
ATA Error Count: 36 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 36 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.854 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.696 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG

Error 35 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.696 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG

Error 34 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE

Error 33 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE

Error 32 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.





***** smartctl output for hdi *****

smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Smartctl open device: /dev/hdi failed: No such device or address





The following was in system.log. It makes it look like HDE (drive 2) is the problem, but it also complains about HDC and HDG, so I don't know.
Dec 29 19:10:22 nas-96-E3-46 kernel: NEON flash: Found no Atmel device at location zero
Dec 29 19:10:22 nas-96-E3-46 kernel: This board is not supported.
Dec 29 19:10:22 nas-96-E3-46 kernel: You can use parm_extport=X module parm.
Dec 29 19:10:22 nas-96-E3-46 kernel: ID=6013 on i2c_addr=1f
Dec 29 19:10:22 nas-96-E3-46 kernel: GPIO2X=7c
Dec 29 19:10:22 nas-96-E3-46 kernel: lcd:driver loaded
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID_START
Dec 29 19:10:22 nas-96-E3-46 kernel: startstop XRAID command = start, flash_cache=0
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID clean shutdown indicator: 0x2.
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 2 1 2 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 1 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 1 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 1 = 50dfab8f.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 2 = 4ccff828.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 3 = 50dfab8f.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 4 = 0.
Dec 29 19:10:22 nas-96-E3-46 kernel: recent_ID = 1, select_ID=1, most_ID=2 right_mac=3
Dec 29 19:10:22 nas-96-E3-46 kernel: Selected sb 1, ctime=50dfab8f, id=9a96e346.
Dec 29 19:10:22 nas-96-E3-46 kernel: Use this image: 1
Dec 29 19:10:22 nas-96-E3-46 kernel:
Dec 29 19:10:22 nas-96-E3-46 kernel: VERSION/ID : SB=(V:0.1.0) ID=<9a96e346.00000000.00000000.00000000> CT:50dfab8f
Dec 29 19:10:22 nas-96-E3-46 kernel: RAID_INFO : DISKS(TOTAL:3 RAID:3 PARITY:2 ONL:2 WRK:2 FAILED:1 SPARE:0 BASE:0)
Dec 29 19:10:22 nas-96-E3-46 kernel: SZ:3907008688 UT:00000000 STATE:0 LUNS:2 EXTCMD:1 LSZ:3907008686
Dec 29 19:10:22 nas-96-E3-46 kernel: LOGICAL_DRIVE : 0: B:0000000002 E:0004096000 R:1 O:1 I:0:000000000 DM:5
Dec 29 19:10:22 nas-96-E3-46 kernel: LOGICAL_DRIVE : 1: B:0004096002 E:3902912686 R:4 O:1 I:0:000000000 DM:5
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 0: DISK<N:0/1,hdc(22,0),ID:0,PT:1,SZ:3907008688,ST: B:online>
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 1: DISK<N:1/2,hde(33,0),ID:1,PT:1,SZ:3907008688,ST: :faulty>
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 2: DISK<N:2/3,hdg(34,0),ID:2,PT:1,SZ:3907008688,ST:P :online>
Dec 29 19:10:22 nas-96-E3-46 kernel: CURRENT_DRIVE : DISK<N:0/1,XXX(22,0),ID:0,PT:1,SZ:3907008688,ST: B:online>
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to do drives searching.
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to find leftover drive,total=3,ready=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Found leftover disk in old cfg: position=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Copy leftover disk: position=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Find p d at 2, chn 2
Dec 29 19:10:22 nas-96-E3-46 kernel: drive 1 missing
Dec 29 19:10:22 nas-96-E3-46 kernel: Total=3; raid=3; ready=0; work=2; failed=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Mark chn:2 as offline.
Dec 29 19:10:22 nas-96-E3-46 kernel: Check degraded mode, start_pos=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 1 present, state=1/ACT
Dec 29 19:10:22 nas-96-E3-46 kernel: Error: ide2 need to be non-present, but is.
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 2 present, state=0/FYT, 1
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 3 present, state=1/ACT
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to run X_RAID in degraded mode, total dead=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Adding active interface 0 as failed interface 2
Dec 29 19:10:22 nas-96-E3-46 kernel: Adding missing drive in 2, add=1 morethanproc=1, disk=0
Dec 29 19:10:22 nas-96-E3-46 kernel: Add dead device for not only disk, true device not changed.
Dec 29 19:10:22 nas-96-E3-46 kernel: ide2 at 0x280-0x287,0x288 on irq 33
Dec 29 19:10:22 nas-96-E3-46 kernel: Dump hwif 8041e1a8 structure, 0-8041d398
Dec 29 19:10:22 nas-96-E3-46 kernel: 1-8041daa0|1-8041e1a8|1-8041e8b0|1-8041efb8
Dec 29 19:10:22 nas-96-E3-46 kernel: 0-8041f6c0|0-8041fdc8|0-804204d0|1-80420bd8
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->name---------------------ide2
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hwgroup------------------81f5a340
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->irq----------------------33
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->present------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hold---------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->noprobe^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->true_device^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->state0^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].name----------hde
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].present-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id_read-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].noprobe^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].dead^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id^I^I81f5ad40
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].present-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].id_read-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: ATA DISK drive 8041e240
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: WDC WD20EARS-00MVWB0, hde: enable ATAEXT
Dec 29 19:10:22 nas-96-E3-46 kernel: Dump hwif 8041e1a8 structure, 0-8041d398
Dec 29 19:10:22 nas-96-E3-46 kernel: 1-8041daa0|1-8041e1a8|1-8041e8b0|1-8041efb8
Dec 29 19:10:22 nas-96-E3-46 kernel: 0-8041f6c0|0-8041fdc8|0-804204d0|1-80420bd8
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->name---------------------ide2
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hwgroup------------------81f5a340
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->irq----------------------33
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->present------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hold---------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->noprobe^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->true_device^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->state0^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].name----------hde
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].present-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id_read-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].noprobe^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].dead^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id^I^I81f5ad40
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].present-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].id_read-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: ide2 at 0x280-0x287,0x288 on irq 33
Dec 29 19:10:22 nas-96-E3-46 kernel: idedisk_deaddisk_init on hde
Dec 29 19:10:22 nas-96-E3-46 kernel: ide-disk: hde: from special init, need to reset values.
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: max request size: 512KiB
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: use capacity 3907029168 sectors (2000398 MB)
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive support hpa, still should not change max addr.
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6>hde: 3907008688 sectors (2000388 MB), CHS=65535/255/63
Dec 29 19:10:22 nas-96-E3-46 kernel: Change X_RAID running mode from 0 to 1
Dec 29 19:10:22 nas-96-E3-46 kernel: :::Update backup SB.
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID: recovery thread got woken up ...
Dec 29 19:10:22 nas-96-E3-46 kernel: No drive to use, stop recovery.
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hdc: hdc1 hdc2 hdc3 < hdc5 >
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hde: hde1 hde2 hde3 < hde5 >
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hdg: unknown partition table
Dec 29 19:10:22 nas-96-E3-46 kernel: kjournald starting. Commit interval 5 seconds
Dec 29 19:10:22 nas-96-E3-46 kernel: EXT3 FS on hdc1, internal journal
Dec 29 19:10:22 nas-96-E3-46 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Dec 29 19:10:22 nas-96-E3-46 kernel: linked, 1000mbps mode
Dec 29 19:10:22 nas-96-E3-46 kernel: ::hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef
Dec 29 19:10:22 nas-96-E3-46 kernel:
Dec 29 19:10:22 nas-96-E3-46 kernel: CMD to offline/lost_intr chn(1): 8e7bfae0
Dec 29 19:10:22 nas-96-E3-46 kernel: hdg: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdg: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef


Here's the relevant bit from diagnostics.log:

Disks
-------------------------------
* Disk 1 (model WDC WD20EARS-00MVWB0, serial number WD-WCAZA0761499) has 36 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.
* Disk 2 (model WDC WD20EARS-00MVWB0, serial number WD-WMAZA1000356) has 24 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.
* Disk 3 (model WDC WD20EARS-00MVWB0, serial number WD-WMAZA0999773) has 36 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.


Seriously, all three? Yes, I know the theory that "like drives" will fail at the same time, but it seems suspicious. So can someone tell me if I should be worried?
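
A note on reading the `04 51` register pair in the SMART error log and the matching `status=0x51` / `error=0x04` lines in system.log: both values are bit fields. The sketch below decodes them in Python, assuming the standard ATA bit assignments. The `decode` helper and the two table names are made up for illustration, and the label strings follow the wording the kernel printed above where it printed one. Under those assumptions, 0x04 in the error register is the ABRT (command aborted) bit, which the kernel prints as DriveStatusError, and not the uncorrectable-read bit.

```python
# Assumed standard ATA status-register bits, highest bit first.
STATUS_BITS = {
    0x80: "Busy",
    0x40: "DriveReady",
    0x20: "DeviceFault",
    0x10: "SeekComplete",
    0x08: "DataRequest",
    0x04: "CorrectedError",
    0x02: "Index",
    0x01: "Error",
}

# Assumed standard ATA error-register bits, highest bit first.
ERROR_BITS = {
    0x80: "BadSector/BadCRC",
    0x40: "UncorrectableError",
    0x20: "MediaChanged",
    0x10: "SectorIdNotFound",
    0x08: "MediaChangeRequested",
    0x04: "Aborted (ABRT)",  # the kernel log above prints this as DriveStatusError
    0x02: "TrackZeroNotFound",
    0x01: "AddrMarkNotFound",
}


def decode(value, table):
    """Return the names of all bits set in value, highest bit first."""
    return [name for bit, name in table.items() if value & bit]


if __name__ == "__main__":
    # ST and ER values taken from the logs in this post.
    print(decode(0x51, STATUS_BITS))  # ['DriveReady', 'SeekComplete', 'Error']
    print(decode(0x04, ERROR_BITS))   # ['Aborted (ABRT)']
```

All three drives log this same register pair, on the same SMART WRITE LOG command (and on opcode 0xef in system.log), so the decode on its own does not single out one disk.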

4 Replies

  • mdgm-ntgr
    NETGEAR Employee Retired
    Those ATA errors could just be due to a compatibility issue on old firmware. There are definitely issues with disk 2. Contact support and see what they suggest.
  • mdgm wrote:
    Those ATA errors could just be due to a compatibility issue on old firmware. There are definitely issues with disk 2. Contact support and see what they suggest.

    Why do you think there are issues with Disk 2?

    I put in a support ticket and marked this thread with the ID.
  • mdgm-ntgr
    NETGEAR Employee Retired
    Well, for one thing, the SMART stats for disk 2 show a current pending sector count that is not zero.
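
The check described in this reply can be sketched in Python. This is only an illustration: the function name `flag_attributes` and the `WATCHED` list are assumptions, and which attributes are worth watching is a judgment call, not something from NETGEAR or smartmontools. It reads attribute rows in the layout of the disk_smart.log posted above and reports any watched attribute whose raw value is above zero.

```python
# Hypothetical watch list; chosen for illustration only.
WATCHED = (
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
)


def flag_attributes(smart_text, watched=WATCHED):
    """Return {attribute_name: raw_value} for watched attributes with raw > 0."""
    flagged = {}
    for line in smart_text.splitlines():
        fields = line.split()
        # Row layout: ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW
        if len(fields) >= 10 and fields[0].isdigit() and fields[1] in watched:
            raw = int(fields[9])
            if raw > 0:
                flagged[fields[1]] = raw
    return flagged


# Rows copied from the hde (disk 2) and hdc (disk 1) sections of the posted log.
HDE_SAMPLE = """\
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 14
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 11
"""

HDC_SAMPLE = """\
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
"""

if __name__ == "__main__":
    print(flag_attributes(HDE_SAMPLE))  # {'Current_Pending_Sector': 14, 'Offline_Uncorrectable': 11}
    print(flag_attributes(HDC_SAMPLE))  # {}
```

Run against the posted values, only disk 2 (hde) is flagged, which matches the drive that system.log marks as faulty.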
