NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
tedder
Dec 29, 2012Aspirant
NV+ had boot issues, do I have a bad drive? #20230163
I have a NV+, version 1, firmware 4.1.7, three 2gb drives.
I had trouble with my NV+ not booting. It would stick on "quota chk" after a given percentage (33.6%, I think?).
Anyhow, it's up now, RAIDar and Frontview both indicate it is resyncing. The logs in Frontview give this:
I downloaded the logs and notice some interesting bits.
Here's the disk_smart.log.. I don't see anything interesting, but I didn't see it on this thread either: https://www.readynas.com/forum/viewtopic.php?f=64&t=61710
The following was in system.log, which looks like HDE (drive 2) is the problem? It also complains about HDC and HDG, so.. I don't know.
Here's the relevant bit from diagnostics.log:
Seriously, all three? Yes, I know the theory that "like drives" will fail at the same time, but it seems suspicious. So can someone tell me if I should be worried?
I had trouble with my NV+ not booting. It would stick on "quota chk" after a given percentage (33.6%, I think?).
Anyhow, it's up now, RAIDar and Frontview both indicate it is resyncing. The logs in Frontview give this:
Access to the disk on channel (??) is producing I/O errors. Although the array is still redundant, please replace this drive as soon as possible, as it is likely to fail soon.
I downloaded the logs and notice some interesting bits.
Here's the disk_smart.log.. I don't see anything interesting, but I didn't see it on this thread either: https://www.readynas.com/forum/viewtopic.php?f=64&t=61710
***** smartctl output for hdc *****
smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WCAZA0761499
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:00 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (38580) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 177 168 021 Pre-fail Always - 6108
4 Start_Stop_Count 0x0032 089 089 000 Old_age Always - 11184
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 074 074 000 Old_age Always - 19532
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 24
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 23
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 738384
194 Temperature_Celsius 0x0022 122 102 000 Old_age Always - 28
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 46
SMART Error Log Version: 1
ATA Error Count: 36 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 36 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.766 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.608 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
Error 35 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.608 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
Error 34 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.452 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE
Error 33 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.295 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE
Error 32 occurred at disk power-on lifetime: 673 hours (28 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 4d+00:34:54.137 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 4d+00:34:53.979 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 4d+00:34:53.799 IDENTIFY DEVICE
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
***** smartctl output for hde *****
smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WMAZA1000356
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:01 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (37680) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 198 198 051 Pre-fail Always - 1081
3 Spin_Up_Time 0x0027 176 171 021 Pre-fail Always - 6175
4 Start_Stop_Count 0x0032 090 090 000 Old_age Always - 10984
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 075 075 000 Old_age Always - 18930
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 25
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 22
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 734949
194 Temperature_Celsius 0x0022 122 101 000 Old_age Always - 28
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 14
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 11
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 152
SMART Error Log Version: 1
ATA Error Count: 24 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 24 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.494 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.488 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
Error 23 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.488 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
Error 22 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.482 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE
Error 21 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.476 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE
Error 20 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 00:00:13.470 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 00:00:13.464 SMART WRITE LOG
ec 08 01 01 00 00 00 00 00:00:13.343 IDENTIFY DEVICE
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
***** smartctl output for hdg *****
smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: WDC WD20EARS-00MVWB0
Serial Number: WD-WMAZA0999773
Firmware Version: 51.0AB51
User Capacity: 2,000,398,934,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Sat Dec 29 20:40:01 2012 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (37560) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 180 169 021 Pre-fail Always - 5975
4 Start_Stop_Count 0x0032 089 089 000 Old_age Always - 11005
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 075 075 000 Old_age Always - 18946
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 19
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 17
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 754110
194 Temperature_Celsius 0x0022 123 103 000 Old_age Always - 27
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
SMART Error Log Version: 1
ATA Error Count: 36 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 36 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.854 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.696 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
Error 35 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.696 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
Error 34 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.539 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE
Error 33 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.382 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE
Error 32 occurred at disk power-on lifetime: 86 hours (3 days + 14 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 01 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.224 SMART WRITE LOG
b0 d6 01 e0 4f c2 40 00 3d+13:38:56.067 SMART WRITE LOG
ec 00 01 01 00 00 a0 00 3d+13:38:55.886 IDENTIFY DEVICE
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
***** smartctl output for hdi *****
smartctl version 5.36 [sparc64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
Smartctl open device: /dev/hdi failed: No such device or address
The following was in system.log, which looks like HDE (drive 2) is the problem? It also complains about HDC and HDG, so.. I don't know.
Dec 29 19:10:22 nas-96-E3-46 kernel: NEON flash: Found no Atmel device at location zero
Dec 29 19:10:22 nas-96-E3-46 kernel: This board is not supported.
Dec 29 19:10:22 nas-96-E3-46 kernel: You can use parm_extport=X module parm.
Dec 29 19:10:22 nas-96-E3-46 kernel: ID=6013 on i2c_addr=1f
Dec 29 19:10:22 nas-96-E3-46 kernel: GPIO2X=7c
Dec 29 19:10:22 nas-96-E3-46 kernel: lcd:driver loaded
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID_START
Dec 29 19:10:22 nas-96-E3-46 kernel: startstop XRAID command = start, flash_cache=0
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID clean shutdown indicator: 0x2.
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 2 1 2 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 1 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 1 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: 0 0 0 0
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 1 = 50dfab8f.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 2 = 4ccff828.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 3 = 50dfab8f.
Dec 29 19:10:22 nas-96-E3-46 kernel: Update time for sb 4 = 0.
Dec 29 19:10:22 nas-96-E3-46 kernel: recent_ID = 1, select_ID=1, most_ID=2 right_mac=3
Dec 29 19:10:22 nas-96-E3-46 kernel: Selected sb 1, ctime=50dfab8f, id=9a96e346.
Dec 29 19:10:22 nas-96-E3-46 kernel: Use this image: 1
Dec 29 19:10:22 nas-96-E3-46 kernel:
Dec 29 19:10:22 nas-96-E3-46 kernel: VERSION/ID : SB=(V:0.1.0) ID=<9a96e346.00000000.00000000.00000000> CT:50dfab8f
Dec 29 19:10:22 nas-96-E3-46 kernel: RAID_INFO : DISKS(TOTAL:3 RAID:3 PARITY:2 ONL:2 WRK:2 FAILED:1 SPARE:0 BASE:0)
Dec 29 19:10:22 nas-96-E3-46 kernel: SZ:3907008688 UT:00000000 STATE:0 LUNS:2 EXTCMD:1 LSZ:3907008686
Dec 29 19:10:22 nas-96-E3-46 kernel: LOGICAL_DRIVE : 0: B:0000000002 E:0004096000 R:1 O:1 I:0:000000000 DM:5
Dec 29 19:10:22 nas-96-E3-46 kernel: LOGICAL_DRIVE : 1: B:0004096002 E:3902912686 R:4 O:1 I:0:000000000 DM:5
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 0: DISK<N:0/1,hdc(22,0),ID:0,PT:1,SZ:3907008688,ST: B:online>
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 1: DISK<N:1/2,hde(33,0),ID:1,PT:1,SZ:3907008688,ST: :faulty>
Dec 29 19:10:22 nas-96-E3-46 kernel: PHYSICAL_DRIVE: 2: DISK<N:2/3,hdg(34,0),ID:2,PT:1,SZ:3907008688,ST:P :online>
Dec 29 19:10:22 nas-96-E3-46 kernel: CURRENT_DRIVE : DISK<N:0/1,XXX(22,0),ID:0,PT:1,SZ:3907008688,ST: B:online>
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to do drives searching.
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to find leftover drive,total=3,ready=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Found leftover disk in old cfg: position=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Copy leftover disk: position=2
Dec 29 19:10:22 nas-96-E3-46 kernel: Find p d at 2, chn 2
Dec 29 19:10:22 nas-96-E3-46 kernel: drive 1 missing
Dec 29 19:10:22 nas-96-E3-46 kernel: Total=3; raid=3; ready=0; work=2; failed=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Mark chn:2 as offline.
Dec 29 19:10:22 nas-96-E3-46 kernel: Check degraded mode, start_pos=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 1 present, state=1/ACT
Dec 29 19:10:22 nas-96-E3-46 kernel: Error: ide2 need to be non-present, but is.
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 2 present, state=0/FYT, 1
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive 3 present, state=1/ACT
Dec 29 19:10:22 nas-96-E3-46 kernel: Need to run X_RAID in degraded mode, total dead=1
Dec 29 19:10:22 nas-96-E3-46 kernel: Adding active interface 0 as failed interface 2
Dec 29 19:10:22 nas-96-E3-46 kernel: Adding missing drive in 2, add=1 morethanproc=1, disk=0
Dec 29 19:10:22 nas-96-E3-46 kernel: Add dead device for not only disk, true device not changed.
Dec 29 19:10:22 nas-96-E3-46 kernel: ide2 at 0x280-0x287,0x288 on irq 33
Dec 29 19:10:22 nas-96-E3-46 kernel: Dump hwif 8041e1a8 structure, 0-8041d398
Dec 29 19:10:22 nas-96-E3-46 kernel: 1-8041daa0|1-8041e1a8|1-8041e8b0|1-8041efb8
Dec 29 19:10:22 nas-96-E3-46 kernel: 0-8041f6c0|0-8041fdc8|0-804204d0|1-80420bd8
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->name---------------------ide2
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hwgroup------------------81f5a340
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->irq----------------------33
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->present------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hold---------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->noprobe^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->true_device^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->state0^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].name----------hde
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].present-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id_read-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].noprobe^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].dead^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id^I^I81f5ad40
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].present-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].id_read-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: ATA DISK drive 8041e240
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: WDC WD20EARS-00MVWB0, hde: enable ATAEXT
Dec 29 19:10:22 nas-96-E3-46 kernel: Dump hwif 8041e1a8 structure, 0-8041d398
Dec 29 19:10:22 nas-96-E3-46 kernel: 1-8041daa0|1-8041e1a8|1-8041e8b0|1-8041efb8
Dec 29 19:10:22 nas-96-E3-46 kernel: 0-8041f6c0|0-8041fdc8|0-804204d0|1-80420bd8
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->name---------------------ide2
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hwgroup------------------81f5a340
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->irq----------------------33
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->present------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->hold---------------------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->noprobe^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->true_device^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->state0^I^I^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].name----------hde
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].present-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id_read-------1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].noprobe^I0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].dead^I^I1
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[0].id^I^I81f5ad40
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].present-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: hwif->drives[1].id_read-------0
Dec 29 19:10:22 nas-96-E3-46 kernel: ide2 at 0x280-0x287,0x288 on irq 33
Dec 29 19:10:22 nas-96-E3-46 kernel: idedisk_deaddisk_init on hde
Dec 29 19:10:22 nas-96-E3-46 kernel: ide-disk: hde: from special init, need to reset values.
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: max request size: 512KiB
Dec 29 19:10:22 nas-96-E3-46 kernel: hde: use capacity 3907029168 sectors (2000398 MB)
Dec 29 19:10:22 nas-96-E3-46 kernel: Drive support hpa, still should not change max addr.
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6>hde: 3907008688 sectors (2000388 MB), CHS=65535/255/63
Dec 29 19:10:22 nas-96-E3-46 kernel: Change X_RAID running mode from 0 to 1
Dec 29 19:10:22 nas-96-E3-46 kernel: :::Update backup SB.
Dec 29 19:10:22 nas-96-E3-46 kernel: X_RAID: recovery thread got woken up ...
Dec 29 19:10:22 nas-96-E3-46 kernel: No drive to use, stop recovery.
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hdc: hdc1 hdc2 hdc3 < hdc5 >
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hde: hde1 hde2 hde3 < hde5 >
Dec 29 19:10:22 nas-96-E3-46 kernel: :<6> hdg: unknown partition table
Dec 29 19:10:22 nas-96-E3-46 kernel: kjournald starting. Commit interval 5 seconds
Dec 29 19:10:22 nas-96-E3-46 kernel: EXT3 FS on hdc1, internal journal
Dec 29 19:10:22 nas-96-E3-46 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Dec 29 19:10:22 nas-96-E3-46 kernel: linked, 1000mbps mode
Dec 29 19:10:22 nas-96-E3-46 kernel: ::hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef
Dec 29 19:10:22 nas-96-E3-46 kernel:
Dec 29 19:10:22 nas-96-E3-46 kernel: CMD to offline/lost_intr chn(1): 8e7bfae0
Dec 29 19:10:22 nas-96-E3-46 kernel: hdg: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdg: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Dec 29 19:10:22 nas-96-E3-46 kernel: hdc: drive_cmd: error=0x04 { DriveStatusError }
Dec 29 19:10:22 nas-96-E3-46 kernel: ide: failed opcode was: 0xef
Here's the relevant bit from diagnostics.log:
Disks
-------------------------------
* Disk 1 (model WDC WD20EARS-00MVWB0, serial number WD-WCAZA0761499) has 36 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.
* Disk 2 (model WDC WD20EARS-00MVWB0, serial number WD-WMAZA1000356) has 24 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.
* Disk 3 (model WDC WD20EARS-00MVWB0, serial number WD-WMAZA0999773) has 36 ATA errors. ATA errors are logged when the disk fails to complete an internal command. This can be a sign that the disk is starting to fail.
Seriously, all three? Yes, I know the theory that "like drives" will fail at the same time, but it seems suspicious. So can someone tell me if I should be worried?
4 Replies
Replies have been turned off for this discussion
- mdgm-ntgrNETGEAR Employee RetiredThose ATA errors could just be due to a compatibility issue on old firmware. There are definitely issues with disk 2. Contact support and see what they suggest
- tedderAspirant
mdgm wrote: Those ATA errors could just be due to a compatibility issue on old firmware. There are definitely issues with disk 2. Contact support and see what they suggest
Why do you think there are issues with Disk 2?
I put in a support ticket and marked this thread with the ID. - mdgm-ntgrNETGEAR Employee RetiredWell there's things like in the SMART stats for disk 2, the current pending sector count is not zero.
- tedderAspirantThanks for your help, mdgm!
Related Content
NETGEAR Academy
Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!