NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
jukkaforss
Jun 27, 2019Tutor
kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
Hi, My Readynas hit kernel bug, it running latests code 6.10.1. Dmesg output after it happened. I needed to power cycle to get it rebooted. [1108355.142034] ------------[ cut here ]------------
[1...
- Jun 28, 2019
Shrinking this one down ...
jukkaforss wrote:
sdb part 1
root@readynas:~# smartctl -x /dev/sdb ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 159 9 Power_On_Hours -O--CK 028 028 000 - 52621 Error 9 [8] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: UNC at LBA = 0x12fdf7710 = 5098141456 Error 8 [7] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: WP at LBA = 0x12fdf7710 = 5098141456 Error 7 [6] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: UNC at LBA = 0xcf083840 = 3473422400 Error 6 [5] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: WP at LBA = 0xcf083840 = 3473422400 Error 5 [4] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0c0 = 3894714560 Error 4 [3] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: WP at LBA = 0xe824a0b8 = 3894714552 Error 3 [2] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0b8 = 3894714552 Error 2 [1] occurred at disk power-on lifetime: 38615 hours (1608 days + 23 hours) Error: WP at LBA = 0x11616d4b8 = 4665562296
I saw a similar pattern on one of my WD60EFRX drives a while ago, and when I tested it with Lifeguard it failed. Though a second disk with the same pattern passed Lifeguard. So I recommend testing this disk (and perhaps replace it even if it does pass).
The most recent logged error was about 2000 hours ago (~ 3 months), so that particular error didn't cause the most recent crash. But I'm thinking that this disk likely triggered it anyway.
FWIW, I haven't seen any explanation of how to decode the raw read error rate. But it is quite a bit higher on this drive than your other ones.
StephenB
Jun 27, 2019Guru - Experienced User
I'd look in kernel.log and system.log for disk errors and btrfs errors.
- jukkaforssJun 27, 2019Tutor
Kernel log has same messages, but I couldn't found anything from disk_info and btrfs logs.
No ATA errors or any other problems with disks.
- StephenBJun 27, 2019Guru - Experienced User
If ssh is enabled then smartctl -x might also give a clue.
- jukkaforssJun 28, 2019Tutor
sda
root@readynas:~# smartctl -x /dev/sda smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.4.178.x86_64.1] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD30EFRX-68AX9N0 Serial Number: WD-WMC1T3093795 LU WWN Device Id: 5 0014ee 6adf64469 Firmware Version: 80.00A80 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Fri Jun 28 11:01:44 2019 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, frozen [SEC2] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (40020) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 401) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x70bd) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 19 3 Spin_Up_Time POS--K 210 179 021 - 4475 4 Start_Stop_Count -O--CK 100 100 000 - 135 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 200 200 000 - 0 9 Power_On_Hours -O--CK 028 028 000 - 52622 10 Spin_Retry_Count -O--CK 100 100 000 - 0 11 Calibration_Retry_Count -O--CK 100 100 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 135 192 Power-Off_Retract_Count -O--CK 200 200 000 - 100 193 Load_Cycle_Count -O--CK 200 200 000 - 34 194 Temperature_Celsius -O---K 105 103 000 - 45 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) No Errors Logged SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 51571 - # 2 Extended offline Completed without error 00% 49451 - # 3 Extended offline Completed without error 00% 47246 - # 4 Extended offline Completed without error 00% 45038 - # 5 Extended offline Completed without error 00% 42831 - # 6 Extended offline Completed without error 00% 40690 - # 7 Extended offline Completed without error 00% 38490 - # 8 Extended offline Completed without error 00% 36297 - # 9 Extended offline Completed without error 00% 31941 - #10 Extended offline Completed without error 00% 29743 - #11 Extended offline Completed without error 00% 27530 - #12 Extended offline Completed without error 00% 25340 - #13 Extended offline Completed without error 00% 23167 - #14 Extended offline Completed without error 00% 20964 - #15 Extended offline Completed without error 00% 18761 - #16 Extended offline Completed without error 00% 16586 - #17 Extended offline Completed without error 00% 14453 - #18 Extended offline Completed without error 00% 13048 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) SCT Support Level: 1 Device State: Active (0) Current Temperature: 45 Celsius Power Cycle Min/Max Temperature: 43/46 Celsius Lifetime Min/Max Temperature: 2/47 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (303) Index Estimated Time Temperature Celsius 304 2019-06-28 03:04 46 *************************** ... ..( 5 skipped). .. *************************** 310 2019-06-28 03:10 46 *************************** 311 2019-06-28 03:11 45 ************************** 312 2019-06-28 03:12 45 ************************** 313 2019-06-28 03:13 45 ************************** 314 2019-06-28 03:14 44 ************************* ... ..( 65 skipped). .. ************************* 380 2019-06-28 04:20 44 ************************* 381 2019-06-28 04:21 43 ************************ ... ..( 2 skipped). .. ************************ 384 2019-06-28 04:24 43 ************************ 385 2019-06-28 04:25 44 ************************* ... ..(236 skipped). .. ************************* 144 2019-06-28 08:22 44 ************************* 145 2019-06-28 08:23 45 ************************** ... ..( 23 skipped). .. ************************** 169 2019-06-28 08:47 45 ************************** 170 2019-06-28 08:48 46 *************************** ... ..( 7 skipped). .. *************************** 178 2019-06-28 08:56 46 *************************** 179 2019-06-28 08:57 45 ************************** ... ..( 60 skipped). .. ************************** 240 2019-06-28 09:58 45 ************************** 241 2019-06-28 09:59 44 ************************* ... ..( 15 skipped). .. ************************* 257 2019-06-28 10:15 44 ************************* 258 2019-06-28 10:16 45 ************************** ... ..( 18 skipped). .. ************************** 277 2019-06-28 10:35 45 ************************** 278 2019-06-28 10:36 46 *************************** ... ..( 24 skipped). .. *************************** 303 2019-06-28 11:01 46 *************************** SCT Error Recovery Control: Read: 70 (7.0 seconds) Write: 70 (7.0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 4 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 4 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 82376 Vendor specific
Related Content
- Mar 10, 2018Retired_Member
NETGEAR Academy

Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!