- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
My Readynas hit kernel bug, it running latests code 6.10.1.
Dmesg output after it happened. I needed to power cycle to get it rebooted.
[1108355.142034] ------------[ cut here ]------------ [1108355.146933] kernel BUG at fs/btrfs/extent_io.c:3400! [1108355.152192] invalid opcode: 0000 [#1] SMP [1108355.301048] Modules linked in: vpd(PO) [1108355.305129] CPU: 0 PID: 4578 Comm: nfsd Tainted: P O 4.4.178.x86_64.1 #1 [1108355.313300] Hardware name: NETGEAR ReadyNAS 314/To be filled by O.E.M., BIOS 4.6.5 11/05/2013 [1108355.322211] task: ffff8800c3362a00 ti: ffff8800c34d8000 task.ti: ffff8800c34d8000 [1108355.330033] RIP: 0010:[<ffffffff882b5f5a>] [<ffffffff882b5f5a>] __extent_writepage_io+0x1d3/0x398 [1108355.339383] RSP: 0018:ffff8800c34db8d8 EFLAGS: 00010206 [1108355.344991] RAX: ffff880108549870 RBX: ffffea0000d2ba80 RCX: 0000007d07005000 [1108355.352456] RDX: 0000007d07005000 RSI: 0000007d06ffd000 RDI: 0000000000000000 [1108355.359908] RBP: ffff8800c34db978 R08: 0000000000001000 R09: 0000000000000001 [1108355.367396] R10: ffffea0000f0bd40 R11: ffff8800ad6b5488 R12: 0000007d07014000 [1108355.374860] R13: ffff8800c34dbb58 R14: 0000000000000000 R15: 0000000000001000 [1108355.382324] FS: 0000000000000000(0000) GS:ffff88011fc00000(0000) knlGS:0000000000000000 [1108355.390783] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [1108355.396857] CR2: 00007f2ad2a30000 CR3: 00000000c72e9000 CR4: 00000000000006f0 [1108355.404345] Stack: [1108355.406591] ffff8800c6f31ab0 ffff8800c6f31900 000000fa00000000 0000000000001000 [1108355.414442] 0000007d06ffd000 0000000000000000 ffff8800c34dbaf8 ffff8800c6f31ab0 [1108355.422276] ffff8800c6f31900 0000000000000000 ffff8800c34db988 0000007d07014fff [1108355.430093] Call Trace: [1108355.432782] [<ffffffff882b7579>] __extent_writepage+0x176/0x1db [1108355.439119] [<ffffffff882b7885>] extent_write_cache_pages.isra.10.constprop.26+0x2a7/0x374 [1108355.447881] [<ffffffff8807d2ff>] ? ttwu_do_activate.constprop.23+0x57/0x5c [1108355.455197] [<ffffffff882b7d3f>] extent_writepages+0x47/0x58 [1108355.461274] [<ffffffff8829bfab>] ? uncompress_inline+0x148/0x148 [1108355.467720] [<ffffffff8829bc08>] btrfs_writepages+0x23/0x25 [1108355.473764] [<ffffffff880e68b3>] do_writepages+0x1e/0x28 [1108355.479483] [<ffffffff880de30e>] __filemap_fdatawrite_range+0xb2/0xca [1108355.486349] [<ffffffff880de3a3>] filemap_fdatawrite_range+0xe/0x10 [1108355.492969] [<ffffffff882ae6bb>] btrfs_fdatawrite_range+0x1b/0x41 [1108355.499477] [<ffffffff882ae71c>] start_ordered_ops+0x3b/0x5a [1108355.505561] [<ffffffff882ae794>] btrfs_sync_file+0x59/0x2da [1108355.511550] [<ffffffff883168b3>] ? security_file_open+0x79/0x80 [1108355.517877] [<ffffffff881416d2>] vfs_fsync_range+0x86/0x95 [1108355.523780] [<ffffffff881fb87e>] nfsd_vfs_write+0x219/0x265 [1108355.529758] [<ffffffff881fd531>] nfsd_write+0xa6/0xc6 [1108355.535202] [<ffffffff88202552>] nfsd3_proc_write+0x90/0xab [1108355.541175] [<ffffffff881f7ec3>] nfsd_dispatch+0xcd/0x189 [1108355.546991] [<ffffffff888c825b>] svc_process+0x582/0x6b6 [1108355.552711] [<ffffffff881f7924>] ? nfsd_destroy+0x57/0x57 [1108355.558516] [<ffffffff881f7a19>] nfsd+0xf5/0x147 [1108355.563527] [<ffffffff88078c4a>] kthread+0xdc/0xe4 [1108355.568693] [<ffffffff88078b6e>] ? kthread_worker_fn+0x129/0x129 [1108355.575098] [<ffffffff888e476f>] ret_from_fork+0x3f/0x80 [1108355.580807] [<ffffffff88078b6e>] ? kthread_worker_fn+0x129/0x129 [1108355.587229] Code: 48 3d 01 f0 ff ff 44 0f 43 d0 45 89 d6 e9 c3 01 00 00 48 8b 70 18 48 89 f1 48 03 48 20 48 89 75 80 48 89 ca 72 07 49 39 cc 72 06 <0f> 0b 48 83 ca ff 4c 29 e2 48 8b b5 78 ff ff ff 48 8b 78 70 4c [1108355.608271] RIP [<ffffffff882b5f5a>] __extent_writepage_io+0x1d3/0x398 [1108355.615266] RSP <ffff8800c34db8d8> [1108355.619704] ---[ end trace 6b6431a8ef19cb1c ]---
Solved! Go to Solution.
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Shrinking this one down ...
@jukkaforss wrote:
sdb part 1
root@readynas:~# smartctl -x /dev/sdb ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 159 9 Power_On_Hours -O--CK 028 028 000 - 52621 Error 9 [8] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: UNC at LBA = 0x12fdf7710 = 5098141456 Error 8 [7] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: WP at LBA = 0x12fdf7710 = 5098141456 Error 7 [6] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: UNC at LBA = 0xcf083840 = 3473422400 Error 6 [5] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: WP at LBA = 0xcf083840 = 3473422400 Error 5 [4] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0c0 = 3894714560 Error 4 [3] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: WP at LBA = 0xe824a0b8 = 3894714552 Error 3 [2] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0b8 = 3894714552 Error 2 [1] occurred at disk power-on lifetime: 38615 hours (1608 days + 23 hours) Error: WP at LBA = 0x11616d4b8 = 4665562296
I saw a similar pattern on one of my WD60EFRX drives a while ago, and when I tested it with Lifeguard it failed. Though a second disk with the same pattern passed Lifeguard. So I recommend testing this disk (and perhaps replace it even if it does pass).
The most recent logged error was about 2000 hours ago (~ 3 months), so that particular error didn't cause the most recent crash. But I'm thinking that this disk likely triggered it anyway.
FWIW, I haven't seen any explanation of how to decode the raw read error rate. But it is quite a bit higher on this drive than your other ones.
All Replies
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
I'd look in kernel.log and system.log for disk errors and btrfs errors.
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
Kernel log has same messages, but I couldn't found anything from disk_info and btrfs logs.
No ATA errors or any other problems with disks.
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
If ssh is enabled then smartctl -x might also give a clue.
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
sda
root@readynas:~# smartctl -x /dev/sda smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.4.178.x86_64.1] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD30EFRX-68AX9N0 Serial Number: WD-WMC1T3093795 LU WWN Device Id: 5 0014ee 6adf64469 Firmware Version: 80.00A80 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Fri Jun 28 11:01:44 2019 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, frozen [SEC2] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (40020) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 401) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x70bd) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 19 3 Spin_Up_Time POS--K 210 179 021 - 4475 4 Start_Stop_Count -O--CK 100 100 000 - 135 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 200 200 000 - 0 9 Power_On_Hours -O--CK 028 028 000 - 52622 10 Spin_Retry_Count -O--CK 100 100 000 - 0 11 Calibration_Retry_Count -O--CK 100 100 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 135 192 Power-Off_Retract_Count -O--CK 200 200 000 - 100 193 Load_Cycle_Count -O--CK 200 200 000 - 34 194 Temperature_Celsius -O---K 105 103 000 - 45 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) No Errors Logged SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 51571 - # 2 Extended offline Completed without error 00% 49451 - # 3 Extended offline Completed without error 00% 47246 - # 4 Extended offline Completed without error 00% 45038 - # 5 Extended offline Completed without error 00% 42831 - # 6 Extended offline Completed without error 00% 40690 - # 7 Extended offline Completed without error 00% 38490 - # 8 Extended offline Completed without error 00% 36297 - # 9 Extended offline Completed without error 00% 31941 - #10 Extended offline Completed without error 00% 29743 - #11 Extended offline Completed without error 00% 27530 - #12 Extended offline Completed without error 00% 25340 - #13 Extended offline Completed without error 00% 23167 - #14 Extended offline Completed without error 00% 20964 - #15 Extended offline Completed without error 00% 18761 - #16 Extended offline Completed without error 00% 16586 - #17 Extended offline Completed without error 00% 14453 - #18 Extended offline Completed without error 00% 13048 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) SCT Support Level: 1 Device State: Active (0) Current Temperature: 45 Celsius Power Cycle Min/Max Temperature: 43/46 Celsius Lifetime Min/Max Temperature: 2/47 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (303) Index Estimated Time Temperature Celsius 304 2019-06-28 03:04 46 *************************** ... ..( 5 skipped). .. *************************** 310 2019-06-28 03:10 46 *************************** 311 2019-06-28 03:11 45 ************************** 312 2019-06-28 03:12 45 ************************** 313 2019-06-28 03:13 45 ************************** 314 2019-06-28 03:14 44 ************************* ... ..( 65 skipped). .. ************************* 380 2019-06-28 04:20 44 ************************* 381 2019-06-28 04:21 43 ************************ ... ..( 2 skipped). .. ************************ 384 2019-06-28 04:24 43 ************************ 385 2019-06-28 04:25 44 ************************* ... ..(236 skipped). .. ************************* 144 2019-06-28 08:22 44 ************************* 145 2019-06-28 08:23 45 ************************** ... ..( 23 skipped). .. ************************** 169 2019-06-28 08:47 45 ************************** 170 2019-06-28 08:48 46 *************************** ... ..( 7 skipped). .. *************************** 178 2019-06-28 08:56 46 *************************** 179 2019-06-28 08:57 45 ************************** ... ..( 60 skipped). .. ************************** 240 2019-06-28 09:58 45 ************************** 241 2019-06-28 09:59 44 ************************* ... ..( 15 skipped). .. ************************* 257 2019-06-28 10:15 44 ************************* 258 2019-06-28 10:16 45 ************************** ... ..( 18 skipped). .. ************************** 277 2019-06-28 10:35 45 ************************** 278 2019-06-28 10:36 46 *************************** ... ..( 24 skipped). .. *************************** 303 2019-06-28 11:01 46 *************************** SCT Error Recovery Control: Read: 70 (7.0 seconds) Write: 70 (7.0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 4 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 4 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 82376 Vendor specific
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
sdc
root@readynas:~# smartctl -x /dev/sdc smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.4.178.x86_64.1] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD30EFRX-68AX9N0 Serial Number: WD-WMC1T3536142 LU WWN Device Id: 5 0014ee 603508514 Firmware Version: 80.00A80 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Fri Jun 28 11:02:55 2019 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, frozen [SEC2] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (42420) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 425) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x70bd) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 5 3 Spin_Up_Time POS--K 211 183 021 - 4450 4 Start_Stop_Count -O--CK 100 100 000 - 133 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 200 200 000 - 0 9 Power_On_Hours -O--CK 029 029 000 - 52517 10 Spin_Retry_Count -O--CK 100 100 000 - 0 11 Calibration_Retry_Count -O--CK 100 100 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 133 192 Power-Off_Retract_Count -O--CK 200 200 000 - 98 193 Load_Cycle_Count -O--CK 200 200 000 - 34 194 Temperature_Celsius -O---K 101 099 000 - 49 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) No Errors Logged SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 51468 - # 2 Extended offline Completed without error 00% 49348 - # 3 Extended offline Completed without error 00% 47142 - # 4 Extended offline Completed without error 00% 44933 - # 5 Extended offline Completed without error 00% 42735 - # 6 Extended offline Completed without error 00% 40586 - # 7 Extended offline Completed without error 00% 38385 - # 8 Extended offline Completed without error 00% 36197 - # 9 Extended offline Completed without error 00% 31836 - #10 Extended offline Completed without error 00% 29639 - #11 Extended offline Completed without error 00% 27427 - #12 Extended offline Completed without error 00% 25240 - #13 Extended offline Completed without error 00% 23064 - #14 Extended offline Completed without error 00% 20861 - #15 Extended offline Completed without error 00% 18661 - #16 Extended offline Completed without error 00% 14348 - #17 Extended offline Completed without error 00% 12944 - #18 Extended offline Completed without error 00% 12775 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) SCT Support Level: 1 Device State: Active (0) Current Temperature: 49 Celsius Power Cycle Min/Max Temperature: 47/50 Celsius Lifetime Min/Max Temperature: 2/51 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (189) Index Estimated Time Temperature Celsius 190 2019-06-28 03:05 49 ****************************** ... ..( 8 skipped). .. ****************************** 199 2019-06-28 03:14 49 ****************************** 200 2019-06-28 03:15 48 ***************************** ... ..( 62 skipped). .. ***************************** 263 2019-06-28 04:18 48 ***************************** 264 2019-06-28 04:19 47 **************************** ... ..( 8 skipped). .. **************************** 273 2019-06-28 04:28 47 **************************** 274 2019-06-28 04:29 48 ***************************** ... ..( 24 skipped). .. ***************************** 299 2019-06-28 04:54 48 ***************************** 300 2019-06-28 04:55 47 **************************** ... ..( 9 skipped). .. **************************** 310 2019-06-28 05:05 47 **************************** 311 2019-06-28 05:06 48 ***************************** ... ..( 79 skipped). .. ***************************** 391 2019-06-28 06:26 48 ***************************** 392 2019-06-28 06:27 47 **************************** ... ..( 15 skipped). .. **************************** 408 2019-06-28 06:43 47 **************************** 409 2019-06-28 06:44 48 ***************************** ... ..( 74 skipped). .. ***************************** 6 2019-06-28 07:59 48 ***************************** 7 2019-06-28 08:00 47 **************************** ... ..( 5 skipped). .. **************************** 13 2019-06-28 08:06 47 **************************** 14 2019-06-28 08:07 48 ***************************** ... ..( 17 skipped). .. ***************************** 32 2019-06-28 08:25 48 ***************************** 33 2019-06-28 08:26 49 ****************************** ... ..( 22 skipped). .. ****************************** 56 2019-06-28 08:49 49 ****************************** 57 2019-06-28 08:50 50 ******************************* ... ..( 3 skipped). .. ******************************* 61 2019-06-28 08:54 50 ******************************* 62 2019-06-28 08:55 49 ****************************** ... ..( 63 skipped). .. ****************************** 126 2019-06-28 09:59 49 ****************************** 127 2019-06-28 10:00 48 ***************************** ... ..( 23 skipped). .. ***************************** 151 2019-06-28 10:24 48 ***************************** 152 2019-06-28 10:25 49 ****************************** ... ..( 16 skipped). .. ****************************** 169 2019-06-28 10:42 49 ****************************** 170 2019-06-28 10:43 50 ******************************* ... ..( 18 skipped). .. ******************************* 189 2019-06-28 11:02 50 ******************************* SCT Error Recovery Control: Read: 70 (7.0 seconds) Write: 70 (7.0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 4 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 5 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 82446 Vendor specific
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
sdd
root@readynas:~# smartctl -x /dev/sdd smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.4.178.x86_64.1] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD30EFRX-68EUZN0 Serial Number: WD-WCC4N6CF8LYK LU WWN Device Id: 5 0014ee 262e2cb16 Firmware Version: 82.00A82 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 5400 rpm Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Fri Jun 28 11:02:57 2019 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, frozen [SEC2] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (39060) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 392) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x703d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 0 3 Spin_Up_Time POS--K 213 194 021 - 4341 4 Start_Stop_Count -O--CK 100 100 000 - 33 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 200 200 000 - 0 9 Power_On_Hours -O--CK 066 066 000 - 25031 10 Spin_Retry_Count -O--CK 100 253 000 - 0 11 Calibration_Retry_Count -O--CK 100 253 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 33 192 Power-Off_Retract_Count -O--CK 200 200 000 - 23 193 Load_Cycle_Count -O--CK 200 200 000 - 2645 194 Temperature_Celsius -O---K 102 100 000 - 48 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) No Errors Logged SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 23976 - # 2 Extended offline Completed without error 00% 21858 - # 3 Extended offline Completed without error 00% 19655 - # 4 Extended offline Completed without error 00% 17448 - # 5 Extended offline Completed without error 00% 15244 - # 6 Extended offline Completed without error 00% 13103 - # 7 Extended offline Completed without error 00% 10902 - # 8 Extended offline Completed without error 00% 8704 - # 9 Extended offline Completed without error 00% 4357 - #10 Extended offline Completed without error 00% 2159 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) SCT Support Level: 1 Device State: Active (0) Current Temperature: 48 Celsius Power Cycle Min/Max Temperature: 46/49 Celsius Lifetime Min/Max Temperature: 2/50 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (50) Index Estimated Time Temperature Celsius 51 2019-06-28 03:05 48 ***************************** ... ..( 43 skipped). .. ***************************** 95 2019-06-28 03:49 48 ***************************** 96 2019-06-28 03:50 47 **************************** ... ..( 51 skipped). .. **************************** 148 2019-06-28 04:42 47 **************************** 149 2019-06-28 04:43 48 ***************************** ... ..( 44 skipped). .. ***************************** 194 2019-06-28 05:28 48 ***************************** 195 2019-06-28 05:29 46 *************************** 196 2019-06-28 05:30 46 *************************** 197 2019-06-28 05:31 47 **************************** ... ..( 33 skipped). .. **************************** 231 2019-06-28 06:05 47 **************************** 232 2019-06-28 06:06 46 *************************** ... ..(261 skipped). .. *************************** 16 2019-06-28 10:28 46 *************************** 17 2019-06-28 10:29 47 **************************** ... ..( 11 skipped). .. **************************** 29 2019-06-28 10:41 47 **************************** 30 2019-06-28 10:42 48 ***************************** ... ..( 19 skipped). .. ***************************** 50 2019-06-28 11:02 48 ***************************** SCT Error Recovery Control: Read: 70 (7.0 seconds) Write: 70 (7.0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 3 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 4 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 82448 Vendor specific
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
sdb part 1
root@readynas:~# smartctl -x /dev/sdb smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.4.178.x86_64.1] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD30EFRX-68AX9N0 Serial Number: WD-WMC1T3056108 LU WWN Device Id: 5 0014ee 658a0a0ff Firmware Version: 80.00A80 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Fri Jun 28 11:11:34 2019 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, frozen [SEC2] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (39120) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 393) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x70bd) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 159 3 Spin_Up_Time POS--K 210 178 021 - 4475 4 Start_Stop_Count -O--CK 100 100 000 - 134 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 200 200 000 - 0 9 Power_On_Hours -O--CK 028 028 000 - 52621 10 Spin_Retry_Count -O--CK 100 100 000 - 0 11 Calibration_Retry_Count -O--CK 100 100 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 134 192 Power-Off_Retract_Count -O--CK 200 200 000 - 98 193 Load_Cycle_Count -O--CK 200 200 000 - 35 194 Temperature_Celsius -O---K 101 099 000 - 49 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) Device Error Count: 9 CR = Command Register FEATR = Features Register COUNT = Count (was: Sector Count) Register LBA_48 = Upper bytes of LBA High/Mid/Low Registers ] ATA-8 LH = LBA High (was: Cylinder High) Register ] LBA LM = LBA Mid (was: Cylinder Low) Register ] Register LL = LBA Low (was: Sector Number) Register ] DV = Device (was: Device/Head) Register DC = Device Control Register ER = Error register ST = Status register Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 9 [8] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 01 2f df 77 10 40 00 Error: UNC at LBA = 0x12fdf7710 = 5098141456 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 00 10 00 60 00 00 67 4d 52 b0 40 08 01:04:39.235 READ FPDMA QUEUED 61 00 40 00 58 00 01 24 f2 4a 00 40 08 01:04:39.235 WRITE FPDMA QUEUED 61 00 40 00 50 00 01 24 f2 49 80 40 08 01:04:39.235 WRITE FPDMA QUEUED 61 00 10 00 48 00 00 01 73 7a 18 40 08 01:04:39.235 WRITE FPDMA QUEUED 60 00 80 00 40 00 01 2f df 76 c0 40 08 01:04:39.235 READ FPDMA QUEUED Error 8 [7] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 01 2f df 77 10 40 00 Error: WP at LBA = 0x12fdf7710 = 5098141456 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 40 00 50 00 01 24 f2 80 00 40 08 01:04:36.141 WRITE FPDMA QUEUED 61 00 40 00 48 00 01 24 f2 7f c0 40 08 01:04:36.140 WRITE FPDMA QUEUED 61 00 40 00 40 00 01 24 f2 59 c0 40 08 01:04:36.140 WRITE FPDMA QUEUED 61 00 40 00 38 00 01 24 f2 56 00 40 08 01:04:36.140 WRITE FPDMA QUEUED 61 00 40 00 30 00 01 24 f7 ad c0 40 08 01:04:36.140 WRITE FPDMA QUEUED Error 7 [6] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 cf 08 38 40 40 00 Error: UNC at LBA = 0xcf083840 = 3473422400 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 00 d0 00 00 00 00 04 bd 4a 40 40 08 02:27:16.202 READ FPDMA QUEUED 60 01 80 00 f0 00 00 04 bd 48 40 40 08 02:27:16.202 READ FPDMA QUEUED 60 01 80 00 e8 00 00 04 bd 46 40 40 08 02:27:16.202 READ FPDMA QUEUED 60 01 80 00 e0 00 00 04 bd 44 40 40 08 02:27:16.202 READ FPDMA QUEUED 60 01 80 00 d0 00 00 04 bd 40 40 40 08 02:27:16.202 READ FPDMA QUEUED Error 6 [5] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 cf 08 38 40 40 00 Error: WP at LBA = 0xcf083840 = 3473422400 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 40 00 a8 00 00 cf 02 e2 c0 40 08 02:27:13.298 WRITE FPDMA QUEUED 60 00 d0 00 a0 00 00 04 bd 3a 40 40 08 02:27:13.289 READ FPDMA QUEUED 60 01 30 00 98 00 00 04 bd 38 90 40 08 02:27:13.289 READ FPDMA QUEUED 60 00 50 00 90 00 00 04 bd 38 40 40 08 02:27:13.280 READ FPDMA QUEUED 60 00 b0 00 88 00 00 04 bd 37 10 40 08 02:27:13.280 READ FPDMA QUEUED Error 5 [4] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 e8 24 a0 c0 40 00 Error: UNC at LBA = 0xe824a0c0 = 3894714560 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 01 80 00 88 00 00 5a 64 0e 40 40 08 06:24:21.982 READ FPDMA QUEUED 60 00 50 00 80 00 00 5a 64 10 40 40 08 06:24:21.982 READ FPDMA QUEUED 60 00 08 00 78 00 00 e8 24 a0 c0 40 08 06:24:21.982 READ FPDMA QUEUED 60 00 78 00 70 00 00 e8 24 a0 c8 40 08 06:24:21.982 READ FPDMA QUEUED 60 00 30 00 68 00 00 5a 64 05 90 40 08 06:24:21.982 READ FPDMA QUEUED Error 4 [3] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 e8 24 a0 b8 40 00 Error: WP at LBA = 0xe824a0b8 = 3894714552 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 40 00 28 00 00 88 14 21 c0 40 08 06:24:19.084 WRITE FPDMA QUEUED 60 00 80 00 20 00 00 e8 24 a0 40 40 08 06:24:19.084 READ FPDMA QUEUED 60 00 78 00 18 00 00 e8 24 a0 c8 40 08 06:24:19.083 READ FPDMA QUEUED 60 00 08 00 10 00 00 e8 24 a0 c0 40 08 06:24:19.083 READ FPDMA QUEUED 60 00 50 00 08 00 00 5a 64 10 40 40 08 06:24:19.083 READ FPDMA QUEUED Error 3 [2] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 00 e8 24 a0 b8 40 00 Error: UNC at LBA = 0xe824a0b8 = 3894714552 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 60 01 80 00 50 00 00 e8 24 a0 40 40 08 06:24:16.156 READ FPDMA QUEUED 60 00 80 00 48 00 00 e8 24 9f 40 40 08 06:24:16.156 READ FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 e0 08 06:24:16.131 FLUSH CACHE EXT 61 00 01 00 38 00 00 00 90 00 48 40 08 06:24:16.131 WRITE FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 e0 08 06:24:16.131 FLUSH CACHE EXT Error 2 [1] occurred at disk power-on lifetime: 38615 hours (1608 days + 23 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 40 -- 51 00 00 00 01 16 16 d4 b8 40 00 Error: WP at LBA = 0x11616d4b8 = 4665562296 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 08 00 90 00 00 00 93 fe c0 40 08 02:53:56.605 WRITE FPDMA QUEUED 60 00 80 00 88 00 01 16 16 d4 40 40 08 02:53:56.605 READ FPDMA QUEUED 60 00 80 00 80 00 01 16 16 d4 c0 40 08 02:53:56.605 READ FPDMA QUEUED 60 00 80 00 78 00 01 16 16 d5 40 40 08 02:53:56.605 READ FPDMA QUEUED 60 00 08 00 70 00 00 00 71 98 98 40 08 02:53:56.605 READ FPDMA QUEUED
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Re: kernel bug ReadyNASOS 6.10.1 screen massage __extent_writepsge_io+1d3
sdb part 2
SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 51573 - # 2 Extended offline Completed without error 00% 49455 - # 3 Extended offline Completed without error 00% 47246 - # 4 Extended offline Completed without error 00% 45039 - # 5 Extended offline Completed without error 00% 42872 - # 6 Extended offline Completed without error 00% 40691 - # 7 Extended offline Completed without error 00% 38490 - # 8 Extended offline Completed without error 00% 36311 - # 9 Extended offline Completed without error 00% 31941 - #10 Extended offline Completed without error 00% 29750 - #11 Extended offline Completed without error 00% 27530 - #12 Extended offline Completed without error 00% 25341 - #13 Extended offline Completed without error 00% 23167 - #14 Extended offline Completed without error 00% 20964 - #15 Extended offline Completed without error 00% 18761 - #16 Extended offline Completed without error 00% 16586 - #17 Extended offline Completed without error 00% 14452 - #18 Extended offline Completed without error 00% 13048 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) SCT Support Level: 1 Device State: Active (0) Current Temperature: 49 Celsius Power Cycle Min/Max Temperature: 47/50 Celsius Lifetime Min/Max Temperature: 2/51 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (203) Index Estimated Time Temperature Celsius 204 2019-06-28 03:14 49 ****************************** ... ..( 8 skipped). .. ****************************** 213 2019-06-28 03:23 49 ****************************** 214 2019-06-28 03:24 48 ***************************** ... ..( 47 skipped). .. ***************************** 262 2019-06-28 04:12 48 ***************************** 263 2019-06-28 04:13 47 **************************** ... ..( 14 skipped). .. **************************** 278 2019-06-28 04:28 47 **************************** 279 2019-06-28 04:29 48 ***************************** ... ..( 26 skipped). .. ***************************** 306 2019-06-28 04:56 48 ***************************** 307 2019-06-28 04:57 47 **************************** ... ..( 8 skipped). .. **************************** 316 2019-06-28 05:06 47 **************************** 317 2019-06-28 05:07 48 ***************************** ... ..( 5 skipped). .. ***************************** 323 2019-06-28 05:13 48 ***************************** 324 2019-06-28 05:14 47 **************************** ... ..( 17 skipped). .. **************************** 342 2019-06-28 05:32 47 **************************** 343 2019-06-28 05:33 48 ***************************** ... ..(141 skipped). .. ***************************** 7 2019-06-28 07:55 48 ***************************** 8 2019-06-28 07:56 47 **************************** ... ..( 10 skipped). .. **************************** 19 2019-06-28 08:07 47 **************************** 20 2019-06-28 08:08 48 ***************************** ... ..( 16 skipped). .. ***************************** 37 2019-06-28 08:25 48 ***************************** 38 2019-06-28 08:26 49 ****************************** ... ..( 97 skipped). .. ****************************** 136 2019-06-28 10:04 49 ****************************** 137 2019-06-28 10:05 48 ***************************** ... ..( 17 skipped). .. ***************************** 155 2019-06-28 10:23 48 ***************************** 156 2019-06-28 10:24 49 ****************************** ... ..( 16 skipped). .. ****************************** 173 2019-06-28 10:41 49 ****************************** 174 2019-06-28 10:42 50 ******************************* ... ..( 23 skipped). .. ******************************* 198 2019-06-28 11:06 50 ******************************* 199 2019-06-28 11:07 49 ****************************** ... ..( 3 skipped). .. ****************************** 203 2019-06-28 11:11 49 ****************************** SCT Error Recovery Control: Read: 70 (7.0 seconds) Write: 70 (7.0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 4 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 5 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 82963 Vendor specific
- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Shrinking this one down ...
@jukkaforss wrote:
sdb part 1
root@readynas:~# smartctl -x /dev/sdb ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 159 9 Power_On_Hours -O--CK 028 028 000 - 52621 Error 9 [8] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: UNC at LBA = 0x12fdf7710 = 5098141456 Error 8 [7] occurred at disk power-on lifetime: 50321 hours (2096 days + 17 hours) Error: WP at LBA = 0x12fdf7710 = 5098141456 Error 7 [6] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: UNC at LBA = 0xcf083840 = 3473422400 Error 6 [5] occurred at disk power-on lifetime: 41988 hours (1749 days + 12 hours) Error: WP at LBA = 0xcf083840 = 3473422400 Error 5 [4] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0c0 = 3894714560 Error 4 [3] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: WP at LBA = 0xe824a0b8 = 3894714552 Error 3 [2] occurred at disk power-on lifetime: 38618 hours (1609 days + 2 hours) Error: UNC at LBA = 0xe824a0b8 = 3894714552 Error 2 [1] occurred at disk power-on lifetime: 38615 hours (1608 days + 23 hours) Error: WP at LBA = 0x11616d4b8 = 4665562296
I saw a similar pattern on one of my WD60EFRX drives a while ago, and when I tested it with Lifeguard it failed. Though a second disk with the same pattern passed Lifeguard. So I recommend testing this disk (and perhaps replace it even if it does pass).
The most recent logged error was about 2000 hours ago (~ 3 months), so that particular error didn't cause the most recent crash. But I'm thinking that this disk likely triggered it anyway.
FWIW, I haven't seen any explanation of how to decode the raw read error rate. But it is quite a bit higher on this drive than your other ones.