NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
InterClaw
Jan 09, 2013Aspirant
CIFS freezes, SMART errors the culprit?
I was gonna post about having freezes when copying files around using CIFS, but then I checked the SMART data of the drives and I think there's been some changes since last time I did that. I don't know how concerned I should be, but could potentially one or two of my drives be acting up and that's what's causing the freezes?
Just fyi also, the behavior I'm experiencing is that I'm trying to copy a folder of archives from one share to another and it manages to write one file/subfolder and then freezes the process. Eventually the file copy dialog times out (try again/skip/cancel style). My thought was that maybe it's having problems actually writing even that first file to disk due to problems with the disk(s) or something...
I should also mention that all disks are flagged green and that the NAS is over 90% full. I've disabled Transmission. I've also recently had a problem with a complete freeze of the system and forced reboot with restriping - which has also hanged like 3 times. (I have a new Pro 6 that I'm setting up, but I'm not done with it yet. I have a few things I want to try out on it without any data before I migrate over to it.)
Anyway, here's the SMART data. Any help analyzing the health of these drives would be greatly appreciated. I've installed these drives as 3, then 1, then 2. That explains the model numbers etc. There also used to be a power saving feature for these drives that stopped them, which explains those numbers for the first three drives. The numbers I'm concerned about are marked below:
SMART Information for Disk 1
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 13
Spin Up Time 9083
Start Stop Count 703
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27689
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 91
Power-Off Retract Count 55
Load Cycle Count 1301
Temperature Celsius 40
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 2
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9341
Start Stop Count 614
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27696
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 87
Power-Off Retract Count 50
Load Cycle Count 1180
Temperature Celsius 43
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 3
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 89994
Spin Up Time 9200
Start Stop Count 613
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27673
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 88
Power-Off Retract Count 52
Load Cycle Count 2826
Temperature Celsius 42
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 4
Model: WDC WD2002FYPS-01U1B1
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G05
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9300
Start Stop Count 81
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 23957
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 80
Power-Off Retract Count 51
Load Cycle Count 63
Temperature Celsius 40
Reallocated Event Count 0
Current Pending Sector 27
Offline Uncorrectable 13
UDMA CRC Error Count 0
Multi Zone Error Rate 748
ATA Error Count 0
SMART Information for Disk 5
Model: WDC WD2002FYPS-02W3B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.01G01
SMART Attribute
Raw Read Error Rate 1
Spin Up Time 9641
Start Stop Count 53
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 15082
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 52
Power-Off Retract Count 39
Load Cycle Count 37
Temperature Celsius 37
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 6
Model: WDC WD2002FYPS-02W3B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.01G01
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9791
Start Stop Count 44
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 14850
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 43
Power-Off Retract Count 31
Load Cycle Count 26
Temperature Celsius 36
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
Just fyi also, the behavior I'm experiencing is that I'm trying to copy a folder of archives from one share to another and it manages to write one file/subfolder and then freezes the process. Eventually the file copy dialog times out (try again/skip/cancel style). My thought was that maybe it's having problems actually writing even that first file to disk due to problems with the disk(s) or something...
I should also mention that all disks are flagged green and that the NAS is over 90% full. I've disabled Transmission. I've also recently had a problem with a complete freeze of the system and forced reboot with restriping - which has also hanged like 3 times. (I have a new Pro 6 that I'm setting up, but I'm not done with it yet. I have a few things I want to try out on it without any data before I migrate over to it.)
Anyway, here's the SMART data. Any help analyzing the health of these drives would be greatly appreciated. I've installed these drives as 3, then 1, then 2. That explains the model numbers etc. There also used to be a power saving feature for these drives that stopped them, which explains those numbers for the first three drives. The numbers I'm concerned about are marked below:
SMART Information for Disk 1
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 13
Spin Up Time 9083
Start Stop Count 703
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27689
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 91
Power-Off Retract Count 55
Load Cycle Count 1301
Temperature Celsius 40
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 2
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9341
Start Stop Count 614
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27696
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 87
Power-Off Retract Count 50
Load Cycle Count 1180
Temperature Celsius 43
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 3
Model: WDC WD2002FYPS-01U1B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G04
SMART Attribute
Raw Read Error Rate 89994
Spin Up Time 9200
Start Stop Count 613
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 27673
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 88
Power-Off Retract Count 52
Load Cycle Count 2826
Temperature Celsius 42
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 4
Model: WDC WD2002FYPS-01U1B1
Serial: WD-WCAVYxxxxxxx
Firmware: 04.05G05
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9300
Start Stop Count 81
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 23957
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 80
Power-Off Retract Count 51
Load Cycle Count 63
Temperature Celsius 40
Reallocated Event Count 0
Current Pending Sector 27
Offline Uncorrectable 13
UDMA CRC Error Count 0
Multi Zone Error Rate 748
ATA Error Count 0
SMART Information for Disk 5
Model: WDC WD2002FYPS-02W3B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.01G01
SMART Attribute
Raw Read Error Rate 1
Spin Up Time 9641
Start Stop Count 53
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 15082
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 52
Power-Off Retract Count 39
Load Cycle Count 37
Temperature Celsius 37
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
SMART Information for Disk 6
Model: WDC WD2002FYPS-02W3B0
Serial: WD-WCAVYxxxxxxx
Firmware: 04.01G01
SMART Attribute
Raw Read Error Rate 0
Spin Up Time 9791
Start Stop Count 44
Reallocated Sector Count 0
Seek Error Rate 0
Power On Hours 14850
Spin Retry Count 0
Calibration Retry Count 0
Power Cycle Count 43
Power-Off Retract Count 31
Load Cycle Count 26
Temperature Celsius 36
Reallocated Event Count 0
Current Pending Sector 0
Offline Uncorrectable 0
UDMA CRC Error Count 0
Multi Zone Error Rate 0
ATA Error Count 0
4 Replies
Replies have been turned off for this discussion
- StephenBGuru - Experienced UserI'd update my backups and shift files to the new Pro before I replaced any disks on this one. With two suspect disks you'd be putting the volume at risk when you replace/resync.
Also, 90% full could explain a lot of performance problems. Hopefully your new Pro will have bigger drives.
Once you've taken care of the backup, I'd replace disk 4, but just watch disk 3 for now. The format for read error rate is vendor specific, so it is hard to understand the significance of the raw number. The load cycle count is unexplained, but still it is well below the drive specs. But current pending sector count is the "read" equivalent of a reallocated sector - the read attempt failed. If a write on the same sector had failed it would have been reallocated. (on a read, you don't know what the data is, so there is no point in reallocating). Offline uncorrectable counts are just as bad. - agreed with stephen, it looks like potentially 2 separate issues;
-90% free space is getting tight, the way linux allocates extents causes performance degradation as free space gets lower
- disk 4 is starting to fail, and when a sector read/write fails essentially the disk io is frozen for however many milliseconds while the drive tries to recover/remap the sector
note the free space freeze tends to occur at the start of the file copy, when the space is being allocated and is most notable when copying large multigig zip/iso/mkv type files, while sector problems can occur any time during a file copy. - InterClawAspirantThanks for your tips!
- InterClawAspirantI've moved my data over and the new NAS (running 3TB WD Reds, WD30EFRX) seems to be doing very well. :) So I though I'd initiate the first RMA on the 4th drive.
The problem is the etailer where I bought it does not carry the RE4-GP WD2002FYPS anymore. I'm guessing my replacement drive(s) will be the RE4 WD2003FYYS instead. These have the standard 7200 rpm spindle speed though, whereas the GP drives spin at like 5900 rpm.
Do you think I'd have problems with a combination of WD2002FYPS and WD2003FYYS?
Btw, both types are on the Pro Pioneer HCL though (if you look past the note about firmware on the FYPS... but that's another story...).
Related Content
- Jan 17, 2024Retired_Member
NETGEAR Academy
Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!