SAS Hard Disk 15K Rpm On CentOS 5.8

Home » CentOS » SAS Hard Disk 15K Rpm On CentOS 5.8
CentOS 1 Comment

Hi,

I get these below information. Please help me understand about “SMART Health Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5 [asc], ascq=5]” and what does that error mean. Does it mean that the SAS disk 0 is failing and has serious issues and needs to be replaced and also do i need to run health status test with different options/flags or switches.

/usr/sbin/smartctl -d megaraid,0 -a /dev/sda smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.18-308.1.1.el5] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor: SEAGATE
Product: ST3300657SS
Revision: ES65
User Capacity: 300,000,000,000 bytes [300 GB]
Logical block size: 512 bytes Logical Unit id: 0x5000c500477d63bb Serial number: 6SJ46RZB
Device type: disk Transport protocol: SAS
Local Time is: Fri Jul 27 06:50:49 2012 IST
Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5
[asc], ascq=5]

Current Drive Temperature: 33 C
Drive Trip Temperature: 68 C
Elements in grown defect list: 2048
Vendor (Seagate) cache information
Blocks sent to initiator = 2518358542
Blocks received from initiator = 3507177513
Blocks read from cache and sent to initiator = 19620730
Number of read and write commands whose size Number of read and write commands whose size > segment size = 103
Vendor (Seagate/Hitachi) factory information
number of hours powered up = 3903.10
number of minutes until next internal SMART test = 32

Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9
bytes] errors read: 7599922 9416 0 7609338 7609338
360.930 0
write: 0 0 0 0 0
1851.172 0
verify: 96309427 1108629 0 97418056 97419201
62032.969 3455

Non-medium error count: 3

SMART Self-test log Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Completed 32 1
– [- – -]
# 2 Background short Completed 32 1
– [- – -]

Long (extended) Self Test duration: 3200 seconds [53.3 minutes]
[root@~]# /usr/sbin/smartctl -d megaraid,1 -a /dev/sda smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.18-308.1.1.el5] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor: SEAGATE
Product: ST3300657SS
Revision: ES65
User Capacity: 300,000,000,000 bytes [300 GB]
Logical block size: 512 bytes Logical Unit id: 0x5000c500477d49fb Serial number: 6SJ44JEH
Device type: disk Transport protocol: SAS
Local Time is: Fri Jul 27 06:50:55 2012 IST
Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: OK

Current Drive Temperature: 32 C
Drive Trip Temperature: 68 C
Elements in grown defect list: 0
Vendor (Seagate) cache information
Blocks sent to initiator = 1288378070
Blocks received from initiator = 3519302272
Blocks read from cache and sent to initiator = 19977846
Number of read and write commands whose size Number of read and write commands whose size > segment size = 102
Vendor (Seagate/Hitachi) factory information
number of hours powered up = 3903.00
number of minutes until next internal SMART test = 32

Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9
bytes] errors read: 16718097 0 0 16718097 16718097
352.119 0
write: 0 0 0 0 0
1810.643 0
verify: 197332427 0 0 197332427 197332427
5156.549 0

Non-medium error count: 0

SMART Self-test log Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Completed 32 1
– [- – -]
# 2 Background short Completed 32 1
– [- – -]

Long (extended) Self Test duration: 3200 seconds [53.3 minutes]

Regards

Kaushal

One thought on - SAS Hard Disk 15K Rpm On CentOS 5.8

  • From: Kaushal Shriyan

    The way I understand it:
    – Disks have errors.
    – After a defined number of errors, the disk is declared as having a “predictive failure”. Not sure if it is only the spare space getting almost fully used, or if it is more statistical…

    JD

LEAVE A COMMENT