  1. #1
    Join Date
    Mar 2009
    Posts
    2,323

    home kernel: [1920401.698026] ata1: lost interrupt (Status 0x50)

    Hi,

    I run CloudLinux with mdadm software RAID 1 on two hard drives.

    Yesterday I noticed that the server becomes slow during some operations, so I checked /var/log/messages.

    It shows:


    Apr 24 18:10:22 home kernel: [1920401.698026] ata1: lost interrupt (Status 0x50)
    Apr 24 18:10:22 home kernel: [1920401.698046] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
    Apr 24 18:10:22 home kernel: [1920401.699093] ata1.00: failed command: WRITE DMA EXT
    Apr 24 18:10:22 home kernel: [1920401.700126] ata1.00: cmd 35/00:00:00:8c:09/00:04:0c:00:00/e0 tag 0 dma 524288 out
    Apr 24 18:10:22 home kernel: [1920401.700127] res 40/00:ff:fd:04:b3/40:01:10:00:00/e0 Emask 0x4 (timeout)
    Apr 24 18:10:22 home kernel: [1920401.702144] ata1.00: status: { DRDY }
    Apr 24 18:10:22 home kernel: [1920401.703183] ata1: soft resetting link
    Apr 24 18:10:22 home kernel: [1920401.878301] ata1.00: configured for UDMA/33
    Apr 24 18:10:22 home kernel: [1920401.881354] ata1.01: configured for UDMA/133
    Apr 24 18:10:22 home kernel: [1920401.881369] ata1: EH complete



    Is this caused by a hard drive problem, or by the motherboard?


    Thanks
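
    When /var/log/messages contains many such bursts, it can help to tally them per port. Below is a minimal sketch, assuming Python is available on the box; the function name summarize and the two regular expressions are illustrative choices, and the sample text is a subset of the lines pasted above.

```python
import re
from collections import Counter

# A subset of the kernel lines quoted in this post.
LOG = """\
Apr 24 18:10:22 home kernel: [1920401.698026] ata1: lost interrupt (Status 0x50)
Apr 24 18:10:22 home kernel: [1920401.699093] ata1.00: failed command: WRITE DMA EXT
Apr 24 18:10:22 home kernel: [1920401.700127] res 40/00:ff:fd:04:b3/40:01:10:00:00/e0 Emask 0x4 (timeout)
Apr 24 18:10:22 home kernel: [1920401.878301] ata1.00: configured for UDMA/33
Apr 24 18:10:22 home kernel: [1920401.881354] ata1.01: configured for UDMA/133
"""

# Illustrative patterns for two kinds of libata messages seen above.
FAILED = re.compile(r"(ata\d+(?:\.\d+)?): failed command: (.+)$")
SPEED = re.compile(r"(ata\d+\.\d+): configured for (UDMA/\d+)")


def summarize(log_text):
    """Count failed commands per device and record the last negotiated speed."""
    failed = Counter()
    speeds = {}
    for line in log_text.splitlines():
        match = FAILED.search(line)
        if match:
            failed[(match.group(1), match.group(2).strip())] += 1
        match = SPEED.search(line)
        if match:
            speeds[match.group(1)] = match.group(2)
    return failed, speeds


failed, speeds = summarize(LOG)
for (device, command), count in failed.items():
    print(device, command, count)
for device, mode in speeds.items():
    print(device, mode)
```

    For the excerpt above this reports one failed WRITE DMA EXT on ata1.00, and shows ata1.00 configured for UDMA/33 while ata1.01 is configured for UDMA/133. The sketch does not map ata port names to /dev/sdX names.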

  2. #2
    Join Date
    Nov 2004
    Location
    Switzerland
    Posts
    810
    Hi,

    We need more details:

    - Which OS are you using? What is the kernel version?
    (uname -a would do)

    - We need details about these drives (speed, brand, ...). You can run hdparm -I /dev/sda, and the same for /dev/sdb.

    It may be just a software bug, but it is safer to make some backups even though you have RAID.
    .:. Enterprise SAN Consultant .:.

  3. #3
    Join Date
    Dec 2005
    Location
    Berkshire, UK
    Posts
    3,070
    It's a timeout during a write operation, which is usually a bad sign.

    First of all, check the health of your drives.

    If you have two drives, sda and sdb, run:

    smartctl --all /dev/sda
    smartctl --all /dev/sdb

    If you are not sure what all the values mean, paste the output here and we can help.
    SmartServerMangement (SSM)
    Specialists in SolusVM, VPS & cPanel Server Management

  4. #4
    Join Date
    Mar 2009
    Posts
    2,323
    Hi,

    - Which OS are you using? What is the kernel version?
    CloudLinux,
    2.6.32-231.21.1.lve0.9.18.1.x86_64 #1 SMP Thu Jan 5 06:59:41 EST 2012 x86_64 x86_64 x86_64 GNU/Linux



    - We need details about these drives (speed, brand...)? (you can do hdparm -I /dev/sda and same for /dev/sdb)


    root@home01 [~]# hdparm -I /dev/sda

    /dev/sda:

    ATA device, with non-removable media
    Model Number: WDC WD5002AALX-00J37A0
    Serial Number: WD-WCAYUS622787
    Firmware Revision: 15.01H15
    Transport: Serial, SATA 1.0a, SATA II Extensions, SATA Rev 2.5, SATA Rev 2.6, SATA Rev 3.0
    Standards:
    Supported: 8 7 6 5
    Likely used: 8
    Configuration:
    Logical max current
    cylinders 16383 16383
    heads 16 16
    sectors/track 63 63
    --
    CHS current addressable sectors: 16514064
    LBA user addressable sectors: 268435455
    LBA48 user addressable sectors: 976771055
    Logical/Physical Sector size: 512 bytes
    device size with M = 1024*1024: 476938 MBytes
    device size with M = 1000*1000: 500106 MBytes (500 GB)
    cache/buffer size = unknown
    Capabilities:
    LBA, IORDY(can be disabled)
    Queue depth: 32
    Standby timer values: spec'd by Standard, with device specific minimum
    R/W multiple sector transfer: Max = 16 Current = 0
    DMA: mdma0 mdma1 mdma2 udma0 udma1 *udma2 udma3 udma4 udma5 udma6
    Cycle time: min=120ns recommended=120ns
    PIO: pio0 pio1 pio2 pio3 pio4
    Cycle time: no flow control=120ns IORDY flow control=120ns
    Commands/features:
    Enabled Supported:
    * SMART feature set
    Security Mode feature set
    * Power Management feature set
    * Write cache
    * Look-ahead
    * Host Protected Area feature set
    * WRITE_BUFFER command
    * READ_BUFFER command
    * NOP cmd
    * DOWNLOAD_MICROCODE
    Power-Up In Standby feature set
    * SET_FEATURES required to spinup after power up
    SET_MAX security extension
    * 48-bit Address feature set
    * Device Configuration Overlay feature set
    * Mandatory FLUSH_CACHE
    * FLUSH_CACHE_EXT
    * SMART error logging
    * SMART self-test
    * General Purpose Logging feature set
    * 64-bit World wide name
    * {READ,WRITE}_DMA_EXT_GPL commands
    * Segmented DOWNLOAD_MICROCODE
    * Gen1 signaling speed (1.5Gb/s)
    * Gen2 signaling speed (3.0Gb/s)
    * unknown 76[3]
    * Native Command Queueing (NCQ)
    * Host-initiated interface power management
    * Phy event counters
    * NCQ priority information
    DMA Setup Auto-Activate optimization
    * Software settings preservation
    * SMART Command Transport (SCT) feature set
    * SCT Long Sector Access (AC1)
    * SCT LBA Segment Access (AC2)
    * SCT Features Control (AC4)
    * SCT Data Tables (AC5)
    unknown 206[12] (vendor specific)
    unknown 206[13] (vendor specific)
    Security:
    Master password revision code = 65534
    supported
    not enabled
    not locked
    not frozen
    not expired: security count
    supported: enhanced erase
    92min for SECURITY ERASE UNIT. 92min for ENHANCED SECURITY ERASE UNIT.
    Logical Unit WWN Device Identifier: 50014ee104057220
    NAA : 5
    IEEE OUI : 0014ee
    Unique ID : 104057220
    Checksum: correct





    root@home01 [~]# hdparm -I /dev/sdb

    /dev/sdb:

    ATA device, with non-removable media
    Model Number: WDC WD5002AALX-00J37A0
    Serial Number: WD-WCAYUS557292
    Firmware Revision: 15.01H15
    Transport: Serial, SATA 1.0a, SATA II Extensions, SATA Rev 2.5, SATA Rev 2.6, SATA Rev 3.0
    Standards:
    Supported: 8 7 6 5
    Likely used: 8
    Configuration:
    Logical max current
    cylinders 16383 16383
    heads 16 16
    sectors/track 63 63
    --
    CHS current addressable sectors: 16514064
    LBA user addressable sectors: 268435455
    LBA48 user addressable sectors: 976771055
    Logical/Physical Sector size: 512 bytes
    device size with M = 1024*1024: 476938 MBytes
    device size with M = 1000*1000: 500106 MBytes (500 GB)
    cache/buffer size = unknown
    Capabilities:
    LBA, IORDY(can be disabled)
    Queue depth: 32
    Standby timer values: spec'd by Standard, with device specific minimum
    R/W multiple sector transfer: Max = 16 Current = 16
    DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6
    Cycle time: min=120ns recommended=120ns
    PIO: pio0 pio1 pio2 pio3 pio4
    Cycle time: no flow control=120ns IORDY flow control=120ns
    Commands/features:
    Enabled Supported:
    * SMART feature set
    Security Mode feature set
    * Power Management feature set
    * Write cache
    * Look-ahead
    * Host Protected Area feature set
    * WRITE_BUFFER command
    * READ_BUFFER command
    * NOP cmd
    * DOWNLOAD_MICROCODE
    Power-Up In Standby feature set
    * SET_FEATURES required to spinup after power up
    SET_MAX security extension
    * 48-bit Address feature set
    * Device Configuration Overlay feature set
    * Mandatory FLUSH_CACHE
    * FLUSH_CACHE_EXT
    * SMART error logging
    * SMART self-test
    * General Purpose Logging feature set
    * 64-bit World wide name
    * {READ,WRITE}_DMA_EXT_GPL commands
    * Segmented DOWNLOAD_MICROCODE
    * Gen1 signaling speed (1.5Gb/s)
    * Gen2 signaling speed (3.0Gb/s)
    * unknown 76[3]
    * Native Command Queueing (NCQ)
    * Host-initiated interface power management
    * Phy event counters
    * NCQ priority information
    DMA Setup Auto-Activate optimization
    * Software settings preservation
    * SMART Command Transport (SCT) feature set
    * SCT Long Sector Access (AC1)
    * SCT LBA Segment Access (AC2)
    * SCT Features Control (AC4)
    * SCT Data Tables (AC5)
    unknown 206[12] (vendor specific)
    unknown 206[13] (vendor specific)
    Security:
    Master password revision code = 65534
    supported
    not enabled
    not locked
    not frozen
    not expired: security count
    supported: enhanced erase
    92min for SECURITY ERASE UNIT. 92min for ENHANCED SECURITY ERASE UNIT.
    Logical Unit WWN Device Identifier: 50014ee10407bd53
    NAA : 5
    IEEE OUI : 0014ee
    Unique ID : 10407bd53
    Checksum: correct




    Thanks

  5. #5
    Join Date
    Mar 2009
    Posts
    2,323
    Hi,


    root@home01 [~]# smartctl --all /dev/sda
    -bash: smartctl: command not found


    root@home01 [~]# yum install smartctl
    Loaded plugins: fastestmirror, rhnplugin
    Loading mirror speeds from cached hostfile
    * cloudlinux-x86_64-server-6: xmlrpc.cln.cloudlinux.com
    cloudlinux-x86_64-server-6 | 1.0 kB 00:00
    Setting up Install Process
    No package smartctl available.
    Error: Nothing to do
    root@home01 [~]# smartctl --all /dev/sda
    -bash: smartctl: command not found



    Thanks

  6. #6
    Join Date
    Mar 2009
    Posts
    2,323
    Hi,


    root@home01 [~]# smartctl --all /dev/sda
    smartctl 5.39.1 2010-01-28 r3054 [x86_64-unknown-linux-gnu] (local build)
    Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

    === START OF INFORMATION SECTION ===
    Device Model: WDC WD5002AALX-00J37A0
    Serial Number: WD-WCAYUS622787
    Firmware Version: 15.01H15
    User Capacity: 500,106,780,160 bytes
    Device is: Not in smartctl database [for details use: -P showall]
    ATA Version is: 8
    ATA Standard is: Exact ATA specification draft version not indicated
    Local Time is: Sat Apr 28 09:22:40 2012 CST
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x80) Offline data collection activity
    was never started.
    Auto Offline Data Collection: Enabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: (9180) seconds.
    Offline data collection
    capabilities: (0x7b) SMART execute Offline immediate.
    Auto Offline data collection on/off support.
    Suspend Offline collection upon new
    command.
    Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 2) minutes.
    Extended self-test routine
    recommended polling time: ( 93) minutes.
    Conveyance self-test routine
    recommended polling time: ( 5) minutes.
    SCT capabilities: (0x3037) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
    3 Spin_Up_Time 0x0027 192 138 021 Pre-fail Always - 1358
    4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 340
    5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
    7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
    9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 667
    10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
    11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
    12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 338
    192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 237
    193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 102
    194 Temperature_Celsius 0x0022 111 101 000 Old_age Always - 32
    196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
    197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
    200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0

    SMART Error Log Version: 1
    ATA Error Count: 12 (device log contains only the most recent five errors)
    CR = Command Register [HEX]
    FR = Features Register [HEX]
    SC = Sector Count Register [HEX]
    SN = Sector Number Register [HEX]
    CL = Cylinder Low Register [HEX]
    CH = Cylinder High Register [HEX]
    DH = Device/Head Register [HEX]
    DC = Device Command Register [HEX]
    ER = Error register [HEX]
    ST = Status register [HEX]
    Powered_Up_Time is measured from power on, and printed as
    DDd+hh:mmS.sss where DD=days, hh=hours, mm=minutes,
    SS=sec, and sss=millisec. It "wraps" after 49.710 days.

    Error 12 occurred at disk power-on lifetime: 64 hours (2 days + 16 hours)
    When the command that caused the error occurred, the device was active or idle.

    After command completion occurred, registers were:
    ER ST SC SN CL CH DH
    -- -- -- -- -- -- --
    40 51 08 77 d8 f6 e8 Error: UNC 8 sectors at LBA = 0x08f6d877 = 150394999

    Commands leading to the command that caused the error were:
    CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
    -- -- -- -- -- -- -- -- ---------------- --------------------
    c8 00 08 70 d8 f6 e8 08 23:39:19.319 READ DMA
    ec 00 00 00 00 00 a0 08 23:39:19.311 IDENTIFY DEVICE
    ef 03 46 00 00 00 a0 08 23:39:19.309 SET FEATURES [Set transfer mode]

    Error 11 occurred at disk power-on lifetime: 64 hours (2 days + 16 hours)
    When the command that caused the error occurred, the device was active or idle.

    After command completion occurred, registers were:
    ER ST SC SN CL CH DH
    -- -- -- -- -- -- --
    40 51 08 77 d8 f6 e8 Error: UNC 8 sectors at LBA = 0x08f6d877 = 150394999

    Commands leading to the command that caused the error were:
    CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
    -- -- -- -- -- -- -- -- ---------------- --------------------
    c8 00 08 70 d8 f6 e8 08 23:39:17.667 READ DMA
    ec 00 00 00 00 00 a0 08 23:39:17.662 IDENTIFY DEVICE
    ef 03 46 00 00 00 a0 08 23:39:17.661 SET FEATURES [Set transfer mode]

    Error 10 occurred at disk power-on lifetime: 64 hours (2 days + 16 hours)
    When the command that caused the error occurred, the device was active or idle.

    After command completion occurred, registers were:
    ER ST SC SN CL CH DH
    -- -- -- -- -- -- --
    40 51 08 77 d8 f6 e8 Error: UNC 8 sectors at LBA = 0x08f6d877 = 150394999

    Commands leading to the command that caused the error were:
    CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
    -- -- -- -- -- -- -- -- ---------------- --------------------
    c8 00 08 70 d8 f6 e8 08 23:39:16.021 READ DMA
    ec 00 00 00 00 00 a0 08 23:39:16.013 IDENTIFY DEVICE
    ef 03 46 00 00 00 a0 08 23:39:16.013 SET FEATURES [Set transfer mode]

    Error 9 occurred at disk power-on lifetime: 64 hours (2 days + 16 hours)
    When the command that caused the error occurred, the device was active or idle.

    After command completion occurred, registers were:
    ER ST SC SN CL CH DH
    -- -- -- -- -- -- --
    40 51 08 77 d8 f6 e8 Error: UNC 8 sectors at LBA = 0x08f6d877 = 150394999

    Commands leading to the command that caused the error were:
    CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
    -- -- -- -- -- -- -- -- ---------------- --------------------
    c8 00 08 70 d8 f6 e8 08 23:39:14.372 READ DMA
    ec 00 00 00 00 00 a0 08 23:39:14.365 IDENTIFY DEVICE
    ef 03 46 00 00 00 a0 08 23:39:14.365 SET FEATURES [Set transfer mode]

    Error 8 occurred at disk power-on lifetime: 64 hours (2 days + 16 hours)
    When the command that caused the error occurred, the device was active or idle.

    After command completion occurred, registers were:
    ER ST SC SN CL CH DH
    -- -- -- -- -- -- --
    40 51 08 77 d8 f6 e8 Error: UNC 8 sectors at LBA = 0x08f6d877 = 150394999

    Commands leading to the command that caused the error were:
    CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
    -- -- -- -- -- -- -- -- ---------------- --------------------
    c8 00 08 70 d8 f6 e8 08 23:39:12.709 READ DMA
    ec 00 00 00 00 00 a0 08 23:39:12.703 IDENTIFY DEVICE
    ef 03 46 00 00 00 a0 08 23:39:12.700 SET FEATURES [Set transfer mode]

    SMART Self-test log structure revision number 1
    No self-tests have been logged. [To run self-tests, use: smartctl -t]


    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.







    root@home01 [~]# smartctl --all /dev/sdb
    smartctl 5.39.1 2010-01-28 r3054 [x86_64-unknown-linux-gnu] (local build)
    Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

    === START OF INFORMATION SECTION ===
    Device Model: WDC WD5002AALX-00J37A0
    Serial Number: WD-WCAYUS557292
    Firmware Version: 15.01H15
    User Capacity: 500,106,780,160 bytes
    Device is: Not in smartctl database [for details use: -P showall]
    ATA Version is: 8
    ATA Standard is: Exact ATA specification draft version not indicated
    Local Time is: Sat Apr 28 09:23:25 2012 CST
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x84) Offline data collection activity
    was suspended by an interrupting command from host.
    Auto Offline Data Collection: Enabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: (9060) seconds.
    Offline data collection
    capabilities: (0x7b) SMART execute Offline immediate.
    Auto Offline data collection on/off support.
    Suspend Offline collection upon new
    command.
    Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 2) minutes.
    Extended self-test routine
    recommended polling time: ( 92) minutes.
    Conveyance self-test routine
    recommended polling time: ( 5) minutes.
    SCT capabilities: (0x3037) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
    3 Spin_Up_Time 0x0027 140 140 021 Pre-fail Always - 3958
    4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 161
    5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
    7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
    9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 666
    10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
    11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
    12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 159
    192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 64
    193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 96
    194 Temperature_Celsius 0x0022 111 102 000 Old_age Always - 32
    196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
    197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 1
    200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log structure revision number 1
    No self-tests have been logged. [To run self-tests, use: smartctl -t]


    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.







    Thanks
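
    To compare the two reports quickly, the counters that most often matter can be pulled out of the text. This is only a sketch: the choice of watched attributes and the helper names (parse_attributes, ata_error_count, findings) are assumptions made for illustration, not part of smartctl. The sample rows are copied from the two reports above.

```python
import re

# Attributes picked for this sketch; the selection is an assumption.
WATCHED = (
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
    "UDMA_CRC_Error_Count",
)

SDA = """\
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
ATA Error Count: 12 (device log contains only the most recent five errors)
"""

SDB = """\
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 1
No Errors Logged
"""


def parse_attributes(report):
    """Map attribute name to raw value for rows of the attribute table."""
    attrs = {}
    for line in report.splitlines():
        fields = line.split()
        # Table rows have ten columns: ID, name, flag, value, worst,
        # thresh, type, updated, when_failed, raw value.
        if len(fields) >= 10 and fields[0].isdigit() and fields[2].startswith("0x"):
            try:
                attrs[fields[1]] = int(fields[9])
            except ValueError:
                pass  # skip raw values that are not plain integers
    return attrs


def ata_error_count(report):
    """Return the logged ATA error count, or 0 if the line is absent."""
    match = re.search(r"ATA Error Count:\s*(\d+)", report)
    return int(match.group(1)) if match else 0


def findings(report):
    """List watched counters that are non-zero, plus the error-log count."""
    attrs = parse_attributes(report)
    found = [f"{name} = {attrs[name]}" for name in WATCHED if attrs.get(name, 0) > 0]
    errors = ata_error_count(report)
    if errors:
        found.append(f"ATA Error Count = {errors}")
    return found


print("sda:", findings(SDA))
print("sdb:", findings(SDB))
```

    On these reports the sketch flags the 12 logged ATA errors on sda (the five shown are UNC read errors at power-on hour 64, with the drive now at 667 hours) and the single UDMA CRC error on sdb. All watched attribute counters on sda are still zero, so the error log is the only place the read failures show up.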

  7. #7
    Join Date
    Apr 2009
    Location
    whitehouse
    Posts
    642
    Looks like you have a dying sda drive. It would be good to get it replaced to be on the safe side.

    James B
    Ezeelogin | Set up your Secure Linux SSH Gateway.
    | Manage & Administer Multiple Linux Servers Quickly & Securely.

  8. #8
    Join Date
    Mar 2009
    Posts
    2,323
    Hi, there are several partitions on the two drives. Is it possible to find out which partition has the issue, e.g. sda1 or sda2? Thanks

  9. #9
    Join Date
    Dec 2005
    Location
    Berkshire, UK
    Posts
    3,070
    Not easily. It's possible your data is still OK at this point, since there are no reallocated sectors on the drive; however, you need to replace it ASAP as it is failing.
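
    One possible way to narrow it down is to compare the LBA recorded in the SMART error log (0x08f6d877 = 150394999) with the partition boundaries. The sketch below first rebuilds that LBA from the logged registers (SN, CL, CH, DH), then looks it up in a partition table. The partition layout in EXAMPLE_PARTITIONS is entirely hypothetical, since the thread never shows the real one; on a real system the start sector and size of each partition can be read from /sys/block/sda/sdaN/start and /sys/block/sda/sdaN/size. The function names are illustrative as well.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Rebuild a 28-bit LBA from the ATA task-file registers.

    The low nibble of Device/Head holds LBA bits 24-27, followed by
    Cylinder High, Cylinder Low and Sector Number.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


def find_partition(lba, partitions):
    """Return the name of the partition containing lba, or None."""
    for name, start, size in partitions:
        if start <= lba < start + size:
            return name
    return None


# HYPOTHETICAL layout as (name, start sector, size in sectors).
# Replace with the values from your own disk before drawing conclusions.
EXAMPLE_PARTITIONS = [
    ("sda1", 2048, 1048576),
    ("sda2", 1050624, 16777216),
    ("sda3", 17827840, 900000000),
]

# Registers from the error log line "40 51 08 77 d8 f6 e8":
# SN = 0x77, CL = 0xd8, CH = 0xf6, DH = 0xe8.
bad_lba = lba28_from_registers(0x77, 0xD8, 0xF6, 0xE8)
print(bad_lba, find_partition(bad_lba, EXAMPLE_PARTITIONS))
```

    With the hypothetical layout above the sector would fall in the third partition; with the real layout the answer may differ. Note that this identifies a location on the raw disk, not an offset inside the md device built on top of it.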
    SmartServerMangement (SSM)
    Specialists in SolusVM, VPS & cPanel Server Management

  10. #10
    Join Date
    Mar 2009
    Posts
    2,323
    So sda has a problem now and sdb is fine, correct? Thanks
