smartmontools ---haldoklom?

 ( bzs | 2017. május 26., péntek - 10:44 )

Sziasztok.

Mezei ubuntu reggeli kávém mellé azzal fogadott, hogy a hard disk health status megváltozott. Ez feldobta a napomat.
Kérdés:
Szerintetek is egyértelmű, mit kell tennem?

# smartctl --all /dev/sda
smartctl 6.4 2014-10-07 r4002 [i686-linux-4.2.0-42-lowlatency] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family: Toshiba 2.5" HDD MQ01ABD...
Device Model: TOSHIBA MQ01ABD075
Serial Number: 73GES10IS
LU WWN Device Id: 5 000039 4e5c864c8
Firmware Version: AX0R2J
User Capacity: 750.156.374.016 bytes [750 GB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Form Factor: 2.5 inches
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Fri May 26 10:41:51 2017 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 120) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 190) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0
2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0
3 Spin_Up_Time 0x0027 100 100 001 Pre-fail Always - 1671
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 2836
5 Reallocated_Sector_Ct 0x0033 100 100 050 Pre-fail Always - 0
7 Seek_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 100 100 050 Pre-fail Offline - 0
9 Power_On_Hours 0x0032 085 085 000 Old_age Always - 6070
10 Spin_Retry_Count 0x0033 156 100 030 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 2703
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 1462
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 288
193 Load_Cycle_Count 0x0032 094 094 000 Old_age Always - 60820
194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 28 (Min/Max 8/48)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 27
220 Disk_Shift 0x0002 100 100 000 Old_age Always - 0
222 Loaded_Hours 0x0032 089 089 000 Old_age Always - 4755
223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 0
224 Load_Friction 0x0022 100 100 000 Old_age Always - 0
226 Load-in_Time 0x0026 100 100 000 Old_age Always - 184
240 Head_Flying_Hours 0x0001 100 100 001 Pre-fail Offline - 0

SMART Error Log Version: 1
ATA Error Count: 27 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 27 occurred at disk power-on lifetime: 2815 hours (117 days + 7 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 c0 98 08 8d e0 Error: ICRC, ABRT 192 sectors at LBA = 0x008d0898 = 9242776

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 f0 68 08 8d e0 00 06:39:33.481 READ DMA EXT
25 00 10 58 08 8d e0 00 06:39:33.480 READ DMA EXT
25 00 f0 68 07 8d e0 00 06:39:33.477 READ DMA EXT
25 00 18 50 07 8d e0 00 06:39:33.463 READ DMA EXT
25 00 e8 60 06 8d e0 00 06:39:33.450 READ DMA EXT

Error 26 occurred at disk power-on lifetime: 2815 hours (117 days + 7 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 20 d0 c1 7f e0 Error: ICRC, ABRT 32 sectors at LBA = 0x007fc1d0 = 8372688

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 f0 00 c1 7f e0 00 06:38:46.490 READ DMA EXT
25 00 c8 38 c0 7f e0 00 06:38:46.478 READ DMA EXT
25 00 30 00 c0 7f e0 00 06:38:46.477 READ DMA EXT
25 00 10 f0 bf 7f e0 00 06:38:46.467 READ DMA EXT
25 00 f0 00 bf 7f e0 00 06:38:46.464 READ DMA EXT

Error 25 occurred at disk power-on lifetime: 2815 hours (117 days + 7 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 c0 78 54 67 e0 Error: ICRC, ABRT 192 sectors at LBA = 0x00675478 = 6771832

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 f0 48 54 67 e0 00 06:37:02.270 READ DMA EXT
25 00 10 38 54 67 e0 00 06:37:02.264 READ DMA EXT
25 00 f0 48 53 67 e0 00 06:37:02.257 READ DMA EXT
25 00 10 38 53 67 e0 00 06:37:02.248 READ DMA EXT
25 00 f0 48 52 67 e0 00 06:37:02.244 READ DMA EXT

Error 24 occurred at disk power-on lifetime: 2766 hours (115 days + 6 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 10 a0 71 56 e0 Error: ICRC, ABRT 16 sectors at LBA = 0x005671a0 = 5665184

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 f0 c0 70 56 e0 00 01:11:21.833 READ DMA EXT
25 00 10 b0 70 56 e0 00 01:11:20.640 READ DMA EXT
25 00 f0 c0 6f 56 e0 00 01:11:20.636 READ DMA EXT
25 00 10 b0 6f 56 e0 00 01:11:19.443 READ DMA EXT
25 00 f0 c0 6e 56 e0 00 01:11:19.438 READ DMA EXT

Error 23 occurred at disk power-on lifetime: 2766 hours (115 days + 6 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 40 40 b1 55 e0 Error: ICRC, ABRT 64 sectors at LBA = 0x0055b140 = 5615936

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 f0 90 b0 55 e0 00 01:05:58.805 READ DMA EXT
25 00 10 80 b0 55 e0 00 01:05:57.412 READ DMA EXT
25 00 f0 90 af 55 e0 00 01:05:57.408 READ DMA EXT
25 00 10 80 af 55 e0 00 01:05:56.224 READ DMA EXT
25 00 f0 90 ae 55 e0 00 01:05:56.219 READ DMA EXT

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Hozzászólás megjelenítési lehetőségek

A választott hozzászólás megjelenítési mód a „Beállítás” gombbal rögzíthető.

UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 27
mivel van egy ilyened és mivel a crc nem mindig ugyan annál a szektornál hibázik én azt mondanám, hogy cseréld ki a SATA kábelt. Ha ez nem segít dugd másik olyan sata portba az alaplapon amihez másik vezérlő tartozik (ha van ilyen) és próbáld ki úgy.
Ha továbbra is mókázik akkor valószínűleg a lemez lesz kuka, abban is a vezérlő. Mentésed ugye van? :)

Mindenről, a felhőben is van másolat (jelszavak nélkül persze...)
Kicserélem a kábelt. (Lenovo R400)

Csinálj backupot a fontos adatokról, de szerintem ez a vinyó még simán használható. Igazából azt sem értem, hogy mire fel jelzett most az ubuntud.

FathoM

Én sem értem, 2-3 havonta rendszertelenül 1 alkalommal felugrik egy alertboxban. Indításnál GUI indul, ha nem így lenne, kíváncsi lennék, paranccsorban is van-e valami előbukkanás mondjuk bootoláskor.
logokban nem tudom, miben, mit nézzek.