SMART adatok értelmezése

SMART adatok értelmezése

Hozzászólások

ez nem scsi? mert akkor nézd meg az "scsiinfo -a /dev/sda" paranccsal hogy a grown táblában van-e bejegyzés. a manufacturer táblában 100% van azt ne nézd :)

Nagyon koszonom :) Ez tenyleg ertelmezhetobb infot adott. A grown tablaban csak egy bejegyzes volt, remelem, ez mar eleg ok garis cserere ;)

Hi,

Van egy Samsung SP0411N 40 gigás vinyóm, eddig jól ment, de az utóbbi napokban érdekes dolgokat produkál (FreeBSD 5.3 megy a vason).
fsck /dev/ad0s2a-ra néha azt mondja, hogy OK minden, máskor meg DMA olvasási hibák jelennek meg a konzolon (egy, kettő, vagy négy):
ad0: FAILURE - READ_DMA status=59<READY,DSC,DRQ,ERROR> error=40<UNCORRECTABLE> LBA=25834536
CANNOT READ BLK: 2016

ad0: FAILURE - READ_DMA status=59<READY,DSC,DRQ,ERROR> error=40<UNCORRECTABLE> LBA=25834606
THE FOLLOWING DISK SECTORS COULD NOT BE READ: 2086,

A smartctl szerint a Reallocated_Sector_Count = 0, viszont a DMA hiba benne van a logban:

SMART Error Log Version: 1
Warning: ATA error count 6912 inconsistent with error log pointer 5

ATA Error Count: 6912 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 6912 occurred at disk power-on lifetime: 2593 hours (108 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 01 6f 8a 34 e1 Error: UNC 1 sectors at LBA = 0x01348a6f = 20220527

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 6f 8a 34 e1 00 00:08:55.750 READ DMA
c8 00 01 6e 8a 34 e1 00 00:08:55.438 READ DMA
c8 00 01 6d 8a 34 e1 00 00:08:55.375 READ DMA
c8 00 01 6c 8a 34 e1 00 00:08:55.375 READ DMA
c8 00 01 6b 8a 34 e1 00 00:08:55.375 READ DMA

Error 6911 occurred at disk power-on lifetime: 2593 hours (108 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 80 28 8a 34 e1 Error: UNC 128 sectors at LBA = 0x01348a28 = 20220456

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 80 28 8a 34 e1 00 00:08:51.313 READ DMA
c8 00 80 a8 8a 33 e1 00 00:08:51.313 READ DMA
c8 00 80 28 8a 33 e1 00 00:08:51.313 READ DMA
c8 00 80 a8 8a 32 e1 00 00:08:51.313 READ DMA
c8 00 80 28 8a 32 e1 00 00:08:51.313 READ DMA

Error 6910 occurred at disk power-on lifetime: 2593 hours (108 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 01 71 8a 34 e1 Error: UNC 1 sectors at LBA = 0x01348a71 = 20220529

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 71 8a 34 e1 00 00:07:56.313 READ DMA
c8 00 01 70 8a 34 e1 00 00:07:56.313 READ DMA
c8 00 01 6f 8a 34 e1 00 00:07:55.375 READ DMA
c8 00 01 6e 8a 34 e1 00 00:07:55.063 READ DMA
c8 00 01 6d 8a 34 e1 00 00:07:55.063 READ DMA

Error 6909 occurred at disk power-on lifetime: 2593 hours (108 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 80 28 8a 34 e1 Error: UNC 128 sectors at LBA = 0x01348a28 = 20220456

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 80 28 8a 34 e1 00 00:07:50.125 READ DMA
c8 00 80 a8 8a 33 e1 00 00:07:50.125 READ DMA
c8 00 80 28 8a 33 e1 00 00:07:50.125 READ DMA
c8 00 80 a8 8a 32 e1 00 00:07:50.125 READ DMA
c8 00 80 28 8a 32 e1 00 00:07:50.125 READ DMA

Error 6908 occurred at disk power-on lifetime: 2593 hours (108 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 01 6f 8a 34 e1 Error: UNC 1 sectors at LBA = 0x01348a6f = 20220527

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 6f 8a 34 e1 00 00:05:13.750 READ DMA
c8 00 01 6e 8a 34 e1 00 00:05:12.313 READ DMA
c8 00 01 6d 8a 34 e1 00 00:05:12.000 READ DMA
c8 00 01 6c 8a 34 e1 00 00:05:12.000 READ DMA
c8 00 01 6b 8a 34 e1 00 00:05:12.000 READ DMA

Lefuttattam a Samsung gyári tesztprogiját (hutil.exe), ami a felületellenőrzést ~33%-ig csinálta onnan pedig egy órán át köpte kifelé, hogy Error: LBA xxxxxxxx. Aztán meguntam és leállítottam.

Szóval, ****om sincs, hogy mi ez a hiba, vagy mi okozhatja :?:. Ráadásul a három tesz nem ugyanazt az eredményt szolgáltatja.
Még garis a vinyó, úgyhogy kicseréltethetem újra, de nincs nagy kedven újrahúzni a rendszert.

Laci

smartctl -A
mit ad a reallokalt szektorokra?

[quote:c0a6a5e557="x15"]smartctl -A
mit ad a reallokalt szektorokra?

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 253 253 010 Pre-fail Always - 0

DMA hiba az átvitelnél keletkezik, BADSECTOR pedig a lemezen...
nem értem az összefüggést :)

bocs nem tudja valaki hogy amíg hde-ként értem el a sata vinyót addig adott értékelhető eredményt a smartctl, de mióta sda-ként érem el, a válasz:
"Device does not support SMART"?
a kernel, vinyó nem változott, mindössze a libata rétegen éri el mostmár.

[quote:0582a509c5="x-daemon"]DMA hiba az átvitelnél keletkezik, BADSECTOR pedig a lemezen...
nem értem az összefüggést :)

Ez világos, de miért mindig ugyanazon a helyen jelentkezik a hiba :??

A smartctl az alábbi erdeményt adja. Jól gondolom, hogy a test log alapján egy hibás szektor tanyáz a vinyón?

[code:1:3007d8aa9e]
smartctl version 5.26 Copyright (C) 2002-3 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Device: IBM-PSG ST318436LW !# Version: 3281
Serial number: 3BM0Z1JN00007133F7DL
Device type: disk
Local Time is: Mon Sep 20 17:31:55 2004 CEST
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: OK

Vendor (Seagate) cache information
Blocks sent to initiator = 3081462423
Blocks received from initiator = 4186749525
Blocks read from cache and sent to initiator = 487901224
Number of read and write commands whose size <= segment size = 452164967
Number of read and write commands whose size > segment size = 8517837
Vendor (Seagate) factory information
number of hours powered up = 21000.23
number of minutes until next internal SMART test = 20

Error counter log:
Errors Corrected Total Total Correction Gigabytes Total
delay: [rereads/ errors algorithm processed uncorrected
minor | major rewrites] corrected invocations [10^9 bytes] errors
read: 42730 0 0 42730 54910 3302.560 87
write: 0 0 0 0 0 4503.741 0
verify: 111 0 0 111 111 36.473 0

Non-medium error count: 0

SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Failed in segment --> - 20968 0x 5deaf7 [0x3 0x11 0x0]
# 2 Background long Failed in segment --> - 20062 0x 5deaf7 [0x3 0x11 0x0]
# 3 Background short Completed - 4 - [- - -]

Long (extended) Self Test duration: 1255 seconds [20.9 minutes]
[/code:1:3007d8aa9e]