ESXi logs

Hi all!

Here's the situation: I have a Dell PE T110 II server currently running ESXi 5.1, and I'm finding these entries in the logs:
vmkernel.log:
2013-10-10T12:26:43.351Z cpu7:2055)ScsiDeviceIO: 2331: Cmd(0x4124007e5640) 0x85, CmdSN 0x2dc from world 3156 to dev "naa.600508e00000000011e4f797d7f44808" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:26:43.351Z cpu7:2055)ScsiDeviceIO: 2331: Cmd(0x4124007e5640) 0x4d, CmdSN 0x2dd from world 3156 to dev "naa.600508e00000000011e4f797d7f44808" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:26:43.351Z cpu7:2055)ScsiDeviceIO: 2331: Cmd(0x4124007e5640) 0x1a, CmdSN 0x2de from world 3156 to dev "naa.600508e00000000011e4f797d7f44808" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
2013-10-10T12:30:50.898Z cpu0:2498)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x4124007880c0, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:30:50.898Z cpu0:2498)ScsiDeviceIO: 2331: Cmd(0x4124007880c0) 0x1a, CmdSN 0xb901 from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:35:50.912Z cpu1:2485)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x4124007bd2c0, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:35:50.912Z cpu1:2485)ScsiDeviceIO: 2331: Cmd(0x4124007bd2c0) 0x1a, CmdSN 0xb94f from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:40:50.928Z cpu5:2053)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x4124007f1900, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:40:50.928Z cpu5:2053)ScsiDeviceIO: 2331: Cmd(0x4124007f1900) 0x1a, CmdSN 0xb99d from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:45:50.942Z cpu2:2591)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x41240080e4c0, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:45:50.942Z cpu2:2591)ScsiDeviceIO: 2331: Cmd(0x41240080e4c0) 0x1a, CmdSN 0xb9eb from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:50:50.954Z cpu0:2056)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x4124007f1b00, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:50:50.954Z cpu0:2056)ScsiDeviceIO: 2331: Cmd(0x4124007f1b00) 0x1a, CmdSN 0xba39 from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:55:50.969Z cpu6:2591)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x1a (0x41240079b4c0, 0) to dev "mpx.vmhba35:C0:T0:L0" on path "vmhba35:C0:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE
2013-10-10T12:55:50.969Z cpu6:2591)ScsiDeviceIO: 2331: Cmd(0x41240079b4c0) 0x1a, CmdSN 0xba89 from world 0 to dev "mpx.vmhba35:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.
2013-10-10T12:56:43.444Z cpu6:2054)NMP: nmp_ThrottleLogForDevice:2319: Cmd 0x85 (0x41240074ebc0, 3156) to dev "naa.600508e00000000011e4f797d7f44808" on path "vmhba1:C1:T0:L0" Failed: H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0. Act:NONE

vmkwarning.log:

2013-10-05T09:56:39.049Z cpu2:2484)WARNING: VMK_PCI: 1170: device 00:00:1f.3 has no legacy interrupt(s)
2013-10-05T09:56:39.049Z cpu2:2484)WARNING: LinPCI: LinuxPCILegacyIntrVectorSet:80:Could not allocate legacy PCI interrupt for device 0000:00:1f.3
2013-10-05T09:56:40.698Z cpu4:2421)WARNING: Team.etherswitch: TeamES_Activate:309:Failed to initialize beaconing on portset 'pps': Not implemented.
2013-10-05T09:57:00.499Z cpu0:2593)WARNING: ScsiScan: 1276: Failed to add path vmhba1:C0:T0:L0 : Not found
2013-10-05T09:57:00.501Z cpu0:2593)WARNING: ScsiScan: 1276: Failed to add path vmhba1:C0:T1:L0 : Not found
2013-10-05T09:57:00.552Z cpu0:2594)WARNING: LinuxSignal: 761: ignored unexpected signal flags 0x2 (sig 17)
2013-10-05T09:57:00.604Z cpu7:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T09:57:00.604Z cpu7:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T09:57:00.604Z cpu7:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T09:57:00.604Z cpu7:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T10:14:25.177Z cpu3:5194)WARNING: UserTeletype: 1655: Unknown cmd 0x5409 (data 0x1) for slave
2013-10-05T10:14:54.510Z cpu6:5235)WARNING: UserLinux: 1331: unsupported: (void)
2013-10-05T10:20:15.377Z cpu2:2115)WARNING: VFAT: 4346: Failed to flush file times: Stale file handle
2013-10-05T10:25:37.478Z cpu5:2475)WARNING: LinuxSignal: 761: ignored unexpected signal flags 0x2 (sig 17)
2013-10-05T10:25:38.099Z cpu3:2484)WARNING: VMK_PCI: 1170: device 00:00:1f.3 has no legacy interrupt(s)
2013-10-05T10:25:38.099Z cpu3:2484)WARNING: LinPCI: LinuxPCILegacyIntrVectorSet:80:Could not allocate legacy PCI interrupt for device 0000:00:1f.3
2013-10-05T10:25:40.053Z cpu6:2421)WARNING: Team.etherswitch: TeamES_Activate:309:Failed to initialize beaconing on portset 'pps': Not implemented.
2013-10-05T10:25:59.026Z cpu6:2593)WARNING: ScsiScan: 1276: Failed to add path vmhba1:C0:T0:L0 : Not found
2013-10-05T10:25:59.028Z cpu6:2593)WARNING: ScsiScan: 1276: Failed to add path vmhba1:C0:T1:L0 : Not found
2013-10-05T10:25:59.080Z cpu1:2594)WARNING: LinuxSignal: 761: ignored unexpected signal flags 0x2 (sig 17)
2013-10-05T10:25:59.133Z cpu4:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T10:25:59.133Z cpu4:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T10:25:59.133Z cpu4:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T10:25:59.133Z cpu4:2421)WARNING: Uplink: 3075: releasing cap 0x0!
2013-10-05T10:26:25.670Z cpu6:3738)WARNING: UserTeletype: 1655: Unknown cmd 0x5409 (data 0x1) for slave
2013-10-05T10:46:39.356Z cpu4:5086)WARNING: UserLinux: 1331: unsupported: (void)
2013-10-05T21:01:03.040Z cpu7:2115)WARNING: VFAT: 4346: Failed to flush file times: Stale file handle
2013-10-06T22:01:03.043Z cpu7:2115)WARNING: VFAT: 4346: Failed to flush file times: Stale file handle
2013-10-07T08:01:03.093Z cpu6:2115)WARNING: VFAT: 4346: Failed to flush file times: Stale file handle
2013-10-08T18:56:32.647Z cpu2:313815)WARNING: UserLinux: 1331: unsupported: (void)
2013-10-09T19:49:35.182Z cpu3:410983)WARNING: UserLinux: 1331: unsupported: (void)
2013-10-10T00:01:02.999Z cpu2:2115)WARNING: VFAT: 4346: Failed to flush file times: Stale file handle
2013-10-10T11:06:42.540Z cpu3:474200)WARNING: UserLinux: 1331: unsupported: (void)
2013-10-10T13:44:11.631Z cpu0:484557)WARNING: UserLinux: 1331: unsupported: (void)

Should I start worrying that something isn't quite right, apart from the fact that the machine has a PERC H200i, a painfully slow RAID card? (It may get replaced someday.)

Update: yesterday the hypervisor went down for some reason, but I have no idea what to look for in the logs. At first glance I saw nothing unusual in syslog or vmkernel.log.
However, I found a lot of these in vobd.log:
Successfully sent event (esx.audit.net.firewall.config.changed) after 1 failure.

What could this be?

Comments

Understanding SCSI host-side NMP errors/conditions in ESX 4.x and ESXi 5.x (1029039)
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cm…

Based on this, the very first entry in the table applies to your error messages (H:0x0):

"This status is returned when there is no error on the host side. This is when you will see if there is a status for a Device or Plugin. It is also when you will see Valid sense data instead of Possible sense Data."

So it's logging to the error log that there is no error :)

This is worth reading too:
http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&doc…

Update, see above
--
>'The time has come,' the Walrus said<

That's possible, but unfortunately I have no remote management other than IPMI, so I couldn't look at the screen. I only noticed that the hypervisor and all the VMs were unreachable while IPMI kept working, so it wasn't the whole box that went down.
--
>'The time has come,' the Walrus said<

0x85, CmdSN 0x2dc from world 3156 to dev "naa.600508e00000000011e4f797d7f44808" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x20 0x0.

Here the 0x85 command is an ATA PASS THROUGH, and the 0x5 0x20 0x0 returned for it means ILLEGAL REQUEST (sense key 0x5), with ASC/ASCQ 0x20/0x0 meaning an invalid command operation code.

So something here is trying to do an operation that this particular device doesn't support, but it's definitely not a hardware fault. The root of the problem has to be sought elsewhere...
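
If you want to decode more of these lines without looking each one up by hand, here is a minimal Python sketch. The names `LINE_RE` and `decode` are made up for illustration, and the lookup tables only cover the opcodes and sense values that actually appear in this thread (taken from the SCSI standards), so treat it as a starting point rather than a complete decoder.

```python
import re

# Lookup tables limited to the values seen in the logs above.
OPCODES = {0x1A: "MODE SENSE(6)", 0x4D: "LOG SENSE", 0x85: "ATA PASS-THROUGH(16)"}
DEVICE_STATUS = {0x0: "GOOD", 0x2: "CHECK CONDITION"}
SENSE_KEYS = {0x5: "ILLEGAL REQUEST"}
ASC_ASCQ = {
    (0x20, 0x00): "INVALID COMMAND OPERATION CODE",
    (0x24, 0x00): "INVALID FIELD IN CDB",
}

# Matches both the "ScsiDeviceIO: ... Cmd(0x...) 0x85," and the
# "NMP: ... Cmd 0x1a (0x..., 0)" line shapes.
LINE_RE = re.compile(
    r"Cmd(?:\(0x[0-9a-f]+\))? (?P<op>0x[0-9a-f]+).*?"
    r"H:(?P<h>0x[0-9a-f]+) D:(?P<d>0x[0-9a-f]+) P:(?P<p>0x[0-9a-f]+) "
    r"(?:Valid|Possible) sense data: "
    r"(?P<key>0x[0-9a-f]+) (?P<asc>0x[0-9a-f]+) (?P<ascq>0x[0-9a-f]+)",
    re.IGNORECASE,
)


def decode(line):
    """Return a dict describing one vmkernel SCSI failure line, or None."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    op, h, d, p, key, asc, ascq = (
        int(m[name], 16) for name in ("op", "h", "d", "p", "key", "asc", "ascq")
    )
    return {
        "opcode": OPCODES.get(op, hex(op)),
        "host_ok": h == 0,  # H:0x0 means no host-side error
        "device_status": DEVICE_STATUS.get(d, hex(d)),
        "plugin_ok": p == 0,
        "sense_key": SENSE_KEYS.get(key, hex(key)),
        "asc_ascq": ASC_ASCQ.get((asc, ascq), f"{asc:#x}/{ascq:#x}"),
    }


if __name__ == "__main__":
    sample = (
        '2013-10-10T12:26:43.351Z cpu7:2055)ScsiDeviceIO: 2331: '
        'Cmd(0x4124007e5640) 0x85, CmdSN 0x2dc from world 3156 to dev '
        '"naa.600508e00000000011e4f797d7f44808" failed H:0x0 D:0x2 P:0x0 '
        'Valid sense data: 0x5 0x20 0x0.'
    )
    print(decode(sample))
```

Running every line of vmkernel.log through `decode` and counting the results should show at a glance whether anything other than ILLEGAL REQUEST responses to MODE SENSE, LOG SENSE and ATA PASS THROUGH is being logged.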