[Info-vax] faulty cpu?
willh
willhlbw2 at googlemail.com
Mon Aug 17 05:18:08 EDT 2009
On Aug 17, 10:16 am, willh <willhl... at googlemail.com> wrote:
> On Aug 15, 10:44 pm, kiwi-red <antonywar... at gmail.com> wrote:
>
>
>
>
>
> > anyone know what this means and if I can fix it before HP get back to
> > me? Its been 90 minutes since I logged the call.
>
> > ~PCO-I-(pco_03) Running diagnostics on HP: Default_HP
> > Running test 10, Initialize RAMBUS ... on 8 EV7s
> > Running test 11, Initialize Memory ... on 8 EV7s
> > Running test 12, Data Pattern March read/write ... on 8 EV7s
> > [2009/08/15 19:59:57]
> > ~DIA-W-(pco_03) Test 12 [Data Pattern March read/write] failed on cpu
> > [NS: 2 EW: 2] which is cab:00 drw:1 cpu:4
> > BEGIN DIAGNOSTIC TEST FAILURE INFO BEGIN
> > ~CLI-F-(tCLItelnet) Server Manager response fatal error
> > MBM> Test target cabinet:00 drawer:1 CPU4 Serial Num:JA34101631
> > test number: 12 (hex) [Data Pattern March read/write]
> > test status: 03
> > rsvd1: 00
> > result length: 0046
> > revision: V1.0-31
> > error number: 04 -- Remap error bit set, ZBOX0
> > rsvd2: 00
> > error format: 02
> > severity code: 01
> > FRU1: 5
> > FRU2: 0
> > FRU3: 0
> > FRU4: 0
> > P1: 00000000.00000000 Expected data
> > P2: 00000000.00000004 ZBOX0 DRAM Error Status 3
> > P3: ffffff9f.ffc14030 Address of status3 reg
> > P4: 00000000.a108333f ZBOX0 DRAM ERROR CTL reg
> > P5: 00000000.00000000 (null)
> > P6: 00000000.00000000 (null)
> > P7: 00000000.00000000 (null)
> > P8: 00000000.00000000 (null)
> > FRU suspect: RIMM, EV7
> > FRU extra: cab:00 drw:1 CPU4 Warning Zbox0 Raid Remap J5
> > END DIAGNOSTIC TEST FAILURE INFO END
> > [2009/08/15 20:00:02]
> > ~PCO-E-(pco_03) HALT ON ERROR.
>
> > MBM> power on
> > [2009/08/15 20:02:44]
> > ~PCO-I-(pco_03) Preparing to power on partition. HP: Default_HP
> > [2009/08/15 20:02:56]
> > ~PCO-I-(pco_03)
>
> > Configuring for 8 CPUs for HP:0 Default_HP
>
> > 0 1 2 3 4 5 6 7 8 9 A B C D E F
> > .w.....w........................................
> > 0 .P--F--P..F..F..F..F..F.........................
> > .|.....|........................................ P Processor
> > .|.....|........................................ F Filler
> > 1 .P--F--P..F..F..F..F..F......................... | connection
> > .|.....|........................................ - connection
> > .|.....|........................................
> > 2 .P--F--P..F..F..F..F..F......................... W <----------> E
>
> > .|.....|........................................ N ^ (0,0)
> > .|.....|........................................ | .
> > 3 .P--F--P..F..F..F..F..F......................... | .
> > .w.....w........................................ | .
> > ................................................ S v
> > (ns,ew)
> > 4 ................................................
> > ................................................
> > ................................................
> > 5 ................................................
> > ................................................
> > ................................................
> > 6 ................................................
> > ................................................
> > ................................................
> > 7 ................................................
> > ................................................
> > 0 1 2 3 4 5 6 7 8 9 A B C D E F
> > [2009/08/15 20:03:05]
> > ~PCO-I-(pco_03) Running diagnostics on HP: Default_HP
> > Running test 10, Initialize RAMBUS ... on 8 EV7s
> > Running test 11, Initialize Memory ... on 8 EV7s
> > Running test 12, Data Pattern March read/write ... on 8 EV7s
> > [2009/08/15 20:03:20]
> > ~DIA-W-(pco_03) Test 12 [Data Pattern March read/write] failed on cpu
> > [NS: 2 EW: 2] which is cab:00 drw:1 cpu:4
> > BEGIN DIAGNOSTIC TEST FAILURE INFO BEGIN
> > Test target cabinet:00 drawer:1 CPU4 Serial Num:JA34101631
> > test number: 12 (hex) [Data Pattern March read/write]
> > test status: 03
> > ~CLI-F-(tCLItelnet) Server Manager response fatal error
> > MBM> rsvd1: 00
> > result length: 0046
> > revision: V1.0-31
> > error number: 04 -- Remap error bit set, ZBOX0
> > rsvd2: 00
> > error format: 02
> > severity code: 01
> > FRU1: 5
> > FRU2: 0
> > FRU3: 0
> > FRU4: 0
> > P1: 00000000.00000000 Expected data
> > P2: 00000000.00000004 ZBOX0 DRAM Error Status 3
> > P3: ffffff9f.ffc14030 Address of status3 reg
> > P4: 00000000.a108333f ZBOX0 DRAM ERROR CTL reg
> > P5: 00000000.00000000 (null)
> > P6: 00000000.00000000 (null)
> > P7: 00000000.00000000 (null)
> > P8: 00000000.00000000 (null)
> > FRU suspect: RIMM, EV7
> > FRU extra: cab:00 drw:1 CPU4 Warning Zbox0 Raid Remap J5
> > END DIAGNOSTIC TEST FAILURE INFO END
> > [2009/08/15 20:03:24]
> > ~PCO-E-(pco_03) HALT ON ERROR.
>
> FRU extra: cab:00 drw:1 CPU4 Warning Zbox0 Raid Remap J5
>
> Points to RIMM J5 on DU04
>
> MBM> show version
> Local MBM(cab:00, drw:0) FW version V2.7-7 built on Jun 4 2007 at
> 11:05:08
>
> Latest version
> Cab Drw Micro FW Module Flash Firmware Revision
> 0 0 MBM MBMFW V2.7-7
> MBMFSL V2.1-1
> PBMFPGA* V4.1-01
> XSHFPGA* V3.1-10
> CPLD V0.5
> CMMFW* V2.7-5
> CMMFSL* V2.7-2
> CMMFPGA* V114
> SROMFW* V1.0-9
> XSROMFW* V1.0-31
> SRMFW* V7.3-11
> 0 0 CMM0 CMMFW V2.7-5
> CMMFSL V2.7-2
> CMMFPGA V114
> SROMFW V1.0-9
> XSROMFW V1.0-31
> SRMFW V7.3-11
> 0 0 CMM1 CMMFW V2.7-5
> CMMFSL V2.7-2
> CMMFPGA V114
> SROMFW V1.0-9
> XSROMFW V1.0-31
> SRMFW V7.3-11
> 0 0 PBM PBMFW V2.7-7
> PBMFSL V2.1-1
> PBMFPGA V4.1-01
> CPLD V0.5
> CMMFW* V2.7-5
> CMMFSL* V2.7-2
> CMMFPGA* V114
> SROMFW* V1.0-9
> XSROMFW* V1.0-31
> SRMFW* V7.3-11- Hide quoted text -
>
> - Show quoted text -
I meant points to RIMM J5 on DU02 (CPU4)
More information about the Info-vax
mailing list