[Info-vax] faulty cpu?

David J Dachtera djesys.no at spam.comcast.net
Sat Aug 15 22:41:42 EDT 2009


kiwi-red wrote:
> 
> anyone know what this means and if I can fix it before HP get back to
> me? Its been 90 minutes since I logged the call.
> 
> ~PCO-I-(pco_03) Running diagnostics on HP: Default_HP
> Running test 10, Initialize RAMBUS ... on 8 EV7s
> Running test 11, Initialize Memory ... on 8 EV7s
> Running test 12, Data Pattern March read/write ... on 8 EV7s
> [2009/08/15 19:59:57]
> ~DIA-W-(pco_03) Test 12 [Data Pattern March read/write] failed on cpu
> [NS: 2 EW: 2] which is cab:00 drw:1 cpu:4
>     BEGIN DIAGNOSTIC TEST FAILURE INFO BEGIN
> ~CLI-F-(tCLItelnet) Server Manager response fatal error
> MBM>     Test target cabinet:00 drawer:1 CPU4 Serial Num:JA34101631
>     test number: 12 (hex)  [Data Pattern March read/write]
>     test status: 03
>     rsvd1: 00
>     result length: 0046
>     revision: V1.0-31
>     error number: 04 -- Remap error bit set, ZBOX0
>     rsvd2: 00
>     error format: 02
>     severity code: 01
>     FRU1: 5
>     FRU2: 0
>     FRU3: 0
>     FRU4: 0
>     P1: 00000000.00000000 Expected data
>     P2: 00000000.00000004 ZBOX0 DRAM Error Status 3
>     P3: ffffff9f.ffc14030 Address of status3 reg
>     P4: 00000000.a108333f ZBOX0 DRAM ERROR CTL reg
>     P5: 00000000.00000000 (null)
>     P6: 00000000.00000000 (null)
>     P7: 00000000.00000000 (null)
>     P8: 00000000.00000000 (null)
>     FRU suspect: RIMM, EV7
>     FRU extra:  cab:00 drw:1 CPU4 Warning Zbox0 Raid Remap J5
>     END DIAGNOSTIC TEST FAILURE INFO END
> [2009/08/15 20:00:02]
> ~PCO-E-(pco_03) HALT ON ERROR.
> 
> MBM> power on
> [2009/08/15 20:02:44]
> ~PCO-I-(pco_03) Preparing to power on partition. HP: Default_HP
> [2009/08/15 20:02:56]
> ~PCO-I-(pco_03)
> 
> Configuring for 8 CPUs for HP:0 Default_HP
> 
>        0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
>       .w.....w........................................
>   0   .P--F--P..F..F..F..F..F.........................
>       .|.....|........................................   P  Processor
>       .|.....|........................................   F  Filler
>   1   .P--F--P..F..F..F..F..F.........................   |  connection
>       .|.....|........................................   -  connection
>       .|.....|........................................
>   2   .P--F--P..F..F..F..F..F.........................    W <----------
> > E
>       .|.....|........................................  N ^ (0,0)
>       .|.....|........................................    |  .
>   3   .P--F--P..F..F..F..F..F.........................    |    .
>       .w.....w........................................    |       .
>       ................................................  S v
> (ns,ew)
>   4   ................................................
>       ................................................
>       ................................................
>   5   ................................................
>       ................................................
>       ................................................
>   6   ................................................
>       ................................................
>       ................................................
>   7   ................................................
>       ................................................
>        0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
> [2009/08/15 20:03:05]
> ~PCO-I-(pco_03) Running diagnostics on HP: Default_HP
> Running test 10, Initialize RAMBUS ... on 8 EV7s
> Running test 11, Initialize Memory ... on 8 EV7s
> Running test 12, Data Pattern March read/write ... on 8 EV7s
> [2009/08/15 20:03:20]
> ~DIA-W-(pco_03) Test 12 [Data Pattern March read/write] failed on cpu
> [NS: 2 EW: 2] which is cab:00 drw:1 cpu:4
>     BEGIN DIAGNOSTIC TEST FAILURE INFO BEGIN
>     Test target cabinet:00 drawer:1 CPU4 Serial Num:JA34101631
>     test number: 12 (hex)  [Data Pattern March read/write]
>     test status: 03
> ~CLI-F-(tCLItelnet) Server Manager response fatal error
> MBM>     rsvd1: 00
>     result length: 0046
>     revision: V1.0-31
>     error number: 04 -- Remap error bit set, ZBOX0
>     rsvd2: 00
>     error format: 02
>     severity code: 01
>     FRU1: 5
>     FRU2: 0
>     FRU3: 0
>     FRU4: 0
>     P1: 00000000.00000000 Expected data
>     P2: 00000000.00000004 ZBOX0 DRAM Error Status 3
>     P3: ffffff9f.ffc14030 Address of status3 reg
>     P4: 00000000.a108333f ZBOX0 DRAM ERROR CTL reg
>     P5: 00000000.00000000 (null)
>     P6: 00000000.00000000 (null)
>     P7: 00000000.00000000 (null)
>     P8: 00000000.00000000 (null)
>     FRU suspect: RIMM, EV7
>     FRU extra:  cab:00 drw:1 CPU4 Warning Zbox0 Raid Remap J5
>     END DIAGNOSTIC TEST FAILURE INFO END
> [2009/08/15 20:03:24]
> ~PCO-E-(pco_03) HALT ON ERROR.

Fair bet the answer is "no".

If MBM is reporting it, it's bad enough to prevent the o.s. from
booting, much less running.

Better to wait for field service to arrive.

D.J.D.



More information about the Info-vax mailing list