[Info-vax] hung program location
Stephen Hoffman
seaohveh at hoffmanlabs.invalid
Tue Feb 19 08:54:19 EST 2013
On 2013-02-19 13:26:44 +0000, Tom Adams said:
> The PC is 80141918 (as shown on show proc/cont) The process is in HIB
> when it's at that address.
>
> The code is well controlled in CMS so it's easy produce link maps.
>
> I restarted the hung processes. This hanging is a rare event that I
> don't know how to reproduce. But the process does pause at that PC
> in HIB during a normal operation mode.
Build with full machine-code listings and with full maps, and start
instrumenting the code.
As a guess directed at the error...
Look specifically at the handling of $hiber and $wake calls in the
source code, as code that uses $hiber can easily be broken in various
ways, and the end result is either a spurious $wake cycle — which the
code should always expect — or the code gets stuck in a $hiber quite
possibly because one or more $wake calls got coallesced into one $wake
somewhere; it's not really a lost $wake call, but it seems like it.
The gloriously ugly work-around for these problems is adding a $schdwk
call into the code, and deliberately inducing a periodic spurious $wake.
The best approach being figuring out where the $wake got lost, and
reviewing the asynchronous portions of the code for errors.
--
Pure Personal Opinion | HoffmanLabs LLC
More information about the Info-vax
mailing list