[Info-vax] Unpleasant Disk Shadowing Surprise
Michael Moroney
moroney at world.std.spaamtrap.com
Tue Oct 11 22:05:32 EDT 2011
tadamsmar <tadamsmar at yahoo.com> writes:
>This is VMS 7.3-2 on a DS10. I am at V19.0 on patch updates (applied
>in July 2009)
>Our real-time application halted for more than 3 minutes.
>After the incident I found here was a disk error on one member of the
>shadow set.
>The first indication was whe the application watchdog alarm went off.
>According to the console log, more than 30 seconds after the watchdog
>sounded, the shadow set changed state, the offending disk went
>offline, a mount verification started and completed.
>Immediately after the mount verification completed, VMS started
>working again.
>Looks like the disk system was inaccessible for about 3 minutes and
>any process that tried to use it got
>halted somehow.
>Is this to be expected? We have had VMS and disk shadowing running
>the application for 20 years or so, but I don't know that we have ever
>had a disk error while the watchdog was configured to sound, so we
>might not have noticed the halting and recovery.
What is the value of SYSGEN parameter SHADOW_MBR_TMO (or SHADOW_SYS_TMO
if the shadowset in question was the system drive) ?
What is the value of MVTIMEOUT?
If a drive goes bad, SHADOWING will try SHADOW_MBR_TMO seconds before
giving up on it. The default is 120 seconds, I think it's 600 seconds on
older versions of VMS.
MVTIMEOUT is an overall retry timer, its default is 3600 seconds (1 hour).
More information about the Info-vax
mailing list