[Info-vax] Transient anal/disk errors
tadamsmar
tadamsmar at yahoo.com
Thu Dec 12 12:05:38 EST 2013
On Thursday, December 12, 2013 11:31:44 AM UTC-5, Stephen Hoffman wrote:
> On 2013-12-12 15:51:16 +0000, tadamsmar said:
>
>
>
> > On Thursday, December 12, 2013 10:07:28 AM UTC-5, Stephen Hoffman wrote:
>
> >> There's also the infamous BACKUP /IGNORE=INTERLOCK command, which
>
> >> some>> folks think is an online BACKUP. It's not. Worse, it allows
>
> >> silent>> data corruptions in the output savesets. If you have control
>
> >> over the>> applications involved, that's where the BACKUP support needs
>
> >> to reside,>> particularly if your applications are writing clumps of
>
> >> updates to>> disk. Various relational databases on VMS include
>
> >> application-internal>> backup tools, and always use those in preference
>
> >> to using the OpenVMS>> BACKUP command. Alternatively, quiesce the
>
> >> applications or the disks>> or the systems, and then use the standard
>
> >> BACKUP tools. Or quiesce the>> environment and yank a disk from the
>
> >> shadowset, and backup that.
>
> >
>
> > You think I was recently working on my backup strategy? I was just
>
> > working on those persistent ANAL/DISK problems.
>
>
>
> Yes, so was I. In my admittedly skewed view of the world,
>
> investigations of persistent disk errors are always secondary to having
>
> good and verified backups. Preserve the most current data first, then
>
> study the disks and the errors.
>
>
>
> > But I probably do need to work on my backup strategy. I have been
>
> > yanking out a disk without quiescing and backing up the yanked disk,
>
> > and I have not done any deliberate recovery testing, just defacto when
>
> > I had to recover a file or compress a disk. Just yanking a disk is
>
> > easy, I just have to run command procedures, but as you point out, it
>
> > might not have optimal reliability.
>
>
>
> You're hot-plugging active disks,
By "yanking" I just meant dismounting a disk from a shadowset. We only have one system, the AS800, that allow literal yanking of a physical disk.
> and probably in an environment
>
> without a quiesce function on the storage controller?
>
>
>
> Don't do that.
>
>
>
> You've probably been causing some of the errors and corruptions here.
>
>
>
> > What's the easiest way to quiesce and yank?
>
>
>
> Depending on the bus and the target, via DISMOUNT command. Some
>
> storage controllers support a quiesce function, and others expect you
>
> to shut down. I'm guessing your gear probably lacks one of those
>
> controllers; that feature usually only exists on outboard storage
>
> controllers. It's not a feature usually found with host-based JBOB
>
> SCSI controllers, nor even necessarily on some of the host-based SCSI
>
> RAID controllers.
>
>
dismount is what I meant by yank.
>
> But that's not how I'd do the backups I'm referring to. I'd DISMOUNT
>
> the disk from the shadowset, and MOUNT /NOWRITE the disk privately, and
>
> back up from there. There are minimerge and minicopy bitmaps that were
>
> discussed here in massive detail when Phillip Helbig was trying to
>
> understand how all that worked, so I'm not going to bother reposting
>
> all of that here. Those features will help bring the
>
> temporarily-removed disk back to current within the shadowset more
>
> quickly. Search for threads with minicopy or minimerge or related
>
> keywords via Google Groups, and start reading. Or check the current
>
> volume shadowing manual in the OpenVMS documentation set. Or both.
>
I use /minicopy when I dismount for backing up.
>
>
> > The only way I am sure of is to shutdown, boot with a CD, yank, then
>
> > reboot normally.
>
>
>
> That's the best way, official way, and only supported way, if you need
>
> to reconfigure a SCSI, and lack a storage controller with a quiesce
>
> function.
>
>
>
> > I am not sure that there is a console command that will yank a disk
>
> > from a shadowset, but I seem to recall one that will disable shadowing.
>
>
>
> Allow me to translate "I don't recall" as "which manual should I read
>
> to learn more about the fundamental operations of the server?". That'd
>
> be the volume shadowing manual. <http://www.hp.com/go/openvms/doc>,
>
> select the VMS documentation shelf, and search for "shadowing", and
>
> skim that manual. You'll definitely need to be more familiar with it
>
> if (when?) you decide to implement minicopy or minimerge. (Though your
>
> VMS version is ancient, and there were definitely various patches made
>
> available in this and related areas of OpenVMS.)
>
>
>
> > I have noticed that sometimes a yanked disk will not run ANAL/DISK
>
> > clean. This also seems to be transient.
>
>
>
> Yeah. Sometimes yanking the disk just silently corrupts the file data
>
> on that disk, depending on the timing. I wouldn't assume other disks
>
> on the SCSI bus would be entirely immune from problems or corruptions,
>
> either. Not without quiescing the bus, or shutting down, or
>
> dismounting the disks on that bus.
>
>
I meant dismounting. If I dismount from a shadowset and run anal/disk on the dismounted disk, I sometimes get warnings form anal/disk. I just recently started checking this as I have gotten a bit more concerned after I had persistent warnings from anal/disk on one system that required a good bit of cleanup.
So perhaps even a dismount from a shadowset is a bit risky for backing up. Of course, it's a heck of a lot easier than the official way to prep a disk for backup.
>
>
>
>
>
> --
>
> Pure Personal Opinion | HoffmanLabs LLC
More information about the Info-vax
mailing list