[Info-vax] Wide area cluster, metro area network, seeking info

Stephen Hoffman seaohveh at hoffmanlabs.invalid
Wed Jun 9 14:31:30 EDT 2021


On 2021-06-08 22:28:49 +0000, Rich Jordan said:

> We are looking at the possibility of putting VMS boxes in two 
> locations, with Integrity boxes running VSI VMS.  This is the very 
> beginning of the research on the possibility of clustering those two 
> servers instead of just having them networked.  Probably have to be 
> master/slave since only two nodes and no shared storage.
> 
> After reviewing the various cluster docs, they seem to be focused on 
> older technologies like SoNET and DS3 using FDDI bridges (which would 
> allow shared storage).  The prospect has a metropolitan area network 
> but I do not have any specs on that as yet.
> 
> Are there available docs relevant to running a distributed VMS cluster 
> over a metro area network or fast/big enough VPN tunnel?  Or is that 
> just the straight cluster over IP configuration in the docs (which 
> we've never used) that we need to concentrate on?

Once the costs are understood and acceptable, then the design 
discussions and trade-offs can progress.

Get the prices, first. Before starting to investigate the design in any 
detail, get a quote from VSI for clustering and likely also for 
software RAID-1 (HBVS), and determine whether those costs are 
sustainable, and whether SaaS licensing or traditional licensing is 
preferred. Prior to the advent of SaaS licensing, the cluster license 
purchase prices alone ended more than a few of these project 
discussions.  Also for this case, get a quote for metropolitan Ethernet 
via SDN or MPLS or point-to-point microwave or whatever similar 
connection the local comms carriers are offering.

Once you're over that purchasing hurdle, your design here is inherently 
going to be primary-secondary only (two hosts, with no shared storage). 
Your interconnect speed will be limited by the available link speeds, 
which particularly plays into software RAID-1 processing, and you'll 
need to discuss whether that'll meet your customer's clustering 
expectations. Primary-secondary also means manual intervention to 
promote the secondary to primary status, whether the primary is down or 
the link is down. And clustering usually means both sites drop offline 
if somebody or something clobbers the shared production data, whether 
that was an accidental, incidental, or malicious corruption.
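
For a rough feel of why two hosts end up primary-secondary, here is a 
small sketch of the cluster quorum arithmetic, using the documented 
formula quorum = (EXPECTED_VOTES + 2) / 2, truncated. The node names 
and vote assignments are made up for illustration; this is not a 
recommended configuration, just the arithmetic.

```python
def quorum(expected_votes: int) -> int:
    """Cluster quorum: (EXPECTED_VOTES + 2) / 2, truncated to an integer."""
    return (expected_votes + 2) // 2


def keeps_running(votes: dict[str, int], reachable: set[str]) -> bool:
    """True if the reachable nodes together hold at least quorum votes."""
    expected = sum(votes.values())
    present = sum(v for node, v in votes.items() if node in reachable)
    return present >= quorum(expected)


# Hypothetical vote assignments for a two-node cluster.
primary_secondary = {"PRIMRY": 1, "SECNDY": 0}
equal_votes = {"PRIMRY": 1, "SECNDY": 1}

for label, votes in (("1/0 votes", primary_secondary), ("1/1 votes", equal_votes)):
    for survivor in votes:
        state = "continues" if keeps_running(votes, {survivor}) else "hangs"
        print(f"{label}: only {survivor} reachable -> {state}")
```

With one vote each, losing either host or the link stalls both sides. 
With all the votes on the primary, the primary survives alone and the 
secondary waits for somebody to intervene.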

I prefer LAN clustering and network bridging to the IPCI clustering / 
NISCS_USE_UDP path as there are some other cluster-related options 
available only with a bridged LAN, though both interconnects can and do 
work.

Whichever you pick here, software RAID-1 needs beaucoup bandwidth or 
you'll be enjoying cascading failures.
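
As a back-of-the-envelope check on "beaucoup", this sketch estimates 
how long a full copy of one shadowed volume takes across the 
inter-site link. The volume size, link speeds, and the 70% usable 
fraction are all assumptions for illustration; real copies also 
compete with production I/O and cluster traffic.

```python
def full_copy_hours(volume_gb: float, link_mbps: float,
                    usable_fraction: float = 0.7) -> float:
    """Hours to push one full volume copy across the link.

    volume_gb       -- volume size in decimal gigabytes (assumed figure)
    link_mbps       -- nominal link speed in megabits per second
    usable_fraction -- assumed share of the link left for the copy
    """
    if volume_gb < 0 or link_mbps <= 0 or not 0 < usable_fraction <= 1:
        raise ValueError("sizes and speeds must be positive, fraction in (0, 1]")
    bits_to_move = volume_gb * 8e9
    usable_bits_per_second = link_mbps * 1e6 * usable_fraction
    return bits_to_move / usable_bits_per_second / 3600


# Hypothetical 500 GB volume over a few candidate link speeds.
for mbps in (100, 1000, 10000):
    print(f"{mbps:>6} Mb/s: {full_copy_hours(500, mbps):6.2f} hours")
```

If a full copy takes most of a day, then a link flap or a member 
removal during that window restarts the pain, which is how the 
cascading failures get started.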

Also review the DT (delirium tremens, err, disaster tolerance) 
alternatives. Splitting off volumes and shipping those copies to the 
warm site on a rotating schedule can be cost-acceptable for some 
applications and environments.

The above assumes y'all are within the maximum clustering span; within 
500 mi / 800 km, as the wires were drug; as the electrons fly.
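
Distance also costs latency, not just eligibility. A minimal sketch of 
the propagation delay over the cable path, assuming light in fiber 
covers roughly 200 km per millisecond (about two-thirds of c). 
Switching, queuing, and carrier routing detours are ignored here, so 
treat the result as a floor.

```python
FIBER_KM_PER_MS = 200.0  # assumed: roughly 2/3 the speed of light in vacuum


def round_trip_ms(path_km: float) -> float:
    """Minimum round-trip propagation delay over path_km of fiber."""
    if path_km < 0:
        raise ValueError("path length must be non-negative")
    return 2 * path_km / FIBER_KM_PER_MS


# Cable-path distances in km; 800 km is the span limit mentioned above.
for km in (50, 200, 800):
    print(f"{km:>4} km path: at least {round_trip_ms(km):.1f} ms round trip")
```

Every remote write that has to be acknowledged before it completes 
pays at least that round trip, so measure the path the carrier 
actually uses and not the map distance.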

-- 
Pure Personal Opinion | HoffmanLabs LLC 



