[Info-vax] VMware
David Wade
g4ugm at dave.invalid
Wed Dec 11 16:59:10 EST 2019
On 11/12/2019 20:43, Bob Gezelter wrote:
> On Wednesday, December 11, 2019 at 2:48:50 PM UTC-5, David Wade wrote:
>> On 11/12/2019 17:01, Bob Gezelter wrote:
>>> On Wednesday, December 11, 2019 at 9:02:21 AM UTC-5, David Wade wrote:
>>>> On 11/12/2019 13:32, Bob Gezelter wrote:
>>>>> On Tuesday, December 10, 2019 at 10:00:27 PM UTC-5, Grant Taylor wrote:
>>>>>> On 12/10/19 5:27 PM, Bob Gezelter wrote:
>>>>>>> With all due respect, I want to see the fine-grain details on
>>>>>>> that implementation.
>>>>>>
>>>>>> Please see my reply from ~ 7:35. (Adjust hour accordingly for your time
>>>>>> zone.)
>>>>>>
>>>>>> I think that's about as granular as I can get without going and looking
>>>>>> things up.
>>>>>>
>>>>>> I have no problem with you wanting to see the fine-grain details. I
>>>>>> asked very similar questions 10+ years ago, which is why I have the
>>>>>> understanding that I do, and why I can now only recall it at a high level.
>>>>>>
>>>>>>> Particularly the part about "packets do not drop".
>>>>>>
>>>>>> I've routinely moved VMs between hosts without dropping packets. I do
>>>>>> see latency increase momentarily at the epoch of the transition (usually
>>>>>> affecting just one packet). But the packet does make it through and is
>>>>>> not dropped.
>>>>>>
>>>>>> Frequently latency is something like this:
>>>>>>
>>>>>> 1–3 ms
>>>>>> 1–3 ms
>>>>>> 1–3 ms
>>>>>> 9–12 ms
>>>>>> 1–3 ms
>>>>>> 1–3 ms
>>>>>> 1–3 ms
>>>>>>
>>>>>> No packet drop.
>>>>>>
>>>>>> TCP sessions continue without retransmissions.
>>>>>>
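A minimal sketch of how one might observe this from a second machine,
assuming a Unix-like system with the standard ping utility on PATH
("vm.example.com" is a hypothetical target, not anything from this thread):

    import re
    import subprocess
    import time

    TARGET = "vm.example.com"  # hypothetical VM under migration

    while True:
        out = subprocess.run(
            ["ping", "-c", "1", "-W", "1", TARGET],
            capture_output=True, text=True,
        ).stdout
        m = re.search(r"time=([\d.]+) ms", out)
        # A clean migration shows a one-packet latency spike
        # (e.g. 1-3 ms jumping to ~10 ms), never "LOST".
        print(m.group(1) + " ms" if m else "LOST")
        time.sleep(1)
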
>>>>>>> Ensuring granularity of file update is also quite a challenge.
>>>>>>
>>>>>> Why? (Please see my other message about what happens.)
>>>>>>
>>>>>>> There is a large difference between "rarely are packets lost" and
>>>>>>> "packets are never lost". Pre-loading other virtual instances and
>>>>>>> keeping their memory state updated is one thing; ensuring mass
>>>>>>> storage state is something else.
>>>>>>
>>>>>> All hosts in the cluster have access to the same storage. So anything
>>>>>> written on one host is readable by other hosts. Part of the migration
>>>>>> ensures that cached data is synced to disk and / or copied as part of
>>>>>> the system's memory.
>>>>>>
>>>>>> So there's no "mass storage state" to keep in sync because it is the
>>>>>> same back end storage.
>>>>>>
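As a sanity check along these lines, here is a sketch using the pyVmomi
library to confirm that every host sees the same datastores; the vCenter
address and credentials are placeholders, not anything from this thread:

    import ssl
    from pyVim.connect import SmartConnect, Disconnect
    from pyVmomi import vim

    ctx = ssl._create_unverified_context()  # lab use only
    si = SmartConnect(host="vcenter.example.com",
                      user="administrator", pwd="secret",
                      sslContext=ctx)
    content = si.RetrieveContent()
    view = content.viewManager.CreateContainerView(
        content.rootFolder, [vim.HostSystem], True)
    # Every host should report the same set of shared datastore names,
    # which is what makes migration a memory-only affair.
    for host in view.view:
        print(host.name, sorted(ds.name for ds in host.datastore))
    Disconnect(si)
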
>>>>>>> I will not even get into questions like the state of attached
>>>>>>> non-storage peripherals, e.g. RNGs.
>>>>>>
>>>>>> Those would be the types of things that would prevent migration between
>>>>>> hosts.
>>>>>>
>>>>>> Though, I think that VMware has an option to allow USB peripherals to be
>>>>>> used across the network.
>>>>>>
>>>>>> If not VMware, there are other OS level solutions to allow some
>>>>>> peripherals to be used across the network.
>>>>>>
>>>>>> I've personally used remote (TCP based) serial ports for fax servers.
>>>>>> The modem is physically connected to a network attached DigiBoard (or
>>>>>> the likes) and the VM is free to move from host to host to host because
>>>>>> its TCP connection to the serial port is still intact.
>>>>>>
>>>>>> Given that faxing is time-sensitive serial audio / data (depending on
>>>>>> the modem) there may be an issue with the momentary increased latency.
>>>>>> I don't know if that would ride through a migration or if it would rely
>>>>>> on error detection and correction in the modem / fax level.
>>>>>>
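A minimal sketch of that arrangement, assuming the pyserial package and a
hypothetical RFC 2217 serial server at "digi.example.com":

    import serial  # pip install pyserial

    # The VM holds a TCP connection to the serial server rather than
    # owning local hardware, so a live migration merely delays the
    # connection instead of breaking it.
    port = serial.serial_for_url(
        "rfc2217://digi.example.com:2217", baudrate=9600, timeout=5)
    port.write(b"AT\r")       # poke the modem
    print(port.readline())    # expect b"OK\r\n" or similar
    port.close()
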
>>>>>>> My general advice is to deeply verify the precise nature of the
>>>>>>> implementation and its limitations before relying on it.
>>>>>>
>>>>>> I think that's a wonderful idea.
>>>>>>
>>>>>>> A while back, I was at a user group event where there was a
>>>>>>> presentation on VM migration. The speaker made a statement that
>>>>>>> failover migration would handle all cases. Being from New York City,
>>>>>>> I inquired about a scenario we had experienced a few years earlier.
>>>>>>
>>>>>> ~chuckle~
>>>>>>
>>>>>> Absolutes are usually a problem in one way or another. ;-)
>>>>>>
>>>>>>> A Boeing 767 doing between 150 and 200 knots comes through your machine
>>>>>>> room window. How long does it take to traverse the 24 inches between the
>>>>>>> front of the cabinet and the back of the cabinet? Even that scenario
>>>>>>> does not include the fact that the infrastructure connecting one
>>>>>>> VM host to another has likely been severed before the VM host frame
>>>>>>> is hit.
>>>>>>
>>>>>> I think that's a valid question. I think it's an EXTREMELY ATYPICAL
>>>>>> failure scenario. But it is decidedly within the "all cases" absolute
>>>>>> the speaker set themselves up for.
>>>>>>
>>>>>> I think that would be very difficult to protect against.
>>>>>>
>>>>>> I would ask: what about a data center in an adjacent building that
>>>>>> you can extend the LAN / SAN / etc. into? Though it could also
>>>>>> experience a similar problem (fate sharing).
>>>>>>
>>>>>> When you start talking about failures that can take out multiple
>>>>>> buildings in close proximity to each other, you REALLY need an EXTREMELY
>>>>>> robust solution.
>>>>>>
>>>>>> I do think that VMware has some solutions that can work over extended
>>>>>> distances.
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Grant. . . .
>>>>>> unix || die
>>>>>
>>>>> Grant,
>>>>>
>>>>> Your post proves my point.
>>>>>
>>>>> I do not disagree that within the context of "controlled" VM migration between hosts, it is possible to accomplish the migration without loss of packets or I/O inconsistency.
>>>>>
>>>>> It is the uncontrolled case to which I referred.
>>>>>
>>>>> Of course, in the controlled case, the connection to the switch can be blocked/queued AND acknowledged to prevent packet(s) from being caught during the transition. Alternatively, the MAC address can be changed and the packets queued at the new host. A similar argument applies to I/O. In a controlled case, active I/O can be completed before the transfer.
>>>>>
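The switch-update step can be illustrated with a gratuitous ARP
announcement. A sketch using scapy, not VMware's actual code (vMotion
reportedly uses RARP frames for the same purpose); the MAC, IP, and
interface below are placeholders:

    from scapy.all import ARP, Ether, sendp  # pip install scapy

    VM_MAC = "00:50:56:aa:bb:cc"   # hypothetical VM MAC
    VM_IP = "192.0.2.10"           # hypothetical VM IP

    # Broadcast "VM_IP is at VM_MAC" from the destination host so the
    # switch re-learns the VM's MAC on its new port immediately.
    frame = Ether(src=VM_MAC, dst="ff:ff:ff:ff:ff:ff") / ARP(
        op=2, hwsrc=VM_MAC, psrc=VM_IP,
        hwdst="ff:ff:ff:ff:ff:ff", pdst=VM_IP)
    sendp(frame, iface="eth0")     # needs root privileges
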
>>>>> Otherwise, one needs facilities not present in x86 (e.g., lock-step execution as was implemented on some fault tolerant architectures in the past). As an example, modern hardware RNGs make precise execution profiles on modern systems unlikely.
>>>>>
>>>>> - Bob Gezelter, http://www.rlgsc.com
>>>>>
>>>>
>>>> Bob,
>>>>
>>>> Other than the hypervisor, nothing under VMware runs as a real CPU
>>>> process. It all runs using the virtualization assists. The OS drivers are
>>>> not talking to real hardware; they are talking to emulated hardware.
>>>>
>>>> Typically the OS sees a SCSI or ATA adaptor, but the real hardware will
>>>> be a fibre SAN.
>>>>
>>>> So whilst x86 does not have lock step, VMware does have lock step...
>>>>
>>>> https://searchvmware.techtarget.com/definition/VMware-vLockstep
>>>>
>>>> Dave
>>>
>>> David,
>>>
>>> The cited reference for vLockstep specifically notes that INSTRUCTION-level lockstep requires hardware support from the CPU itself. It does not address the question of randomization.
>>>
>>> If your hardware RNG is operating properly, RNGs on different CPUs will invariably generate different results. If code then executes different code paths depending directly or indirectly on the output of the RNG, the execution paths will diverge.
>>>
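A toy illustration of the divergence, in Python rather than machine code
(os.urandom stands in for a hardware RNG such as RDRAND):

    import os

    def replica():
        # Each replica draws from the kernel entropy pool, so two
        # replicas see different bytes -- like RDRAND on two CPUs.
        return "path-A" if os.urandom(1)[0] & 1 else "path-B"

    print(replica(), replica())  # frequently differ

    # A record-and-replay hypervisor must instead trap the read on the
    # primary and feed the identical value to the secondary:
    recorded = os.urandom(1)[0]  # trapped once on the primary
    print(["path-A" if recorded & 1 else "path-B" for _ in range(2)])
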
>>> Believe me, this can be quite a challenge. In my past, I spent a fair amount of time ensuring that two execution runs were identical in execution profile. It is surprising where randomness can creep in and produce interesting downstream effects.
>>>
>>> - Bob Gezelter, http://www.rlgsc.com
>>>
>> Bob,
>> As I said, almost all modern x86-64 CPUs have microcode support for
>> virtualization and lockstep.
>>
>> https://kb.vmware.com/s/article/1008027
>>
>> https://www.intel.co.uk/content/www/uk/en/virtualization/virtualization-technology/intel-virtualization-technology.html
>>
>> I can't see VMware touting something that doesn't work. I never tried it,
>> as when I was working with VMware it only supported single CPUs, and the
>> machines where I needed HA had more than one virtual CPU. I see VMware 6
>> supports up to four virtual CPUs.
>>
>> As I said, you are not running on real CPUs, so you can intercept
>> instructions that can cause different results. There is a performance
>> impact, as detailed here:
>>
>> https://www.vmware.com/files/pdf/techpaper/VMware-vSphere6-FT-arch-perf.pdf
>>
>> Dave
>
> Dave,
>
> Reference: https://en.wikipedia.org/wiki/RDRAND
>
> I think NIST and NSA will be interested if RDRAND is synchronizable between different CPUs.
>
> There is a big difference between maintaining an ongoing backup copy of memory and lockstep execution. The whitepaper appears on a quick reading to refer to the former.
>
It's lockstep.
> VMs, as opposed to emulators, have always executed most code natively, while taking an exception for instructions which must be specially processed. Some CPUs have microcode or hardware assist for virtualization; others take a conventional processor fault and use exception routines to emulate the subject instruction. This has been true since CP-67 on the System/360.
VMware won't run on a CPU without the Intel virtualization hardware support.
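A toy model of the trap-and-emulate loop described above (an illustration
only, not VMware's implementation; the instruction names are made up):

    class Trap(Exception):
        """Raised where real hardware would fault to the hypervisor."""

    PRIVILEGED = {"OUT", "RDRAND"}  # illustrative instruction names

    def guest_execute(instr):
        if instr in PRIVILEGED:
            raise Trap(instr)       # hardware fault on privileged op
        return "native: " + instr   # everything else runs directly

    def hypervisor_run(program):
        for instr in program:
            try:
                print(guest_execute(instr))
            except Trap as t:
                print("emulated by hypervisor:", t)

    hypervisor_run(["ADD", "MOV", "OUT", "ADD", "RDRAND"])
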
>
> Lockstep instruction execution with the two machines separated by a network link is inherently VERY slow. It is a matter of simple physics.
>
> - Bob Gezelter, http://www.rlgsc.com
>
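Back-of-envelope numbers for the "simple physics" point, assuming a
hypothetical 100 km of fibre between the two hosts:

    SPEED_IN_FIBRE = 2e8     # m/s, roughly two-thirds of c
    DISTANCE_M = 100_000     # 100 km between the two sites

    rtt = 2 * DISTANCE_M / SPEED_IN_FIBRE       # seconds
    print("round trip: %.0f us" % (rtt * 1e6))  # ~1000 us = 1 ms

    # A 3 GHz core retires on the order of 1e9+ instructions/second,
    # so one synchronisation round trip costs millions of instructions.
    print("instructions forgone per RTT: %.0f" % (rtt * 3e9))
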
Dave