
Alibaba Cloud said that it has been using ZooRoute in AliCloud for the last 18 months, where it has reduced outage time by 92.71%.
Nezha for network performance in high-demand VMs
Another software upgrade is helping Alibaba Cloud maintain network performance for high-demand virtual machines (VMs) without spending more on SmartNIC-accelerated virtual switches (vSwitches).
Nezha, a distributed vSwitch load-sharing system, identifies idle SmartNICs and uses them to create a remote resource pool for high-demand virtual NICs (vNICs).
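The pooling idea can be illustrated with a toy sketch. This is not Alibaba's implementation: the `SmartNIC` class, the 20% idle threshold, the `max_helpers` limit, and the function names are all hypothetical, chosen only to show how idle SmartNICs might be identified and lent to a high-demand vNIC.

```python
from dataclasses import dataclass, field


@dataclass
class SmartNIC:
    """Hypothetical model of a SmartNIC; fields are illustrative only."""
    name: str
    utilization: float  # fraction of capacity in use, 0.0 to 1.0
    hosted_vnics: list = field(default_factory=list)


def build_idle_pool(nics, idle_threshold=0.2):
    """Return SmartNICs below the (assumed) idle threshold, least loaded first."""
    idle = [nic for nic in nics if nic.utilization < idle_threshold]
    return sorted(idle, key=lambda nic: nic.utilization)


def offload_vnic(vnic_id, pool, max_helpers=2):
    """Attach a high-demand vNIC to up to max_helpers SmartNICs from the pool."""
    helpers = pool[:max_helpers]
    for nic in helpers:
        nic.hosted_vnics.append(vnic_id)
    return [nic.name for nic in helpers]


nics = [
    SmartNIC("nic-a", 0.95),  # overloaded local SmartNIC
    SmartNIC("nic-b", 0.10),
    SmartNIC("nic-c", 0.05),
    SmartNIC("nic-d", 0.60),
]
pool = build_idle_pool(nics)
assigned = offload_vnic("vnic-42", pool)
print(assigned)
```

In this sketch only `nic-b` and `nic-c` fall under the assumed threshold, so they form the remote pool that absorbs the extra load. A real system would also have to handle flow state, failures, and changing load.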
Alibaba has tested the system in its data centers for a year and said in the paper that “Nezha effectively resolves vSwitch overloads and removes it as a bottleneck.” With the number of concurrent flows improved by up to 50x, and the number of vNICs by up to 40x, the bottleneck is now the VM kernel stack, the researchers wrote.
Forrester’s Dai said that Nezha’s stateless offloading and cluster-wide pooling design is superior to solutions being pursued by rival cloud service providers.
Separately, Alibaba’s cloud computing division has also been working on another software update that will enable it to provide better network performance for AI workloads.