CN114675972B - Cloud network resource flexible scheduling method and system based on integral algorithm - Google Patents
Cloud network resource flexible scheduling method and system based on integral algorithm Download PDFInfo
- Publication number
- CN114675972B CN114675972B CN202210381458.0A CN202210381458A CN114675972B CN 114675972 B CN114675972 B CN 114675972B CN 202210381458 A CN202210381458 A CN 202210381458A CN 114675972 B CN114675972 B CN 114675972B
- Authority
- CN
- China
- Prior art keywords
- module
- integral
- virtual switch
- cpu
- consumed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5072—Grid computing
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5077—Logical partitioning of resources; Management or configuration of virtualized resources
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
- G06F2009/45595—Network integration; Enabling network access in virtual machine instances
 
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
技术领域Technical Field
本发明属于云数据中心网络技术领域,尤其涉及一种基于积分算法的虚拟化云网络资源弹性调度方法及系统。The present invention belongs to the technical field of cloud data center network, and in particular relates to a method and system for elastically scheduling virtualized cloud network resources based on an integral algorithm.
背景技术Background technique
随着信息技术的发展,工业企业信息化的趋势日益明显,众多工业企业发展已逐渐趋于智能化经营,即企业上云。企业上云是指企业通过互联网技术与云计算技术方式,连接社会性资源、共享平台及工作内容等,从而开展信息化管理基础设施建设、管理方法、业务流程等运用的全过程。随着越来越多的企业纷纷将业务上云,云服务提供商承载了越来越大的运营压力。一方面,云服务提供商的网络总流量逐渐增大;另一方面,云服务提供商出售的服务质量等级纷繁复杂。为了保障企业用户能够获得良好的服务体验,云服务提供商必须依照流量分布特征和用户预设服务质量合理地调度网络资源,否则,用户的正常性能需求将无法得到满足,还可能造成局部网络热点引发云服务器CPU过载宕机的巨大运营事故。With the development of information technology, the trend of informatization of industrial enterprises has become increasingly obvious. The development of many industrial enterprises has gradually tended towards intelligent operation, that is, enterprises going to the cloud. Enterprise going to the cloud refers to the whole process of enterprises using the Internet technology and cloud computing technology to connect social resources, shared platforms and work content, so as to carry out the construction of information management infrastructure, management methods, business processes, etc. As more and more enterprises are moving their businesses to the cloud, cloud service providers are bearing increasing operational pressure. On the one hand, the total network traffic of cloud service providers is gradually increasing; on the other hand, the service quality levels sold by cloud service providers are complex. In order to ensure that enterprise users can obtain a good service experience, cloud service providers must reasonably dispatch network resources according to traffic distribution characteristics and user preset service quality. Otherwise, the normal performance requirements of users will not be met, and it may also cause huge operational accidents such as local network hotspots causing cloud server CPU overload and downtime.
目前,主流的云服务提供商都采用虚拟大二层网络技术。云服务提供商通过虚拟化技术将物理服务器(Host)虚拟化为多台虚拟逻辑服务器(VM)。出于业务管理和部署的便利性,云服务提供商需要将海量的VM纳入同一个二层广播域中。由于传统的VLAN二层技术无法支持云数据中心动辄上万甚至十万级别的Host数量,云服务提供商开发了VXLAN、NVGRE和STT等网络协议以满足跨地域、跨中心大二层要求。实现这些协议的核心组件是虚拟交换机。虚拟交换机是一种允许在VM之间进行网络通信的软件应用程序,其通常被部署于Host的系统中,与虚拟机监视器(Hypervisor)协同工作。由于虚拟交换机通过软件抽象层与多个VM的虚拟网卡联通,是VM对外进行网络通信的直接交互组件,对VM集群的整体网络性能有举足轻重的影响,因此虚拟交换机的弹性资源保障机制是实现服务质量和性能隔离的重要技术之一。At present, mainstream cloud service providers all use virtual large Layer 2 network technology. Cloud service providers use virtualization technology to virtualize physical servers (hosts) into multiple virtual logical servers (VMs). For the convenience of business management and deployment, cloud service providers need to include a large number of VMs in the same Layer 2 broadcast domain. Since traditional VLAN Layer 2 technology cannot support tens of thousands or even hundreds of thousands of hosts in cloud data centers, cloud service providers have developed network protocols such as VXLAN, NVGRE, and STT to meet the requirements of large Layer 2 across regions and centers. The core component for implementing these protocols is the virtual switch. A virtual switch is a software application that allows network communication between VMs. It is usually deployed in the host system and works in conjunction with the virtual machine monitor (Hypervisor). Since the virtual switch is connected to the virtual network cards of multiple VMs through the software abstraction layer, it is a direct interactive component for VMs to communicate with the outside world on the network. It has a significant impact on the overall network performance of the VM cluster. Therefore, the elastic resource guarantee mechanism of the virtual switch is one of the important technologies for achieving service quality and performance isolation.
弹性资源保障机制即为动态且合理地调度虚拟交换机在处理每一台VM网络报文所消耗的计算资源。目前,虚拟交换机面临着以下几个挑战:首先,由于在同一台Host中部署的VM往往有着不同的服务规格指标,需要虚拟交换机对各个VM进行位速率(BPS)及包速率(PPS)等多个维度的限速;其次,虚拟交换机需要保证隔离性,即当其中一台VM产生异常流量时,同一Host上的其他VM的网络服务质量不受影响;最后,虚拟交换机需要尽可能地提高Host计算资源的整体利用率,避免空闲资源无法得到有效利用。The elastic resource guarantee mechanism is to dynamically and reasonably schedule the computing resources consumed by the virtual switch in processing each VM network message. At present, virtual switches face the following challenges: First, because VMs deployed in the same host often have different service specifications, the virtual switch needs to limit the speed of each VM in multiple dimensions such as bit rate (BPS) and packet rate (PPS); second, the virtual switch needs to ensure isolation, that is, when one of the VMs generates abnormal traffic, the network service quality of other VMs on the same host is not affected; finally, the virtual switch needs to improve the overall utilization of the host computing resources as much as possible to avoid idle resources from being effectively utilized.
目前业界内已有多种关于弹性资源保障的算法与技术。例如基于Fair sharing模型的FairCloud和NetShare;基于Hose模型的ElasticSwitch和Silo等等。此外,Google也提出了基于VM对应的SLA分配虚拟交换机CPU周期的PicNIC方案。尽管上述方案在不同程度上改善了资源争用和服务质量,但目前这些方法仍然缺乏细粒度的调度手段。一个核心的问题是,如何在闲时低负载和忙时高负载两种不同工况下动态无缝地切换。There are many algorithms and technologies for elastic resource guarantee in the industry. For example, FairCloud and NetShare based on the Fair sharing model; ElasticSwitch and Silo based on the Hose model, etc. In addition, Google has also proposed a PicNIC solution that allocates virtual switch CPU cycles based on the SLA corresponding to the VM. Although the above solutions have improved resource contention and service quality to varying degrees, these methods currently still lack fine-grained scheduling methods. A core issue is how to dynamically and seamlessly switch between two different working conditions: low load during idle time and high load during busy time.
发明内容Summary of the invention
鉴于上述,本发明的目的是提供一种基于积分算法的云网络资源弹性调度方法及系统,以保证多个VM网络趋于合理水平的资源利用。In view of the above, an object of the present invention is to provide a cloud network resource elastic scheduling method and system based on an integral algorithm to ensure that resource utilization of multiple VM networks tends to a reasonable level.
为实现上述发明目的,实施例提供的一种基于积分算法的云网络资源弹性调度系统,包括虚拟交换机模块、数据采集模块、积分计算模块以及资源调度模块;To achieve the above-mentioned purpose of the invention, an embodiment provides a cloud network resource elastic scheduling system based on an integral algorithm, comprising a virtual switch module, a data acquisition module, an integral calculation module and a resource scheduling module;
所述虚拟交换机模块作为承载VM网络功能的核心模块,用于根据CPU周期占比为VM提供网络服务;The virtual switch module is used as a core module that carries VM network functions and is used to provide network services for VMs according to the CPU cycle ratio;
所述数据采集模块用于从虚拟交换机模块中收集每个工作时间片内,各VM所消耗的虚拟交换机的实际CPU时钟周期数及周期占比,并传输至积分计算模块;The data collection module is used to collect the actual number of CPU clock cycles and cycle proportion of the virtual switch consumed by each VM in each working time slice from the virtual switch module, and transmit it to the integral calculation module;
所述积分计算模块用于根据输入的实际CPU时钟周期数及周期占比更新每个VM的积分值,并将更新的积分值传输至资源调度模块;The integral calculation module is used to update the integral value of each VM according to the input actual CPU clock cycle number and cycle proportion, and transmit the updated integral value to the resource scheduling module;
所述资源调度模块用于根据输入的积分值动态限制下一工作时间片内每个VM所消耗的虚拟交换机CPU周期数和周期占比,实现资源分配。The resource scheduling module is used to dynamically limit the number of virtual switch CPU cycles and cycle proportion consumed by each VM in the next working time slice according to the input integral value, so as to realize resource allocation.
在一个实施例中,所述虚拟交换机模块包括虚拟网卡接口模块、Netframe转发模块;In one embodiment, the virtual switch module includes a virtual network card interface module and a Netframe forwarding module;
所述虚拟网卡接口模块用于提供与VM进行网络数据通信的接口;The virtual network card interface module is used to provide an interface for network data communication with the VM;
所述Netframe转发模块是基于DPDK的用户态网络协议栈组件,作为数据转发的核心模块,用于实现了两层MAC地址转发,即以Netfilter Hook的方式提供给虚拟网卡接口模块使用。The Netframe forwarding module is a user-mode network protocol stack component based on DPDK. As a core module for data forwarding, it is used to implement two-layer MAC address forwarding, that is, it is provided to the virtual network card interface module in the form of Netfilter Hook.
在一个实施例中,所述数据采集模块从Netframe转发模块中收集每个工作时间片内,各VM所消耗的虚拟交换机CPU时钟周期数及占总周期数的比例。In one embodiment, the data collection module collects the number of virtual switch CPU clock cycles consumed by each VM in each working time slice and the proportion of the total number of cycles from the Netframe forwarding module.
在一个实施例的所述积分计算模块中,更新每个VM的积分值的过程包括:In the integral calculation module of one embodiment, the process of updating the integral value of each VM includes:
为每个VM预设积分参数BASE、MAX和MIN,其中,BASE为基础消耗CPU周期占比,MAX和MIN分别为最大和最小消耗CPU周期占比;Preset integral parameters BASE, MAX and MIN for each VM, where BASE is the basic consumption CPU cycle ratio, MAX and MIN are the maximum and minimum consumption CPU cycle ratios respectively;
针对每个VM,比较消耗的CPU周期占比是否小于BASE值,在CPU周期占比小于BASE值时,积分增加X1;在CPU周期占比大于BASE值时,积分减少X2,以得到积分更新结果,其中,X1和X2为预设的增加比例和减少比例,其中,X1小于X2,表示积分的积累速度要慢于积分的消耗速度。For each VM, compare whether the consumed CPU cycle ratio is less than the BASE value. When the CPU cycle ratio is less than the BASE value, the points are increased by X1; when the CPU cycle ratio is greater than the BASE value, the points are reduced by X2 to obtain the points update result, where X1 and X2 are the preset increase ratio and decrease ratio. If X1 is less than X2, it means that the accumulation speed of points is slower than the consumption speed of points.
在一个实施例的所述积分计算模块中,当积分值为0时,不进行积分的更新,保持积分值为0。In the integral calculation module of one embodiment, when the integral value is 0, the integral is not updated and remains 0.
在一个实施例的所述资源调度模块中,若输入的积分值为0,则将下一秒内VM所消耗的最大CPU周期占比设为BASE;若积分值大于0,则将下一秒内VM所消耗的最大CPU占比设为MAX;同时,必须保证下一秒内VM所消耗的最小CPU周期占比为MIN。In the resource scheduling module of one embodiment, if the input integral value is 0, the maximum CPU cycle ratio consumed by the VM in the next second is set to BASE; if the integral value is greater than 0, the maximum CPU cycle ratio consumed by the VM in the next second is set to MAX; at the same time, it must be ensured that the minimum CPU cycle ratio consumed by the VM in the next second is MIN.
在一个实施例中,所述资源调度模块根据输入的积分值动态控制的每个VM所消耗的虚拟交换机CPU周期占比传入至所述虚拟交换机模块;In one embodiment, the resource scheduling module dynamically controls the proportion of virtual switch CPU cycles consumed by each VM according to the input integral value and transmits it to the virtual switch module;
所述虚拟交换机模块根据接收的CPU周期占比为VM提供相应规格的网络服务。The virtual switch module provides network services of corresponding specifications for the VM according to the received CPU cycle ratio.
在一个实施例的所述积分计算模块中,为每个VM预设的初始积分值根据用户的服务指标指定,取值为100~500。In the score calculation module of one embodiment, the initial score value preset for each VM is specified according to the user's service index and is in the range of 100 to 500.
为实现上述发明目的,实施例提供的基于积分算法的云网络资源弹性调度方法,所述方法采用上述云网络资源弹性调度系统,所述调度方法包括以下步骤:To achieve the above-mentioned purpose of the invention, the embodiment provides a cloud network resource elastic scheduling method based on an integral algorithm, the method adopts the above-mentioned cloud network resource elastic scheduling system, and the scheduling method includes the following steps:
步骤1,利用虚拟交换机模块为每个VM根据CPU周期占比提供网络服务;Step 1, using the virtual switch module to provide network services for each VM according to the CPU cycle ratio;
步骤2,利用数据采集模块从虚拟交换机模块中收集每个工作时间片内,各VM所消耗的虚拟交换机的实际CPU时钟周期数及周期占比,并传输至积分计算模块;Step 2, using the data acquisition module to collect the actual number of CPU clock cycles and cycle proportion of the virtual switch consumed by each VM in each working time slice from the virtual switch module, and transmit it to the integral calculation module;
步骤3,利用积分计算模块输入的实际CPU时钟周期数及周期占比更新每个VM的积分值,并将更新的积分值传输至资源调度模块;Step 3: Update the integral value of each VM using the actual number of CPU clock cycles and cycle proportion input by the integral calculation module, and transmit the updated integral value to the resource scheduling module;
步骤4,利用资源调度模块根据输入的积分值动态计算下一工作时间片内每个VM所消耗的虚拟交换机CPU周期数和周期占比,并传输至虚拟交换机模块。Step 4: Utilize the resource scheduling module to dynamically calculate the number of virtual switch CPU cycles and cycle proportion consumed by each VM in the next working time slice according to the input integral value, and transmit the calculations to the virtual switch module.
与现有技术相比,本发明具有的有益效果至少包括:Compared with the prior art, the present invention has the following beneficial effects:
通过积分算法实现了VM消耗CPU周期占比与VM规格绑定,避免并发多种指标压力造成Host资源争抢,有效地提高了Host资源的整体利用率;The integration algorithm is used to bind the proportion of VM CPU cycles to VM specifications, avoiding competition for Host resources caused by pressure from multiple concurrent indicators, and effectively improving the overall utilization of Host resources.
通过积分算法对每个VM消耗CPU周期占比的动态调节,实现了统一维度的网络突发传输能力,并能够在此情景下实现较好的性能隔离。By dynamically adjusting the proportion of CPU cycles consumed by each VM through an integral algorithm, a unified network burst transmission capacity is achieved, and better performance isolation can be achieved in this scenario.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图做简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动前提下,还可以根据这些附图获得其他附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings required for use in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without creative work.
图1是实施例提供的基于积分算法的云网络资源弹性调度系统的结构示意图;FIG1 is a schematic diagram of the structure of a cloud network resource elastic scheduling system based on an integral algorithm provided in an embodiment;
图2是实施例提供的积分更新过程和基于积分更新值的CPU周期占比调控过程示意图;2 is a schematic diagram of an integral update process and a CPU cycle ratio control process based on an integral update value provided by an embodiment;
图3是实施例提供的云网络资源弹性调度系统模式工况切换示意图。FIG3 is a schematic diagram of a mode switching of a cloud network resource elastic scheduling system provided in an embodiment.
具体实施方式Detailed ways
为使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例对本发明进行进一步的详细说明。应当理解,此处所描述的具体实施方式仅仅用以解释本发明,并不限定本发明的保护范围。In order to make the purpose, technical solution and advantages of the present invention more clearly understood, the present invention is further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific implementation methods described herein are only used to explain the present invention and do not limit the scope of protection of the present invention.
通过对云数据中心网络流量模式的观察,基于“闲时超卖,忙时低保”的调度指导思想,实施例提供了一种基于积分算法的云网络资源弹性调度系统和方法,通过引入积分计算的机制,构建VM带宽使用与虚拟交换机CPU消耗的对应积分,合理分配下一阶段的虚拟交换机网络资源,保障整个数据中心上的云网络负载趋于合理水平。By observing the network traffic pattern of cloud data centers and based on the scheduling guiding ideology of "overselling during idle times and underselling during busy times", an embodiment provides a cloud network resource elastic scheduling system and method based on an integral algorithm. By introducing an integral calculation mechanism, corresponding integrals of VM bandwidth usage and virtual switch CPU consumption are constructed, and virtual switch network resources in the next stage are reasonably allocated to ensure that the cloud network load on the entire data center tends to a reasonable level.
图1是实施例提供的基于积分算法的云网络资源弹性调度系统的结构示意图。如图1所示,实施例提供的基于积分算法的云网络资源弹性调度系统,包括虚拟交换机模块、数据采集模块、积分计算模块以及资源调度模块,通过这些模块的协同作用动态分配物理服务器(Host)上虚拟逻辑服务器(VM)的带宽资源。Fig. 1 is a schematic diagram of the structure of the cloud network resource elastic scheduling system based on the integral algorithm provided in the embodiment. As shown in Fig. 1, the cloud network resource elastic scheduling system based on the integral algorithm provided in the embodiment includes a virtual switch module, a data acquisition module, an integral calculation module and a resource scheduling module, and the bandwidth resources of the virtual logical server (VM) on the physical server (Host) are dynamically allocated through the synergy of these modules.
其中,虚拟交换机模块作为承载VM网络功能的核心模块,用于根据CPU周期占比为Host上的VM提供相应规格的网络服务。具体地,虚拟交换机模块包括虚拟网卡接口模块和Netframe转发模块,虚拟网卡接口模块用于提供与VM进行网络数据通信的接口;Netframe转发模块是基于DPDK的用户态网络协议栈组件,作为数据转发的核心模块,用于实现了两层MAC地址转发,即以Netfilter Hook的方式提供给虚拟网卡接口模块使用。该虚拟网卡接口模块和Netframe转发模块均属于软件架构。Among them, the virtual switch module is the core module that carries the VM network function, and is used to provide network services of corresponding specifications for the VM on the Host according to the CPU cycle ratio. Specifically, the virtual switch module includes a virtual network card interface module and a Netframe forwarding module. The virtual network card interface module is used to provide an interface for network data communication with the VM; the Netframe forwarding module is a user-mode network protocol stack component based on DPDK. As the core module for data forwarding, it is used to implement two-layer MAC address forwarding, that is, it is provided to the virtual network card interface module in the form of Netfilter Hook. The virtual network card interface module and the Netframe forwarding module both belong to the software architecture.
数据采集模块用于从虚拟交换机模块包括的Netframe转发模块中收集每个工作时间片内,各VM所消耗的虚拟交换机的实际CPU时钟周期数及周期占比,并传输至积分计算模块。其中,工作时间片是指一段工作时间,可以为1秒。The data collection module is used to collect the actual number of CPU clock cycles and cycle proportions of the virtual switch consumed by each VM in each working time slice from the Netframe forwarding module included in the virtual switch module, and transmit it to the integral calculation module. Among them, the working time slice refers to a period of working time, which can be 1 second.
积分计算模块用于根据输入的实际CPU时钟周期数及周期占比更新每个VM的积分值,并将更新的积分值传输至资源调度模块。具体地,更新每个VM的积分值的过程包括:The integral calculation module is used to update the integral value of each VM according to the input actual CPU clock cycle number and cycle proportion, and transmit the updated integral value to the resource scheduling module. Specifically, the process of updating the integral value of each VM includes:
为每个VM预设积分参数BASE、MAX和MIN,其中,BASE是默认服务质量下,VM消耗虚拟交换机CPU的基础周期比例,例如,可以将BASE定为0.1,此时的BASE积分对应1Gbps;MAX是VM在突发传输时,VM消耗虚拟交换机CPU的最大周期比例,例如,可以将MAX定为0.3,此时的MAX积分对应3Gbps,其代表了VM在积分可用的情况下网络性能上限;MIN是VM在高负载或积分耗尽的情况下所能消耗的虚拟交换机CPU的最小周期比例,例如,可以将MIN定为0.02,对应200Mbps,其代表了VM的最低服务质量。需要注意的是,BASE的具体数值与具体物理服务器的硬件规格相关,需要进行基准测试后标定。The integral parameters BASE, MAX and MIN are preset for each VM, where BASE is the basic cycle ratio of the virtual switch CPU consumed by the VM under the default service quality. For example, BASE can be set to 0.1, and the BASE integral at this time corresponds to 1Gbps; MAX is the maximum cycle ratio of the virtual switch CPU consumed by the VM during burst transmission. For example, MAX can be set to 0.3, and the MAX integral at this time corresponds to 3Gbps, which represents the upper limit of the network performance of the VM when the integral is available; MIN is the minimum cycle ratio of the virtual switch CPU that the VM can consume under high load or when the integral is exhausted. For example, MIN can be set to 0.02, corresponding to 200Mbps, which represents the minimum service quality of the VM. It should be noted that the specific value of BASE is related to the hardware specifications of the specific physical server and needs to be calibrated after benchmarking.
积分计算模块同时维护所服务的每个VM积分,积分初始值定为INIT,INIT依据用户的服务指标制定,一般为100~500。如图2所示,针对每个VM,比较消耗的CPU周期占比是否小于BASE值,在CPU周期占比小于BASE值时,积分增加X1;在CPU周期占比大于BASE值时,积分减少X2,以得到积分更新结果,其中,X1和X2为预设的增加比例和减少比例,其中,X1小于X2,X1的值一般定为0.1,X2的值定为1.0,表示积分的积累速度要慢于积分的消耗速度。The integral calculation module also maintains the integral of each VM served. The initial integral value is set as INIT, which is set according to the user's service indicators and is generally 100 to 500. As shown in Figure 2, for each VM, the CPU cycle ratio consumed is compared to see if it is less than the BASE value. When the CPU cycle ratio is less than the BASE value, the integral is increased by X1; when the CPU cycle ratio is greater than the BASE value, the integral is reduced by X2 to obtain the integral update result, where X1 and X2 are the preset increase and decrease ratios, and where X1 is less than X2, the value of X1 is generally set to 0.1, and the value of X2 is set to 1.0, indicating that the accumulation speed of integral is slower than the consumption speed of integral.
还需要特殊说明的是,积分计算模块中,每个工作时间片获得积分的量当原积分值为0时,不做处理,保持为0。It should also be noted that in the integral calculation module, when the original integral value of each working time slice is 0, no processing is performed and it remains 0.
资源调度模块用于根据输入的积分值动态限制下一工作时间片内每个VM所消耗的虚拟交换机CPU周期数和周期占比,如图2所示,具体过程包括:若输入的积分值为0,则将下一秒内VM所消耗的最大CPU周期占比设为BASE;若积分值大于0,则将下一秒内VM所消耗的最大CPU占比设为MAX;同时,必须保证下一秒内VM所消耗的最小CPU周期占比为MIN。动态控制的下一工作时间片内每个VM所消耗的虚拟交换机CPU周期数和周期占比传入至虚拟交换机模块,虚拟交换机模块根据接收的CPU周期占比为VM提供相应规格的网络服务。The resource scheduling module is used to dynamically limit the number of virtual switch CPU cycles and cycle proportion consumed by each VM in the next working time slice according to the input integral value, as shown in Figure 2. The specific process includes: if the input integral value is 0, the maximum CPU cycle proportion consumed by the VM in the next second is set to BASE; if the integral value is greater than 0, the maximum CPU cycle proportion consumed by the VM in the next second is set to MAX; at the same time, it must be ensured that the minimum CPU cycle proportion consumed by the VM in the next second is MIN. The number of virtual switch CPU cycles and cycle proportion consumed by each VM in the next working time slice under dynamic control are transmitted to the virtual switch module, and the virtual switch module provides network services of corresponding specifications for the VM according to the received CPU cycle proportion.
实施例还提供了一种采用上述云网络资源弹性调度系统的云网络资源弹性调度方法,包括以下步骤:The embodiment also provides a cloud network resource elastic scheduling method using the above cloud network resource elastic scheduling system, including the following steps:
步骤1,利用虚拟交换机模块为每个VM根据CPU周期占比提供网络服务。Step 1: Use the virtual switch module to provide network services for each VM based on the CPU cycle ratio.
实施例中,各个模块在程序加载后,完成参数初始化,将各VM所对应的积分设定为初始值;开始执行程序进程,为VM提供网络服务。In the embodiment, after the program is loaded, each module completes parameter initialization, sets the points corresponding to each VM to the initial value, starts executing the program process, and provides network services for the VM.
步骤2,利用数据采集模块从虚拟交换机模块中收集每个工作时间片内,各VM所消耗的虚拟交换机的实际CPU时钟周期数及周期占比,并传输至积分计算模块。Step 2: Use the data collection module to collect the actual number of CPU clock cycles and cycle proportion of the virtual switch consumed by each VM in each working time slice from the virtual switch module, and transmit it to the integral calculation module.
实施例中,在一个工作时间片(一秒)结束时,向Netframe转发模块发出查询请求;Netframe转发模块统计上一秒内各个VM所消耗的CPU周期数,并统计过去一秒内的总CPU周期数(包含空闲CPU周期),将结果返回到数据采集模块。In the embodiment, at the end of a working time slice (one second), a query request is sent to the Netframe forwarding module; the Netframe forwarding module counts the number of CPU cycles consumed by each VM in the last second, and counts the total number of CPU cycles (including idle CPU cycles) in the past second, and returns the result to the data acquisition module.
步骤3,利用积分计算模块输入的实际CPU时钟周期数及周期占比更新每个VM的积分值,并将更新的积分值传输至资源调度模块。Step 3: Use the actual number of CPU clock cycles and cycle proportion input by the integral calculation module to update the integral value of each VM, and transmit the updated integral value to the resource scheduling module.
实施例中,为每个VM预设积分参数BASE、MAX和MIN,针对每个VM,比较消耗的CPU周期占比是否小于BASE值,在CPU周期占比小于BASE值时,积分增加X1;在CPU周期占比大于BASE值时,积分减少X2,以得到积分更新结果,其中,X1和X2为预设的增加比例和减少比例,其中,X1小于X2,表示积分的积累速度要慢于积分的消耗速度。In the embodiment, integral parameters BASE, MAX and MIN are preset for each VM. For each VM, it is compared whether the proportion of consumed CPU cycles is less than the BASE value. When the proportion of CPU cycles is less than the BASE value, the integral is increased by X1; when the proportion of CPU cycles is greater than the BASE value, the integral is reduced by X2 to obtain an integral update result, wherein X1 and X2 are preset increase ratios and decrease ratios, wherein X1 is less than X2, indicating that the accumulation speed of integrals is slower than the consumption speed of integrals.
步骤4,利用资源调度模块根据输入的积分值动态计算下一工作时间片内每个VM所消耗的虚拟交换机CPU周期数和周期占比,并传输至虚拟交换机模块。Step 4: Utilize the resource scheduling module to dynamically calculate the number of virtual switch CPU cycles and cycle proportion consumed by each VM in the next working time slice according to the input integral value, and transmit the calculations to the virtual switch module.
实施例中,接收各个VM所对应的积分,若输入的积分值为0,则将下一秒内VM所消耗的最大CPU周期占比设为BASE;若积分值大于0,则将下一秒内VM所消耗的最大CPU占比设为MAX;同时,必须保证下一秒内VM所消耗的最小CPU周期占比为MIN。In the embodiment, the points corresponding to each VM are received. If the input integral value is 0, the maximum CPU cycle ratio consumed by the VM in the next second is set to BASE; if the integral value is greater than 0, the maximum CPU cycle ratio consumed by the VM in the next second is set to MAX; at the same time, it must be ensured that the minimum CPU cycle ratio consumed by the VM in the next second is MIN.
如图3所示,基于积分算法的云网络资源弹性调度方法及系统设计类调度情景:完全轻负载、部分高负载、完全高负载。As shown in Figure 3, the cloud network resource elastic scheduling method based on the integral algorithm and the system design scheduling scenarios: completely light load, partially high load, and completely high load.
(一)完全轻负载(I) Completely light load
该工况对应图3的状态1。在此工况下,所有VM的实际网络开销均为达到BASE值,即各台VM的网络速率均维持在预设值左右,维持一段时间后,各个VM均累积了一定数量的积分,VM可以获得超过其BASE所限定的网络资源(但不超过MAX)。此时,整个系统会容忍适度的突发传输和高速流,用以提升系统的整体资源利用率。This condition corresponds to state 1 in Figure 3. Under this condition, the actual network overhead of all VMs reaches the BASE value, that is, the network rate of each VM is maintained at around the preset value. After a period of time, each VM has accumulated a certain number of points, and the VM can obtain network resources exceeding the limit of its BASE (but not exceeding MAX). At this time, the entire system will tolerate moderate burst transmission and high-speed flow to improve the overall resource utilization of the system.
(二)部分高负载(II) Partial high load
该工况对应图3的状态2。在此工况下,部分VM的实际网络开销达到了BASE值,而另一部分的VM在进行突发传输或高速流,所有VM的总网络容量小于或接近Host的总容量。对于正在进行突发传输的、实际网络开销大于BASE的VM,会消耗先前累积的积分,因此在一定时间内,它的网络速率将会被虚拟交换机限制到BASE值附近,实现了限速压制功能。同时,由于积分对应着虚拟交换机的计算资源(CPU周期数),所有的VM都能获得与积分成比例的网络资源,故而VM仍能够维持性能隔离。This condition corresponds to state 2 in Figure 3. Under this condition, the actual network overhead of some VMs reaches the BASE value, while another part of the VMs are performing burst transmission or high-speed flow, and the total network capacity of all VMs is less than or close to the total capacity of the Host. For VMs that are performing burst transmission and whose actual network overhead is greater than BASE, the previously accumulated points will be consumed. Therefore, within a certain period of time, its network rate will be limited by the virtual switch to near the BASE value, realizing the speed limit suppression function. At the same time, since the points correspond to the computing resources (CPU cycles) of the virtual switch, all VMs can obtain network resources proportional to the points, so the VMs can still maintain performance isolation.
(三)完全高负载(III) Fully high load
该工况对应图3的状态3。在此工况下,所有VM的总网络容量高于Host的总容量,因此必然出现VM之间的网络资源竞争。此时,积分算法不再适用,虚拟交换机会自动按照每台VM所对应的MIN值进行网络资源的调配,以保证VM网络服务的可用性。若一段时间后网络热点消失,则系统自动回退到上述两种工况;若一段时间后系统仍存在网络热点,则会主动地讲Host上的部分VM迁移到其他Host上,以避免CPU过热导致网络故障。This condition corresponds to state 3 in Figure 3. In this condition, the total network capacity of all VMs is higher than the total capacity of the Host, so network resource competition between VMs is inevitable. At this time, the integral algorithm is no longer applicable, and the virtual switch will automatically allocate network resources according to the MIN value corresponding to each VM to ensure the availability of VM network services. If the network hotspot disappears after a period of time, the system will automatically fall back to the above two conditions; if the system still has network hotspots after a period of time, it will actively migrate some VMs on the Host to other Hosts to avoid CPU overheating and network failures.
以上所述的具体实施方式对本发明的技术方案和有益效果进行了详细说明,应理解的是以上所述仅为本发明的最优选实施例,并不用于限制本发明,凡在本发明的原则范围内所做的任何修改、补充和等同替换等,均应包含在本发明的保护范围之内。The specific implementation methods described above have described in detail the technical solutions and beneficial effects of the present invention. It should be understood that the above is only the most preferred embodiment of the present invention and is not intended to limit the present invention. Any modifications, supplements and equivalent substitutions made within the scope of the principles of the present invention should be included in the protection scope of the present invention.
Claims (6)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202210381458.0A CN114675972B (en) | 2022-04-12 | 2022-04-12 | Cloud network resource flexible scheduling method and system based on integral algorithm | 
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202210381458.0A CN114675972B (en) | 2022-04-12 | 2022-04-12 | Cloud network resource flexible scheduling method and system based on integral algorithm | 
Publications (2)
| Publication Number | Publication Date | 
|---|---|
| CN114675972A CN114675972A (en) | 2022-06-28 | 
| CN114675972B true CN114675972B (en) | 2024-07-05 | 
Family
ID=82077686
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date | 
|---|---|---|---|
| CN202210381458.0A Active CN114675972B (en) | 2022-04-12 | 2022-04-12 | Cloud network resource flexible scheduling method and system based on integral algorithm | 
Country Status (1)
| Country | Link | 
|---|---|
| CN (1) | CN114675972B (en) | 
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN115333948A (en) * | 2022-08-23 | 2022-11-11 | 四川通信科研规划设计有限责任公司 | Method for improving network utilization rate based on cloud computing and transmission network | 
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN104503838B (en) * | 2014-11-23 | 2017-06-27 | 华中科技大学 | A kind of virtual cpu dispatching method | 
| JP6692994B2 (en) * | 2017-06-26 | 2020-05-13 | 株式会社日立製作所 | Virtual machine management apparatus and management method | 
| CN111399970B (en) * | 2019-01-02 | 2023-04-07 | 中国移动通信有限公司研究院 | A reserved resource management method, device and storage medium | 
| CN112783658A (en) * | 2021-02-01 | 2021-05-11 | 中科视拓(南京)科技有限公司 | Server computing resource pooling and scheduling system | 
| CN113515324B (en) * | 2021-07-16 | 2024-06-11 | 广东工业大学 | Collaborative edge computing method and device based on unloading decision of directed acyclic graph, electronic equipment and storage medium | 
| CN113821310B (en) * | 2021-11-19 | 2022-05-06 | 阿里云计算有限公司 | Data processing method, programmable network card device, physical server and storage medium | 
| CN114217974B (en) * | 2021-12-20 | 2025-01-07 | 哈尔滨工业大学 | A resource management method and system in a cloud computing environment | 
- 
        2022
        - 2022-04-12 CN CN202210381458.0A patent/CN114675972B/en active Active
 
Non-Patent Citations (1)
| Title | 
|---|
| Chengkun Wei."Achelous: Enabling Programmability, Elasticity, and Reliability in Hyperscale Cloud Networks".《ACM SIGCOMM'23: Proceedings of the ACM SIGCOMM 2023 Conference》.2023,第769-782页. * | 
Also Published As
| Publication number | Publication date | 
|---|---|
| CN114675972A (en) | 2022-06-28 | 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| US10445850B2 (en) | Technologies for offloading network packet processing to a GPU | |
| US9632839B2 (en) | Dynamic virtual machine consolidation | |
| CN102739798B (en) | Cloud platform resource scheduling method with network sensing function | |
| US11099906B2 (en) | Handling tenant requests in a system that uses hardware acceleration components | |
| CN102724277B (en) | The method of live migration of virtual machine and deployment, server and group system | |
| US11567556B2 (en) | Platform slicing of central processing unit (CPU) resources | |
| CN110896373A (en) | Techniques for dynamically selecting resources for virtual switching | |
| CN103747107B (en) | A kind of compatible cloud operating platform and its implementation | |
| CN112003797B (en) | Method, system, terminal and storage medium for improving performance of virtualized DPDK network | |
| CN109522090B (en) | Resource scheduling method and device | |
| US20160308649A1 (en) | Providing Services in a System having a Hardware Acceleration Plane and a Software Plane | |
| CN110798412A (en) | Multicast service processing method, device, cloud platform, equipment and readable storage medium | |
| CN109039933B (en) | A cluster network optimization method, device, equipment and medium | |
| CN116846978A (en) | Resource scheduling method, application identification method and related equipment of cloud computing system | |
| CN114675972B (en) | Cloud network resource flexible scheduling method and system based on integral algorithm | |
| JP2011203810A (en) | Server, computer system, and virtual computer management method | |
| WO2023138513A1 (en) | Resource processing method and storage medium | |
| CN103747439B (en) | Wireless controller device, wireless authentication processing method, system, networking | |
| TWI644540B (en) | Flow meter flexible cutting system for virtual network in multi-tenant software-defined network | |
| Suo et al. | Characterizing networking performance and interrupt overhead of container overlay networks | |
| CN115378885B (en) | Virtual machine service network bandwidth management method and device under super fusion architecture | |
| Al‐Jarrah et al. | Integrated network and hosts energy management for cloud data centers | |
| Li et al. | Fair and near-optimal coflow scheduling without prior knowledge of coflow size. | |
| CN101197782B (en) | Control method and system for network appliance based on multi-core processor | |
| Guo et al. | Distributed Caching and Lightpath Provisioning in Multi-access Edge Computing based Elastic Optical Networks | 
Legal Events
| Date | Code | Title | Description | 
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |