CN104902001B

CN104902001B - Web request load-balancing method based on operating system virtualization

Info

Publication number: CN104902001B
Application number: CN201510160526.0A
Authority: CN
Inventors: 黄彬彬; 张雪鹏; 俞东进
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2015-04-07
Filing date: 2015-04-07
Publication date: 2018-04-06
Anticipated expiration: 2035-04-07
Also published as: CN104902001A

Abstract

The invention discloses a Web request load-balancing method based on operating system virtualization. The method first collects and normalizes server resource information; second, it collects dynamic information about servers and service instances; it then calculates the final weight of every service instance replica of every service from the server resource information and the periodically collected dynamic information; finally, it distributes requests by weighted round-robin (WRR). The invention effectively prevents different service instance replicas on the same server from simultaneously receiving a large number of Web requests because their host server is lightly loaded, which would overload that server. It thereby achieves load balancing among service instance replicas and servers, improves the concurrency capability of the Web cluster system, and reduces the average response time of requests.

Description

Web request load balancing method based on operating system virtualization

Technical Field

The invention belongs to the technical field of Web cluster load balancing, and in particular relates to a Web request load-balancing method based on operating system virtualization.

Background Art

With the rapid development of Internet applications such as social networks and e-commerce, network servers face two problems: (1) the number of visits grows exponentially; (2) transaction processing becomes more complex. To address these problems and provide users with a high-performance network service environment, distributed Web server systems (Web clusters) have emerged to replace single high-performance Web servers. In a Web cluster, each type of service has multiple service instance replicas, and the replicas of the same service are hosted by multiple different servers. An efficient load-balancing strategy is the key to distributing user requests reasonably and evenly among the service instance replicas, keeping the load balanced across servers and replicas, and improving the concurrency capability and resource utilization of the Web cluster system.

Traditional Web clusters host service instance replicas on physical machines or virtual machines, and distribute user requests among those machines according to a load-balancing policy. With the development of cloud computing, however, the Docker container, which is based on operating system virtualization, has emerged as a new basic unit for hosting service instances. Because Docker uses operating-system-level virtualization, it can, unlike traditional virtual machines based on full virtualization, allocate resources elastically at run time: when no task is executing in a Docker container, the container occupies no CPU resources; when a task is executing, the container occupies the CPU resources it needs in proportion to its CPU weight. This improves the utilization of server resources. This feature of Docker makes the load-balancing strategies of traditional Web clusters no longer applicable, so a reasonable and balanced Web request load-balancing method based on operating system virtualization is needed.

Summary of the Invention

To address the deficiencies of the prior art, the present invention provides a Web request load-balancing method based on operating system virtualization.

The specific steps of the method are as follows:

Step (1): Establish a server resource state information list. The server set S in the Web cluster is expressed as:

S = {s₁, s₂, s₃, ..., s_j, ..., s_n}

where s_j (1≤j≤n) denotes a server in the Web cluster and n denotes the total number of servers in the cluster. The total resources P_j of server s_j are expressed as:

P_j = (P_{j_cpu}, P_{j_memory}, P_{j_io}, P_{j_network})

where P_{j_cpu} denotes the CPU computing capability of server s_j, P_{j_memory} its memory processing capability, P_{j_io} its hard disk I/O capability, and P_{j_network} its network throughput. To eliminate the influence of server heterogeneity and of differences among resource types, the Max-Min method is used to normalize P_j. The normalized total resources PO_j of server s_j are expressed as:

PO_j = (PO_{j_cpu}, PO_{j_memory}, PO_{j_io}, PO_{j_network})

Step (2): Establish a service instance resource state information list. The set F of services that the Web cluster provides externally is expressed as:

F = {f₁, f₂, f₃, ..., f_i, ..., f_m}

where f_i (1≤i≤m) denotes the i-th service provided by the cluster and m denotes the total number of service types provided. The set F_i of service instance replicas of service f_i is expressed as:

F_i = {f_i1, f_i2, f_i3, ..., f_ik, ..., f_il}

where f_ik (1≤k≤l) denotes an instance replica of service f_i and l denotes the number of instance replicas of f_i. The resource state information ST_ik of service instance f_ik is expressed as:

ST_ik = (L_{ik}^{j}, cpu_share_ik, memory_{ik_total})

where L_{ik}^{j} denotes the host s_j on which instance replica f_ik of service f_i is located, cpu_share_ik denotes the CPU weight of f_ik, and memory_{ik_total} denotes the maximum memory that f_ik may occupy.
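The two state lists of steps (1) and (2) can be pictured as plain records. The following Python sketch is only an illustration: the class names, field names, units, and sample values are assumptions and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ServerResources:
    """Static capacity vector P_j of one server s_j (units are assumptions)."""
    name: str
    cpu: float       # P_{j_cpu}
    memory: float    # P_{j_memory}
    io: float        # P_{j_io}
    network: float   # P_{j_network}


@dataclass(frozen=True)
class InstanceState:
    """Static record ST_ik of one service instance replica f_ik."""
    service: str        # the service f_i the replica belongs to
    replica: int        # index k of the replica within the service
    host: str           # name of the hosting server s_j
    cpu_share: int      # cpu_share_ik
    memory_total: int   # memory_{ik_total}, in bytes in this sketch


def replicas_on(host: str, instances: List[InstanceState]) -> List[InstanceState]:
    """Return the replicas whose host field names the given server."""
    return [inst for inst in instances if inst.host == host]


# Hypothetical sample data, invented for illustration only.
servers = [ServerResources("s1", cpu=8.0, memory=16.0, io=200.0, network=1000.0)]
instances = [
    InstanceState("f1", 1, host="s1", cpu_share=1024, memory_total=512 * 2**20),
    InstanceState("f2", 1, host="s1", cpu_share=512, memory_total=256 * 2**20),
    InstanceState("f1", 2, host="s2", cpu_share=1024, memory_total=512 * 2**20),
]
```

Keeping the host name inside each instance record makes it cheap to group replicas by server, which the later per-server sum of CPU weights relies on.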

Step (3): At intervals of period T, the load-balancing server collects the load information of all servers in the Web cluster for that period. The main load information includes:

The CPU utilization of server s_j:

CPU_j = CPU busy time / (CPU busy time + CPU idle time)

The hard disk I/O load of server s_j:

IO_j = disk I/O busy time / (disk I/O busy time + disk I/O idle time)

The network load of server s_j:

Network_j = (inbound traffic in period T + outbound traffic in period T) / (T * P_{j_network})

The sum of the CPU weights of all instance replicas on server s_j, that is, the sum of cpu_share_ik over every replica f_ik hosted on s_j.

The memory utilization of instance replica f_ik of service f_i on server s_j:

Memory_{ik}^{j} = memory_{ik_used} / memory_{ik_total}

where memory_{ik_used} denotes the amount of memory already used by instance replica f_ik of service f_i.
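The load ratios of step (3) are simple quotients. The Python sketch below restates them as functions; the function names, the choice of bytes and seconds as units, and the handling of a zero-length measurement window are assumptions made for illustration.

```python
from typing import Iterable, Tuple


def busy_ratio(busy: float, idle: float) -> float:
    """busy / (busy + idle); the same shape serves CPU_j and IO_j."""
    total = busy + idle
    # Assumption: an empty measurement window counts as no load.
    return busy / total if total > 0 else 0.0


def network_load(bytes_in: float, bytes_out: float,
                 period_s: float, capacity_bytes_per_s: float) -> float:
    """Network_j: traffic moved in the period over what the link could move."""
    return (bytes_in + bytes_out) / (period_s * capacity_bytes_per_s)


def memory_utilization(used: float, total: float) -> float:
    """Memory utilization of one replica: memory used over its memory limit."""
    return used / total


def cpu_share_sum(placements: Iterable[Tuple[str, int]], host: str) -> int:
    """Sum cpu_share over all (host, cpu_share) pairs placed on `host`."""
    return sum(share for h, share in placements if h == host)
```

For example, a server whose link can carry 1000 bytes per second and that moved 500 bytes in a 5-second period has a network load of 0.1.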

Step (4): From the collected server load information and the existing state information of servers and service instances, the load-balancing server calculates the final weight of service instance f_ik deployed on server s_j and updates the weight of that service instance in the weight list.

In the weight calculation, α_ik, β_ik, γ_ik, and δ_ik denote the weights that service f_i assigns to the four resource types CPU, memory, hard disk, and network, respectively; different service instances of the same service assign the same weights to these four resource types. According to the weight of each service instance, the load-balancing server distributes the Web requests of the corresponding service using the weighted round-robin algorithm, which is widely used in industry.

The Web request load-balancing method based on operating system virtualization provided by the present invention is implemented by a group of functional modules: a static information collection module, a dynamic information collection module, a weight calculation module, and a load distribution module.

The static information collection module collects the static information of servers and service instances when the load-balancing method is initialized. The static information of a server mainly includes its total CPU computing capability P_{j_cpu}, memory processing capability P_{j_memory}, hard disk I/O capability P_{j_io}, and network throughput capability P_{j_network}. The static information of a service instance mainly includes the host L_{ik}^{j} on which service instance f_ik of service f_i is located, the CPU weight cpu_share_ik of f_ik, and the maximum memory memory_{ik_total} that f_ik may occupy. When a server or service in the Web cluster changes dynamically, the static information of that server or service must be updated in the static information template.

The dynamic information collection module collects the dynamically changing state information of servers and service instances. It performs a collection at intervals of period T. To keep overly frequent collection from burdening the network, T is set to 5-10 seconds on the basis of statistical experience. The module also performs a collection whenever a service instance is added or deleted. The collected state information mainly includes, for period T, the CPU utilization CPU_j of server s_j (1≤j≤n), the hard disk I/O load IO_j, the network load Network_j, the sum of the CPU weights of all service instance replicas on server s_j, and the memory utilization Memory_{ik}^{j} of instance replica f_ik of service f_i on server s_j.
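The two collection triggers described above (the period T elapsing, or a service instance being added or deleted) can be sketched as a small decision helper. This Python sketch is hypothetical: the class name, the use of caller-supplied timestamps in seconds, and the choice to restart the period after an event-triggered collection are assumptions, not details from the patent.

```python
from typing import Optional


class CollectionTrigger:
    """Decides when a dynamic-information collection should run."""

    def __init__(self, period_s: float = 5.0) -> None:
        if period_s <= 0:
            raise ValueError("period must be positive")
        self.period_s = period_s
        # Time of the previous collection; None until the first one runs.
        self._last: Optional[float] = None

    def due(self, now: float, membership_changed: bool = False) -> bool:
        """Return True, and record `now`, if a collection should run."""
        first = self._last is None
        expired = (not first) and (now - self._last >= self.period_s)
        if first or expired or membership_changed:
            # Assumption: any collection, periodic or not, restarts the period.
            self._last = now
            return True
        return False
```

Passing the current time in, rather than reading a clock inside the class, keeps the helper deterministic; a real collector would call `due` with a monotonic clock reading.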

The weight calculation module calculates the final weight of each service instance from the collected server and service instance information.

Using the service instance weights produced by the weight calculation module, the load distribution module applies the weighted round-robin scheduling algorithm: it distributes more Web requests to service instances with larger weights and fewer to those with smaller weights, achieving load balancing among the different service instances of the same service.
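The patent does not say which weighted round-robin variant is used. As an illustration only, the Python sketch below uses the "smooth" variant popularized by nginx, which spreads the picks of a heavy replica across the cycle instead of sending them back to back. The function name and the replica names in the usage note are hypothetical.

```python
from typing import Dict, List


def weighted_round_robin(weights: Dict[str, float], n_requests: int) -> List[str]:
    """Return the replica chosen for each of n_requests consecutive requests."""
    if not weights or any(w <= 0 for w in weights.values()):
        raise ValueError("weights must be non-empty and positive")
    total = sum(weights.values())
    current = {name: 0.0 for name in weights}
    picks: List[str] = []
    for _ in range(n_requests):
        # Every replica earns credit equal to its weight on each request.
        for name, w in weights.items():
            current[name] += w
        # The replica with the most credit serves the request (ties go to
        # the earliest entry) and pays back the total weight.
        chosen = max(current, key=current.get)
        current[chosen] -= total
        picks.append(chosen)
    return picks
```

With weights 5, 1, and 1 for three replicas of one service, seven consecutive requests give the heavy replica five picks, interleaved with one pick for each of the other two.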

The proposed method exploits the ability of Docker containers, which are based on operating system virtualization, to allocate system resources elastically at run time: computing resources are allocated to the Docker container hosting each service instance in proportion to that instance's CPU weight, which enables load balancing among service instance replicas and servers. By introducing the CPU weight of the service instance into the weight formula, the method prevents Docker containers hosting different service instances on the same server from competing destructively for the server's remaining computing resources and overloading it. Compared with traditional methods, the method effectively prevents different service instance replicas on the same server from simultaneously receiving a large number of Web requests because their host server is lightly loaded and thereby overloading it; it achieves load balancing among service instance replicas and servers, improves the concurrency capability of the Web cluster system, and reduces the average response time of requests.
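The proportional allocation that the method relies on can be made concrete with a simplified model. The Python sketch below assumes one fully contended server whose CPU is split among the containers that currently have work, in proportion to their CPU weights; the function name and container names are hypothetical, and the model ignores multi-core effects and hard limits.

```python
from typing import Dict, Set


def cpu_fractions(cpu_shares: Dict[str, int], busy: Set[str]) -> Dict[str, float]:
    """Split one server's CPU among busy containers in proportion to cpu_share.

    Idle containers get 0.0; if nothing is busy, every fraction is 0.0.
    """
    active_total = sum(s for name, s in cpu_shares.items() if name in busy)
    if active_total == 0:
        return {name: 0.0 for name in cpu_shares}
    return {
        name: (share / active_total if name in busy else 0.0)
        for name, share in cpu_shares.items()
    }
```

Under this model, three containers with weights 1024, 512, and 512 receive 50%, 25%, and 25% of the CPU when all are busy, while a container that is busy alone receives all of it. This is why a request routed to a replica on a lightly loaded server is not guaranteed the server's spare capacity once its neighbours also become busy.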

Brief Description of the Drawings

Figure 1 is an example diagram of the relationships among servers, services, and service instance replicas in the cluster;

Figure 2 is a flowchart of dynamic information collection.

Detailed Description

The Web request load-balancing method based on operating system virtualization provided by the present invention is implemented in four main steps:

(1) collection and normalization of server resource information; (2) collection of dynamic information about servers and service instances; (3) calculation of the final weights of the service instance replicas of all services from the server resource information and the periodically collected dynamic information; (4) distribution of requests by weighted round-robin.

(1) Collection and normalization of server resource information

The static information collection module collects four items of resource information for each server in the Web cluster: CPU computing capability P_{j_cpu}, memory processing capability P_{j_memory}, hard disk I/O capability P_{j_io}, and network throughput capability P_{j_network}. Their maximum and minimum values are then calculated as follows:

P_{max_cpu} = max(P_{1_cpu}, P_{2_cpu}, ..., P_{j_cpu}, ..., P_{n_cpu})

P_{min_cpu} = min(P_{1_cpu}, P_{2_cpu}, ..., P_{j_cpu}, ..., P_{n_cpu})

P_{max_memory} = max(P_{1_memory}, P_{2_memory}, ..., P_{j_memory}, ..., P_{n_memory})

P_{min_memory} = min(P_{1_memory}, P_{2_memory}, ..., P_{j_memory}, ..., P_{n_memory})

P_{max_io} = max(P_{1_io}, P_{2_io}, ..., P_{j_io}, ..., P_{n_io})

P_{min_io} = min(P_{1_io}, P_{2_io}, ..., P_{j_io}, ..., P_{n_io})

P_{max_network} = max(P_{1_network}, P_{2_network}, ..., P_{j_network}, ..., P_{n_network})

P_{min_network} = min(P_{1_network}, P_{2_network}, ..., P_{j_network}, ..., P_{n_network})

To eliminate the influence of server heterogeneity and of differences among resource types, the Max-Min method is used to normalize the four items of resource information: server CPU computing capability P_{j_cpu}, memory processing capability P_{j_memory}, hard disk I/O capability P_{j_io}, and network throughput capability P_{j_network}. PO_{j_cpu}, PO_{j_memory}, PO_{j_io}, and PO_{j_network} denote the corresponding normalized values.

Normalized CPU computing capability PO_{j_cpu}:

PO_{j_cpu} = (P_{j_cpu} - P_{min_cpu}) / (P_{max_cpu} - P_{min_cpu})

Normalized memory processing capability PO_{j_memory}:

PO_{j_memory} = (P_{j_memory} - P_{min_memory}) / (P_{max_memory} - P_{min_memory})

Normalized hard disk I/O capability PO_{j_io}:

PO_{j_io} = (P_{j_io} - P_{min_io}) / (P_{max_io} - P_{min_io})

Normalized network throughput capability PO_{j_network}:

PO_{j_network} = (P_{j_network} - P_{min_network}) / (P_{max_network} - P_{min_network})
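The Max-Min normalization above can be sketched in a few lines of Python. The function name, the dictionary layout, and the sample capacities are assumptions for illustration. The patent does not say what to do when every server has the same value for a resource (which makes the denominator zero); returning 1.0 in that case is a choice made for this sketch only.

```python
from typing import Dict

RESOURCES = ("cpu", "memory", "io", "network")


def normalize_servers(
    capacities: Dict[str, Dict[str, float]]
) -> Dict[str, Dict[str, float]]:
    """Map each server's P_j vector to its Max-Min normalized PO_j vector."""
    normalized: Dict[str, Dict[str, float]] = {name: {} for name in capacities}
    for res in RESOURCES:
        values = [cap[res] for cap in capacities.values()]
        lo, hi = min(values), max(values)
        for name, cap in capacities.items():
            # Assumption: identical capacities (hi == lo) normalize to 1.0.
            normalized[name][res] = (cap[res] - lo) / (hi - lo) if hi > lo else 1.0
    return normalized


# Hypothetical capacities for three servers; units are arbitrary.
capacities = {
    "s1": {"cpu": 4.0, "memory": 8.0, "io": 100.0, "network": 1000.0},
    "s2": {"cpu": 8.0, "memory": 16.0, "io": 300.0, "network": 1000.0},
    "s3": {"cpu": 6.0, "memory": 32.0, "io": 200.0, "network": 1000.0},
}
normalized = normalize_servers(capacities)
```

With these inputs the CPU values 4, 8, and 6 normalize to 0.0, 1.0, and 0.5. Note that the weakest server for a given resource always normalizes to 0 under this scheme.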

(2) Collection of dynamic information about servers and service instances

The relationships among servers, services, and service instance replicas in the Web cluster are shown in Figure 1. In Figure 1, three physical machines s₁, s₂, and s₃ in the Web cluster run eight service instance replicas of three services f₁, f₂, and f₃ (f₁₁, f₁₂, f₂₁, f₂₂, f₂₃, f₃₁, f₃₂, and f₃₃ in Figure 1). At intervals of period T, the dynamic information collection module follows the collection flow shown in Figure 2 to collect dynamic information from the three servers and the eight service instance replicas in Figure 1, after which the weight calculation module recalculates the weights of the service instance replicas of all services. The dynamic information collection module also performs a collection, and the replica weights are recalculated, whenever a service instance is added or deleted.

The information collected by the dynamic information collection module mainly includes, for period T, the CPU utilization CPU_j of server s_j (1≤j≤n), the hard disk I/O load IO_j, the network load Network_j, the sum of the CPU weights of all service instance replicas on server s_j, and the memory utilization Memory_{ik}^{j} of instance replica f_ik of service f_i on server s_j.

(3) Calculation of service instance replica weights

From the normalized server resource values calculated in step (1) and the dynamic server and service instance information collected in step (2), the final weight of service instance f_ik deployed on server s_j is calculated.
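The weight formula itself is not legible in this text, so the following Python sketch is only one plausible way of combining the quantities named in steps (1) to (3); it is an assumption, not the claimed formula. It assumes that each resource contributes its normalized capacity multiplied by its unused fraction, that the CPU term is additionally scaled by the replica's fraction of the host's total CPU weight, and that the four terms are blended with the coefficients α, β, γ, and δ. The function name `final_weight` and all parameter names are hypothetical.

```python
from typing import Dict, Tuple


def final_weight(
    po: Dict[str, float],        # normalized host capacities: cpu/memory/io/network
    load: Dict[str, float],      # CPU_j, IO_j, Network_j of the host: cpu/io/network
    cpu_share: float,            # cpu_share_ik of the replica
    cpu_share_sum: float,        # sum of CPU weights of all replicas on the host
    memory_util: float,          # memory used / memory limit of the replica
    coeff: Tuple[float, float, float, float],  # alpha, beta, gamma, delta
) -> float:
    """Illustrative (assumed) weight of one replica f_ik hosted on server s_j."""
    alpha, beta, gamma, delta = coeff
    # Fraction of the host's CPU that the replica can claim under contention.
    share_fraction = cpu_share / cpu_share_sum
    cpu_term = po["cpu"] * (1.0 - load["cpu"]) * share_fraction
    mem_term = po["memory"] * (1.0 - memory_util)
    io_term = po["io"] * (1.0 - load["io"])
    net_term = po["network"] * (1.0 - load["network"])
    return alpha * cpu_term + beta * mem_term + gamma * io_term + delta * net_term
```

Whatever the exact formula, the property the description emphasizes is visible here: two replicas on the same lightly loaded server do not both receive the server's full spare CPU as weight, because each is credited only in proportion to its own CPU weight.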

(4) Distribution of requests by weighted round-robin

After the weight list has been updated with the service instance weights obtained in step (3), the load distribution module uses the weighted round-robin scheduling algorithm to distribute the Web requests of each service evenly among that service's instance replicas, achieving load balancing among instance replicas and among servers.

Claims

1. A Web request load-balancing method based on operating system virtualization, characterized in that the method comprises the following steps:

Step (1): establish a server resource state information list; the server set S in the Web cluster is expressed as:

S = {s₁, s₂, s₃, ..., s_j, ..., s_n}

where s_j denotes a server in the Web cluster, 1≤j≤n, and n denotes the total number of servers in the Web cluster; the total resources P_j of server s_j are expressed as:

P_j = (P_{j_cpu}, P_{j_memory}, P_{j_io}, P_{j_network})

where P_{j_cpu} denotes the CPU computing capability of server s_j, P_{j_memory} denotes its memory processing capability, P_{j_io} denotes its hard disk I/O capability, and P_{j_network} denotes its network throughput; the Max-Min method is used to normalize the total resources P_j of server s_j, and the normalized total resources PO_j of server s_j are expressed as:

PO_j = (PO_{j_cpu}, PO_{j_memory}, PO_{j_io}, PO_{j_network})

where PO_{j_cpu} denotes the normalized value of P_{j_cpu}, PO_{j_memory} denotes the normalized value of P_{j_memory}, PO_{j_io} denotes the normalized value of P_{j_io}, and PO_{j_network} denotes the normalized value of P_{j_network};

Step (2): establish a service instance resource state information list; the set F of services that the Web cluster provides externally is expressed as:

F = {f₁, f₂, f₃, ..., f_i, ..., f_m}

where f_i denotes the i-th service provided by the cluster, 1≤i≤m, and m denotes the total number of service types provided by the cluster; the set F_i of service instance replicas of service f_i is expressed as:

F_i = {f_i1, f_i2, f_i3, ..., f_ik, ..., f_il}

where f_ik denotes an instance replica of service f_i, 1≤k≤l, and l denotes the number of instance replicas of service f_i; the resource state information ST_ik of service instance f_ik is expressed as:

ST_ik = (L_{ik}^{j}, cpu_share_ik, memory_{ik_total})

where L_{ik}^{j} denotes the host s_j on which instance replica f_ik of service f_i is located, cpu_share_ik denotes the CPU weight of instance replica f_ik of service f_i, and memory_{ik_total} denotes the maximum memory that instance replica f_ik of service f_i may occupy;

Step (3): at intervals of period T, the load-balancing server collects the load information of all servers in the Web cluster for that period; the main load information includes:

the CPU utilization CPU_j of server s_j:

CPU_j = CPU busy time / (CPU busy time + CPU idle time)

the hard disk I/O load IO_j of server s_j:

IO_j = disk I/O busy time / (disk I/O busy time + disk I/O idle time)

the network load Network_j of server s_j:

Network_j = (inbound traffic in period T + outbound traffic in period T) / (T * P_{j_network})

the sum of the CPU weights of all instance replicas on server s_j;

the memory utilization of instance replica f_ik of service f_i on server s_j:

Memory_{ik}^{j} = memory_{ik_used} / memory_{ik_total}

where memory_{ik_used} denotes the amount of memory already used by instance replica f_ik of service f_i;

Step (4): from the collected server load information and the existing state information of servers and service instances, the load-balancing server calculates the final weight of service instance f_ik deployed on server s_j and updates the weight of the corresponding service instance in the weight list;

in the weight calculation, α_ik, β_ik, γ_ik, and δ_ik denote the different weights that service f_i assigns to the four resource types CPU, memory, hard disk, and network, respectively, and different service instances of the same service assign the same weights to these four resource types; the CPU weight used in the calculation is that of instance f_ik of service f_i deployed on server s_j; according to the weight of each service instance, the load-balancing server distributes the Web requests of the corresponding service using the weighted round-robin algorithm.