CN117615017A - Calculation force request method, device and system - Google Patents
Calculation force request method, device and system Download PDFInfo
- Publication number
- CN117615017A CN117615017A CN202211195527.5A CN202211195527A CN117615017A CN 117615017 A CN117615017 A CN 117615017A CN 202211195527 A CN202211195527 A CN 202211195527A CN 117615017 A CN117615017 A CN 117615017A
- Authority
- CN
- China
- Prior art keywords
- computing power
- service
- node
- computing
- segment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Computer And Data Communications (AREA)
Abstract
本申请提供了一种算力请求方法、装置及系统,属于通信技术领域。在本申请提供的方案中,第一算力服务实例对应的第一节点接收到算力请求报文后,能够基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点感知各个算力服务实例的算力负载,并选择匹配的算力服务实例。由此,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。
This application provides a computing power request method, device and system, which belongs to the field of communication technology. In the solution provided by this application, after receiving the computing power request message, the first node corresponding to the first computing power service instance can determine whether to use the first computing power based on whether the first computing power service instance meets the service conditions. The service instance provides target computing power services to client devices. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to sense the computing power of each computing power service instance. load, and select the matching computing service instance. As a result, the performance requirements for routing nodes are effectively reduced and the efficiency of providing computing power services is improved.
Description
本申请要求于2022年8月22日提交的申请号为202211004588.9、发明名称为“一种算力寻址方法”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority from the Chinese patent application with application number 202211004588.9 and the invention title "A computing power addressing method" submitted on August 22, 2022, the entire content of which is incorporated into this application by reference.
技术领域Technical field
本申请涉及通信技术领域,特别涉及一种算力请求方法、装置及系统。The present application relates to the field of communication technology, and in particular to a computing power request method, device and system.
背景技术Background technique
算力优先网络(computing first network,CFN),简称算力网络,是一种为客户端设备提供算力服务的网络。Computing first network (CFN), referred to as computing network, is a network that provides computing services to client devices.
算力网络一般包括一个或多个路由节点,以及用于提供算力服务的多个服务节点。其中,不同的服务节点提供的算力服务可以相同,也可以不同。路由节点可以感知各个服务节点的算力负载并生成算力路由表,该算力路由表中记录有服务节点的地址与服务节点的算力负载的对应关系。路由节点接收到客户端设备发送的算力请求报文后,可以基于算力路由表中记录的各个服务节点的算力负载,从各个服务节点中确定出匹配的服务节点,并将该算力请求报文路由至该匹配的服务节点。A computing power network generally includes one or more routing nodes and multiple service nodes used to provide computing power services. Among them, the computing power services provided by different service nodes can be the same or different. The routing node can sense the computing power load of each service node and generate a computing power routing table. The computing power routing table records the correspondence between the address of the service node and the computing power load of the service node. After the routing node receives the computing power request message sent by the client device, it can determine the matching service node from each service node based on the computing power load of each service node recorded in the computing power routing table, and use the computing power to The request message is routed to the matching service node.
但是,上述算力请求方法中,需要路由节点感知大量服务节点的算力负载,并基于该算力负载确定与客户端设备匹配的服务节点。由此,导致对路由节点的性能要求较高。However, in the above computing power request method, the routing node needs to sense the computing power load of a large number of service nodes, and determine the service node matching the client device based on the computing power load. As a result, the performance requirements for routing nodes are higher.
发明内容Contents of the invention
本申请提供了一种算力请求方法、装置及系统,可以解决路由节点需感知大量服务节点的算力负载,导致对路由节点的性能要求较高的技术问题。This application provides a computing power request method, device and system, which can solve the technical problem that routing nodes need to sense the computing power load of a large number of service nodes, resulting in high performance requirements for routing nodes.
第一方面,提供了一种算力请求方法,应用于第一节点,该第一节点对应第一算力服务实例,该第一算力服务实例用于提供该目标算力服务。该方法包括:接收来自客户端设备的算力请求报文,该算力请求报文用于请求目标算力服务;基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向该客户端设备提供目标算力服务。In a first aspect, a computing power request method is provided, which is applied to a first node, the first node corresponds to a first computing power service instance, and the first computing power service instance is used to provide the target computing power service. The method includes: receiving a computing power request message from a client device, the computing power request message being used to request a target computing power service; and based on whether the first computing power service instance satisfies service conditions, determining whether the first computing power service instance is used by the first computing power service instance. The computing power service instance provides the target computing power service to the client device.
第一节点接收到算力请求报文后,能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否由第一算力服务实例提供目标算力服务。由此,无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例,从而有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。After receiving the computing power request message, the first node can directly determine whether its corresponding first computing power service instance meets the service conditions, and determine whether the first computing power service instance provides the target computing power service. As a result, routing nodes in the computing network no longer need to centrally sense the computing load of a large number of computing service instances and select matching computing service instances, thereby effectively reducing the performance requirements for routing nodes and improving the provision of computing power. Efficiency in service.
基于上述分析可知,本申请提供的方案能够由各个算力服务实例对应的实例节点,通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。Based on the above analysis, it can be seen that the solution provided by this application can use the instance nodes corresponding to each computing service instance to determine the computing service instance used to provide the target computing service through distributed decision-making, thereby effectively alleviating the need to concentrate on routing nodes. pressure to release information when making decisions.
可选地,该算力请求报文的段列表中可以包括多个段标识;该多个段标识分别指示多个算力服务实例,且该多个算力服务实例分别用于提供目标算力服务,该多个算力服务实例包括第一算力服务实例。Optionally, the segment list of the computing power request message may include multiple segment identifiers; the multiple segment identifiers respectively indicate multiple computing power service instances, and the multiple computing power service instances are respectively used to provide the target computing power. service, the plurality of computing power service instances include a first computing power service instance.
通过在算力请求报文中封装包括多个段标识的段列表,可以使得该算力请求报文能够按照该段列表指示的转发路径依次转发至各个算力服务实例对应的实例节点。进而,可以由接收到算力请求报文的各个实例节点基于其对应的算力服务实例是否满足服务条件,确定是否由该算力服务实例提供目标算力服务。也即是,各个算力服务实例对应的实例节点能够通过分布式决策的方式确定用于提供目标算力服务的算力服务实例。By encapsulating a segment list including multiple segment identifiers in the computing power request message, the computing power request message can be forwarded to the instance nodes corresponding to each computing power service instance in sequence according to the forwarding path indicated by the segment list. Furthermore, each instance node that receives the computing power request message can determine whether the computing power service instance provides the target computing power service based on whether its corresponding computing power service instance meets the service conditions. That is, the instance node corresponding to each computing power service instance can determine the computing power service instance used to provide the target computing power service through distributed decision-making.
可选地,基于该第一算力服务实例是否满足服务条件,确定是否由第一算力服务实例向客户端设备提供目标算力服务的过程可以包括:基于该第一算力服务实例满足服务条件,确定由第一算力服务实例向该客户端设备提供该目标算力服务,并向客户端设备发送算力响应报文;或者,基于该第一算力服务实例不满足服务条件,确定不由该第一算力服务实例向客户端设备提供目标算力服务,并根据该段列表转发算力请求报文。Optionally, based on whether the first computing power service instance satisfies the service condition, the process of determining whether the first computing power service instance provides the target computing power service to the client device may include: based on the first computing power service instance satisfying the service condition Conditions, determine that the first computing power service instance provides the target computing power service to the client device, and sends a computing power response message to the client device; or, based on the first computing power service instance not meeting the service conditions, determine The first computing power service instance provides the target computing power service to the client device, and forwards the computing power request message according to the segment list.
其中,第一节点发送算力响应报文后,客户端设备即可基于该响应报文继续向第一节点发送算力任务报文,以获取该目标算力服务。第一节点根据段列表转发算力请求报文后,即可继续由后续接收到该算力请求报文的实例节点判断其对应的算力服务实例满足服务条件,进而确定是否由其对应的算力服务实例向客户端设备提供目标算力服务。After the first node sends the computing power response message, the client device can continue to send the computing power task message to the first node based on the response message to obtain the target computing power service. After the first node forwards the computing power request message according to the segment list, the instance node that subsequently receives the computing power request message can continue to judge that its corresponding computing power service instance meets the service conditions, and then determine whether its corresponding computing power service instance satisfies the service conditions. Power service instances provide target computing power services to client devices.
可选地,该算力请求报文的目的地址字段可以包括用于指示该第一算力服务实例的第一段标识,该多个段标识包括该第一段标识。相应的,根据该段列表转发算力请求报文的过程可以包括:将该算力请求报文的目的地址字段的值更新为段列表中的第二段标识,并将该算力请求报文转发至第二节点。其中,该第二节点对应第二段标识指示的第二算力服务实例,该第二段标识为段列表中与第一段标识相邻的下一个段标识。Optionally, the destination address field of the computing power request message may include a first segment identifier used to indicate the first computing power service instance, and the plurality of segment identifiers include the first segment identifier. Correspondingly, the process of forwarding the computing power request message according to the segment list may include: updating the value of the destination address field of the computing power request message to the second segment identifier in the segment list, and forwarding the computing power request message to Forwarded to the second node. The second node corresponds to the second computing power service instance indicated by the second segment identifier, and the second segment identifier is the next segment identifier adjacent to the first segment identifier in the segment list.
在本申请提供的方案中,封装有SRH的算力请求报文即为SRv6报文。第一节点确定第一算力服务实例不满足服务条件后,即可基于SRv6报文的转发机制转发该算力请求报文。由此,实现了SRv6技术在算力网络中的灵活应用。In the solution provided by this application, the computing power request message encapsulated with SRH is an SRv6 message. After the first node determines that the first computing power service instance does not meet the service conditions, it can forward the computing power request message based on the SRv6 message forwarding mechanism. This enables the flexible application of SRv6 technology in computing power networks.
可选地,在根据该段列表转发算力请求报文之前,该方法还可以包括:在该算力请求报文中封装第一算力服务实例的算力负载。Optionally, before forwarding the computing power request message according to the segment list, the method may further include: encapsulating the computing power load of the first computing power service instance in the computing power request message.
通过在算力请求报文中封装算力服务实例的算力负载,可以便于段列表中最后一个段标识对应的实例节点能够获取到各个算力服务实例的算力负载。进而,当段列表中各个段标识指示的算力服务实例均不满足服务条件时,该最后一个段标识对应的实例节点即可基于各个算力服务实例的算力负载,选择出用于向客户端设备提供目标算力服务的目标算力服务实例。By encapsulating the computing power load of the computing power service instance in the computing power request message, the instance node corresponding to the last segment identifier in the segment list can obtain the computing power load of each computing power service instance. Furthermore, when none of the computing power service instances indicated by each segment identifier in the segment list meets the service conditions, the instance node corresponding to the last segment identifier can be selected to provide the service to the customer based on the computing power load of each computing power service instance. The terminal device provides the target computing power service instance of the target computing power service.
可选地,该算力请求报文的目的地址字段可以包括用于指示该第一算力服务实例的第一段标识,该多个段标识包括该第一段标识。相应的,根据该段列表转发算力请求报文的过程可以包括:基于该第一段标识为段列表中的最后一个段标识,将该算力请求报文的目的地址字段的值更新为段列表中的第三段标识,得到服务指示;之后,将该服务指示转发至第三节点。其中,该第三节点对应第三段标识指示的第三算力服务实例,且该服务指示用于指示第三算力服务实例向客户端设备提供目标算力服务。Optionally, the destination address field of the computing power request message may include a first segment identifier used to indicate the first computing power service instance, and the plurality of segment identifiers include the first segment identifier. Correspondingly, the process of forwarding the computing power request message according to the segment list may include: based on the first segment identifier being the last segment identifier in the segment list, updating the value of the destination address field of the computing power request message to segment The third segment identifier in the list gets the service indication; then, the service indication is forwarded to the third node. The third node corresponds to the third computing power service instance indicated by the third segment identifier, and the service indication is used to instruct the third computing power service instance to provide the target computing power service to the client device.
在本申请提供的方案中,当段列表中最后一个段标识指示的算力服务实例也不满足服务条件时,即可由第一节点强制指示第三算力服务实例向客户端设备提供目标算力服务。由此,可以确保客户端设备能够有效获取到目标算力服务。In the solution provided by this application, when the computing power service instance indicated by the last segment identifier in the segment list also does not meet the service conditions, the first node can forcibly instruct the third computing power service instance to provide the target computing power to the client device. Serve. This ensures that the client device can effectively obtain the target computing power service.
可选地,该第三算力服务实例可以是从多个算力服务实例中随机选取的。或者,该第三算力服务实例可以满足下述条件中的至少一种:第三算力服务实例的算力负载小于第一负载阈值;第三算力服务实例对应的第三节点与第一路由节点之间的网络性能满足第一通信条件。其中,该第一路由节点为与客户端设备连接的路由节点。Optionally, the third computing power service instance may be randomly selected from multiple computing power service instances. Alternatively, the third computing power service instance may satisfy at least one of the following conditions: the computing power load of the third computing power service instance is less than the first load threshold; the third node corresponding to the third computing power service instance is not the same as the first The network performance between routing nodes meets the first communication condition. Wherein, the first routing node is a routing node connected to the client device.
基于上述条件,可以确保能够由算力负载较小,和/或,网络性能较好的算力服务实例为客户端设备提供目标算力服务。Based on the above conditions, it can be ensured that the computing power service instance with smaller computing power load and/or better network performance can provide the target computing power service to the client device.
可选地,基于该第一算力服务实例是否满足服务条件,确定是否由第一算力服务实例向该客户端设备提供目标算力服务的过程可以包括:基于该第一段标识为段列表中的最后一个段标识,确定由该第一算力服务实例向客户端设备提供该目标算力服务,并向客户端设备发送算力响应报文。Optionally, based on whether the first computing power service instance satisfies the service condition, the process of determining whether the first computing power service instance provides the target computing power service to the client device may include: based on the first segment identification as a segment list The last segment identifier in determines that the first computing power service instance provides the target computing power service to the client device and sends a computing power response message to the client device.
在本申请提供的方案中,当第一段标识为段列表中的最后一个段标识时,第一节点可以确定其他段标识指示的算力服务实例均不满足服务条件。因此,第一节点可以直接确定由其对应的第一算力服务实例向客户端设备提供目标算力服务。In the solution provided by this application, when the first segment identifier is the last segment identifier in the segment list, the first node can determine that none of the computing power service instances indicated by other segment identifiers meet the service conditions. Therefore, the first node can directly determine that its corresponding first computing power service instance provides the target computing power service to the client device.
可选地,第一节点发送的算力响应报文中可以包括第一段标识。由此,可以便于客户端设备基于该第一段标识,直接向第一节点发送算力任务报文以获取目标算力服务。Optionally, the computing power response message sent by the first node may include the first segment identification. Therefore, it is convenient for the client device to directly send the computing power task message to the first node based on the first segment of the identifier to obtain the target computing power service.
可选地,该算力响应报文的源地址字段或扩展报文头中包括该第一段标识。其中,该扩展报文头可以是SRH。Optionally, the source address field or extension header of the computing power response message includes the first segment identifier. The extended message header may be SRH.
可选地,该段列表中还可以包括目标算力服务的标识。例如,该段列表中的最后一个段标识字段可以包括该目标算力服务的标识。Optionally, the segment list may also include the identification of the target computing power service. For example, the last segment identification field in the segment list may include the identification of the target computing power service.
基于此,可以便于当算力请求报文中的目的地址字段被更新为该目标算力服务的标识时,算力网络中的路由节点或服务节点对该算力请求报文的源地址和目的地址进行校验时能够校验通过,以避免校验失败而丢弃该算力请求报文。Based on this, it is convenient for the routing node or service node in the computing power network to determine the source address and destination of the computing power request message when the destination address field in the computing power request message is updated to identify the target computing power service. The address can pass the verification during verification to avoid discarding the computing power request message due to verification failure.
可选地,该服务条件可以包括:第一算力服务实例的算力负载小于第二负载阈值。由此,可以确保由算力负载较小的算力服务实例向客户端设备提供目标算力服务。Optionally, the service condition may include: the computing power load of the first computing power service instance is less than the second load threshold. This ensures that the computing power service instance with a smaller computing power load provides the target computing power service to the client device.
可选地,该服务条件还可以包括:客户端设备所连接的第一路由节点与第一节点之间的网络性能满足第二通信条件。由此,可以确保由网络性能较好的算力服务实例向客户端设备提供目标算力服务。Optionally, the service condition may also include: network performance between the first routing node connected to the client device and the first node meets the second communication condition. This ensures that the computing power service instance with better network performance provides the target computing power service to the client device.
可选地,该第一节点可以为第一算力服务实例所属的服务节点;或者,该第一节点可以为与该第一算力服务实例所属的服务节点直连的第二路由节点。Optionally, the first node may be a service node to which the first computing power service instance belongs; or, the first node may be a second routing node directly connected to the service node to which the first computing power service instance belongs.
其中,若第一节点为服务节点,则服务节点可以基于其部署的第一算力服务实例的算力负载确定该第一算力服务实例是否满足服务条件。由此,算力网络中的路由节点将无需再感知算力服务实例的算力负载,仅需转发报文即可,从而有效降低了路由节点的控制面压力。Wherein, if the first node is a service node, the service node can determine whether the first computing power service instance satisfies the service condition based on the computing power load of the first computing power service instance deployed by it. As a result, routing nodes in the computing power network no longer need to sense the computing power load of the computing power service instance and only need to forward messages, thus effectively reducing the control surface pressure of routing nodes.
若第一节点为第二路由节点,则该第二路由节点可以仅感知其所直连的服务节点中部署的算力服务实例的算力负载,且服务节点无需解析报文中的SRH并根据段列表转发报文。由此,可以在有效降低路由节点的控制面压力的情况下,与传统的服务节点的功能有效兼容。If the first node is a second routing node, the second routing node can only sense the computing load of the computing service instance deployed in the service node to which it is directly connected, and the service node does not need to parse the SRH in the message and calculate the computing power load according to the SRH in the message. Segment list forwards the message. As a result, it can be effectively compatible with the functions of traditional service nodes while effectively reducing the control surface pressure of the routing node.
第二方面,提供了一种算力请求方法,该方法包括:生成算力请求报文,该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。之后,根据该段列表转发算力请求报文,以使该多个算力服务实例中的满足服务条件的至少一个算力服务实例向客户端设备提供目标算力服务。In a second aspect, a computing power request method is provided. The method includes: generating a computing power request message. The segment list of the computing power request message includes multiple segment identifiers. The multiple segment identifiers respectively indicate multiple computing power requests. A computing power service instance, each of the multiple computing power service instances is used to provide a target computing power service. Afterwards, the computing power request message is forwarded according to the segment list, so that at least one computing power service instance among the plurality of computing power service instances that meets the service condition provides the target computing power service to the client device.
通过在算力请求报文中封装段列表,可以使得该算力请求报文能够沿段列表指示的转发路径依次转发至各个算力服务实例对应的实例节点。进而,可以由接收到算力请求报文的各个实例节点通过分布式决策的方式,确定用于提供目标算力服务的算力服务实例。由此,有效减轻了由路由节点集中式决策时的信息发布的压力。By encapsulating the segment list in the computing power request message, the computing power request message can be forwarded to the instance nodes corresponding to each computing power service instance along the forwarding path indicated by the segment list. Furthermore, each instance node that receives the computing power request message can determine the computing power service instance used to provide the target computing power service through distributed decision-making. This effectively reduces the pressure of information release when centralized decision-making by routing nodes.
可选地,该算力请求方法可以应用于客户端设备。相应的,生成算力请求报文的过程可以包括:基于该客户端设备所请求的目标算力服务,将与该目标算力服务对应的多个段标识封装至算力请求报文。Optionally, the computing power request method can be applied to the client device. Correspondingly, the process of generating the computing power request message may include: based on the target computing power service requested by the client device, encapsulating multiple segment identifiers corresponding to the target computing power service into the computing power request message.
通过客户端设备生成算力请求报文,则与该客户端设备连接的路由节点无需再感知并获取与目标算力服务对应的多个段标识,从而有效简化了路由节点的操作。By generating a computing power request message through the client device, the routing node connected to the client device no longer needs to sense and obtain multiple segment identifiers corresponding to the target computing power service, thus effectively simplifying the operation of the routing node.
可选地,在生成算力请求报文之前,该方法还可以包括:向域名系统(domain namesystem,DNS)服务器发送针对该目标算力服务的域名请求,以及接收该DNS服务器发送的与目标算力服务对应的多个段标识。Optionally, before generating the computing power request message, the method may also include: sending a domain name request for the target computing power service to a domain name system (DNS) server, and receiving a domain name request related to the target computing power service sent by the DNS server. Multiple segment identifiers corresponding to force services.
可选地,该算力请求方法可以应用于与客户端设备连接的路由节点。相应的,生成算力请求报文的过程可以包括:接收该客户端设备发送的初始请求报文,该初始请求报文包括目标算力服务的标识;在该初始请求报文中封装与目标算力服务的标识对应的多个段标识,得到算力请求报文。Optionally, the computing power request method can be applied to a routing node connected to the client device. Correspondingly, the process of generating a computing power request message may include: receiving an initial request message sent by the client device, where the initial request message includes the identification of the target computing power service; encapsulating the information related to the target computing power service in the initial request message. Multiple segment identifiers corresponding to the identifier of the computing power service, and obtain the computing power request message.
通过路由节点生成算力请求报文,则客户端设备无需再感知并获取与目标算力服务对应的多个段标识,从而有效简化了客户端设备的操作。By generating the computing power request message through the routing node, the client device no longer needs to sense and obtain multiple segment identifiers corresponding to the target computing power service, thus effectively simplifying the operation of the client device.
可选地,在路由节点生成算力请求报文之前,该方法还可以包括:接收其他路由节点通告的与该目标算力服务的标识对应的多个段标识。其中,其他路由节点可以通过控制面协议(例如BGP)通告该多个段标识。Optionally, before the routing node generates the computing power request message, the method may also include: receiving multiple segment identifiers corresponding to the identifier of the target computing power service advertised by other routing nodes. Other routing nodes may advertise the multiple segment identifiers through a control plane protocol (such as BGP).
可选地,路由节点生成算力请求报文的过程还可以包括:将该路由节点与该多个段标识指示的算力服务实例所对应的实例节点之间的网络性能参数封装至算力请求报文。Optionally, the process of the routing node generating the computing power request message may also include: encapsulating the network performance parameters between the routing node and the instance node corresponding to the computing power service instance indicated by the multiple segment identifiers into the computing power request. message.
通过在算力请求报文中封装网络性能参数,可以便于接收到算力请求报文的实例节点基于算力服务实例的算力负载和网络性能,联合判断算力服务实例是否满足服务条件。By encapsulating network performance parameters in the computing power request message, the instance node that receives the computing power request message can jointly determine whether the computing power service instance meets the service conditions based on the computing power load and network performance of the computing power service instance.
可选地,该算力请求报文可以包括分段路由头(segment routing header,SRH),该SRH中的扩展类型长度值(type-length-value,TLV)字段包括该网络性能参数。Optionally, the computing power request message may include a segment routing header (SRH), and an extended type-length-value (TLV) field in the SRH includes the network performance parameter.
可选地,在该初始请求报文中封装与目标算力服务的标识对应的该多个段标识之前,该方法还可以包括:从与该目标算力服务的标识对应的多个备选段标识中筛选出满足封装条件的该多个段标识。其中,该封装条件可以包括:路由节点与段标识指示的算力服务实例所对应的实例节点之间的网络性能满足通信条件。Optionally, before encapsulating the multiple segment identifiers corresponding to the identifier of the target computing power service in the initial request message, the method may also include: selecting from multiple alternative segments corresponding to the identifier of the target computing power service. The multiple segment identifiers that meet the encapsulation conditions are filtered out from the identifiers. The encapsulation conditions may include: the network performance between the routing node and the instance node corresponding to the computing power service instance indicated by the segment identifier meets the communication conditions.
由于第一路由节点能够提前筛选掉网络性能较差的实例节点,因此可以确保仅由网络性能较好的实例节点确定是否由其对应的算力服务实例提供目标算力服务。由此,不仅可以确保确定出满足服务条件的算力服务实例的效率,而且可以确保为客户端设备提供目标算力服务时的可靠性。Since the first routing node can filter out instance nodes with poor network performance in advance, it can ensure that only instance nodes with better network performance determine whether their corresponding computing power service instance provides the target computing power service. This not only ensures the efficiency of determining the computing power service instance that meets the service conditions, but also ensures the reliability of providing the target computing power service to the client device.
可选地,该多个段标识可以按照网络性能由高到低的顺序依次排列。基于此,可以确保与路由节点之间的网络性能越好的实例节点能够越早接收到该算力请求报文,进而可以确保优先由网络性能较好的实例节点检测是否由其对应的算力服务实例提供目标算力服务。Optionally, the multiple segment identifiers may be arranged in order from high to low network performance. Based on this, it can be ensured that the instance node with better network performance between the routing node and the routing node can receive the computing power request message sooner, thereby ensuring that the instance node with better network performance is prioritized to detect whether its corresponding computing power is used Service instances provide target computing power services.
可选地,在根据该段列表转发算力请求报文之后,该方法还可以包括:接收目标节点发送的算力响应报文,该算力响应报文中包括该目标节点对应的目标算力服务实例的目标段标识,该目标段标识属于该多个段标识。之后,向该目标节点发送算力任务报文,该算力任务报文的目的地址字段包括该目标段标识。Optionally, after forwarding the computing power request message according to the segment list, the method may also include: receiving a computing power response message sent by the target node, where the computing power response message includes the target computing power corresponding to the target node. The target segment ID of the service instance that belongs to the multiple segment IDs. Afterwards, a computing power task message is sent to the target node, and the destination address field of the computing power task message includes the target segment identifier.
在本申请提供的方案中,客户端设备可以向目标节点发送算力任务报文,且该算力任务报文中的目标段标识可以由客户端设备封装,也可以由该客户端设备所连接的路由节点封装。In the solution provided by this application, the client device can send a computing power task message to the target node, and the target segment identifier in the computing power task message can be encapsulated by the client device or connected by the client device. routing node encapsulation.
可选地,该算力响应报文的源地址字段或扩展报文头中包括该目标段标识。Optionally, the source address field or extension header of the computing power response message includes the target segment identifier.
可选地,该目标算力服务的标识可以为该多个段标识的公共前缀。相应的,在向目标节点发送算力任务报文之前,该方法还可以包括:基于该目标段标识的前缀与目标算力服务的标识匹配,记录目标算力服务与目标段标识的对应关系。由此,可以实现对目标段标识的校验,以确保记录的对应关系的可靠性。Optionally, the identifier of the target computing power service may be a common prefix of the multiple segment identifiers. Correspondingly, before sending the computing power task message to the target node, the method may also include: based on the prefix of the target segment identifier matching the identifier of the target computing power service, recording the corresponding relationship between the target computing power service and the target segment identifier. Thus, the target segment identifier can be verified to ensure the reliability of the recorded correspondence.
可选地,该段列表还可以包括目标算力服务的标识。Optionally, the segment list may also include the identification of the target computing power service.
第三方面,提供了一种算力请求方法,应用于算力请求系统。该算力请求系统包括请求节点,以及与多个算力服务实例对应的多个实例节点,其中,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。该方法包括:请求节点生成算力请求报文,该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例。之后,该请求节点根据该段列表转发算力请求报文。多个实例节点中的第一节点接收该算力请求报文,该第一节点对应该多个算力服务实例中的第一算力服务实例,之后,该第一节点基于第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。In the third aspect, a computing power request method is provided, which can be applied to the computing power request system. The computing power requesting system includes a requesting node and multiple instance nodes corresponding to multiple computing power service instances, where each computing power service instance in the multiple computing power service instances is used to provide a target computing power service. The method includes: requesting a node to generate a computing power request message, and the segment list of the computing power request message includes multiple segment identifiers, and the multiple segment identifiers respectively indicate multiple computing power service instances. Afterwards, the requesting node forwards the computing power request message according to the segment list. The first node among the plurality of instance nodes receives the computing power request message, and the first node corresponds to the first computing power service instance among the plurality of computing power service instances. After that, the first node based on the first computing power service Whether the instance satisfies the service conditions and determines whether the first computing power service instance provides the target computing power service to the client device.
可选地,第一节点基于一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务的过程可以包括:第一节点基于第一算力服务实例满足服务条件,确定由该第一算力服务实例向客户端设备提供目标算力服务,并向该客户端设备发送算力响应报文;或者,第一节点基于第一算力服务实例不满足服务条件,确定不由第一算力服务实例向客户端设备提供目标算力服务,并根据该段列表向该多个实例节点中的第二节点转发算力请求报文。Optionally, the first node determines whether the first computing power service instance provides the target computing power service to the client device based on whether the computing power service instance satisfies the service condition. The process may include: the first node determines whether the first computing power service instance satisfies the service condition. The service instance meets the service conditions, determines that the first computing power service instance provides the target computing power service to the client device, and sends a computing power response message to the client device; or, the first node based on the first computing power service instance If the service conditions are not met, it is determined that the first computing power service instance does not provide the target computing power service to the client device, and the computing power request message is forwarded to the second node among the multiple instance nodes according to the segment list.
可选地,该算力响应报文中包括用于指示该第一算力服务实例的第一段标识;该方法还可以包括:请求节点向第一节点发送算力任务报文,该算力任务报文的目的地址字段包括该第一段标识。Optionally, the computing power response message includes a first segment identifier used to indicate the first computing power service instance; the method may also include: requesting the node to send a computing power task message to the first node, the computing power The destination address field of the task message includes the first segment identifier.
可选地,该请求节点为客户端设备;在请求节点生成算力请求报文之前,该方法还可以包括:客户端设备向DNS服务器发送针对该目标算力服务的域名请求;客户端设备接收该DNS服务器发送的与该目标算力服务对应的多个段标识。Optionally, the requesting node is a client device; before the requesting node generates the computing power request message, the method may also include: the client device sends a domain name request for the target computing power service to the DNS server; the client device receives Multiple segment identifiers corresponding to the target computing power service sent by the DNS server.
可选地,该请求节点为与该客户端设备连接的路由节点;在请求节点生成算力请求报文之前,该方法还可以包括:该路由节点接收客户端设备发送的初始请求报文,该初始请求报文包括该目标算力服务的标识;该路由节点确定与该目标算力服务的标识对应的多个段标识。Optionally, the requesting node is a routing node connected to the client device; before the requesting node generates the computing power request message, the method may also include: the routing node receives an initial request message sent by the client device, and the The initial request message includes the identifier of the target computing service; the routing node determines multiple segment identifiers corresponding to the identifier of the target computing service.
第四方面,提供了一种算力请求方法,应用于尾实例节点。该方法包括:接收算力请求报文,该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例,其中每个算力服务实例用于提供目标算力服务,该尾实例节点与段列表中最后一个段标识指示的算力服务实例对应。之后,尾实例节点能够向多个算力服务实例中的目标算力服务实例所对应的实例节点发送服务指示,以使该目标算力服务实例向客户端设备提供目标算力服务。In the fourth aspect, a computing power request method is provided, which is applied to the tail instance node. The method includes: receiving a computing power request message, the segment list of the computing power request message includes multiple segment identifiers, the multiple segment identifiers respectively indicate multiple computing power service instances, wherein each computing power service instance is used for Provide the target computing power service. The tail instance node corresponds to the computing power service instance indicated by the last segment identifier in the segment list. Afterwards, the tail instance node can send a service instruction to the instance node corresponding to the target computing power service instance among the multiple computing power service instances, so that the target computing power service instance provides the target computing power service to the client device.
可选地,该算力请求报文中还可以包括该多个算力服务实例中除尾实例节点对应的算力服务实例之外的其他算力服务实例的算力负载。该尾实例节点能够基于该多个算力服务实例的算力负载,从该多个算力服务实例中确定出目标算力服务实例。Optionally, the computing power request message may also include the computing power load of other computing power service instances among the multiple computing power service instances except the computing power service instance corresponding to the last instance node. The tail instance node can determine the target computing power service instance from the multiple computing power service instances based on the computing power load of the multiple computing power service instances.
可选地,该目标算力服务实例可以满足:目标算力服务实例的算力负载小于第一负载阈值。Optionally, the target computing power service instance may satisfy: the computing power load of the target computing power service instance is less than the first load threshold.
可选地,该目标算力服务实例还可以满足:目标算力服务实例对应的实例节点与第一路由节点之间的网络性能满足第一通信条件。其中,该第一路由节点为与客户端设备连接的路由节点。Optionally, the target computing power service instance may also satisfy: the network performance between the instance node corresponding to the target computing power service instance and the first routing node meets the first communication condition. Wherein, the first routing node is a routing node connected to the client device.
可选地,该目标算力服务实例可以是尾实例节点从多个算力服务实例中随机选取的。Optionally, the target computing power service instance may be randomly selected by the tail instance node from multiple computing power service instances.
第五方面,提供了一种算力请求方法,应用于与算力服务实例对应的实例节点,该实例节点可以是算力服务实例所属的服务节点,或者可以是与该服务节点直连的路由节点。该方法包括:接收来自客户端设备的算力请求报文,该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例,其中每个算力服务实例用于提供目标算力服务。之后,在该算力请求报文中封装其对应的算力服务实例的算力负载,并根据该段列表转发算力请求报文。In the fifth aspect, a computing power request method is provided, which is applied to the instance node corresponding to the computing power service instance. The instance node can be the service node to which the computing power service instance belongs, or it can be a route directly connected to the service node. node. The method includes: receiving a computing power request message from a client device. The segment list of the computing power request message includes multiple segment identifiers. The multiple segment identifiers respectively indicate multiple computing power service instances, wherein each computing power request message includes multiple segment identifiers. Power service instances are used to provide target computing power services. Afterwards, the computing power load of the corresponding computing power service instance is encapsulated in the computing power request message, and the computing power request message is forwarded according to the segment list.
可选地,在根据该段列表转发算力请求报文之前,该方法还可以包括:在算力请求报文中封装其与第一路由节点之间的网络性能参数。其中,该第一路由节点为与客户端设备连接的路由节点。Optionally, before forwarding the computing power request message according to the segment list, the method may also include: encapsulating the network performance parameters between the computing power request message and the first routing node in the computing power request message. Wherein, the first routing node is a routing node connected to the client device.
第六方面,提供了一种第一节点,该第一节点可以是服务节点或与服务节点直连的路由节点。并且,该第一节点包括至少一个模块,该至少一个模块可以用于实现上述第一方面、第四方面或第五方面所提供的算力请求方法。A sixth aspect provides a first node, which may be a service node or a routing node directly connected to the service node. Moreover, the first node includes at least one module, and the at least one module can be used to implement the computing power request method provided in the above-mentioned first aspect, fourth aspect or fifth aspect.
第七方面,提供了一种请求节点,该请求节点可以是客户端设备或与客户端设备直连的路由节点。并且,该请求节点包括至少一个模块,该至少一个模块可以用于实现上述第二方面所提供的算力请求方法。A seventh aspect provides a request node, which may be a client device or a routing node directly connected to the client device. Furthermore, the request node includes at least one module, and the at least one module can be used to implement the computing power request method provided in the second aspect.
第八方面,提供了一种算力请求设备,该算力请求设备包括:存储器,处理器及存储在该存储器上并能够在该处理器上运行的计算机程序,该处理器执行该计算机程序时实现如上述任一方面提供的算力请求方法。In an eighth aspect, a computing power requesting device is provided. The computing power requesting device includes: a memory, a processor, and a computer program stored in the memory and capable of running on the processor. When the processor executes the computer program Implement the computing power request method provided by any of the above aspects.
第九方面,提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当该指令在处理器上运行时,使得处理器执行如上述任一方面提供的算力请求方法。In a ninth aspect, a computer-readable storage medium is provided. Instructions are stored in the computer-readable storage medium. When the instructions are run on a processor, they cause the processor to execute the computing power request method provided in any of the above aspects. .
第十方面,提供了一种包含指令的计算机程序产品,该指令由处理器执行,以实现如上述任一方面提供的算力请求方法。A tenth aspect provides a computer program product containing instructions that are executed by a processor to implement the computing power request method provided in any of the above aspects.
第十一方面,提供了一种芯片,该芯片可以用于实现上述任一方面提供的算力请求方法。An eleventh aspect provides a chip that can be used to implement the computing power request method provided in any of the above aspects.
第十二方面,提供了一种算力请求系统,该算力请求系统包括:与多个算力服务实例对应的多个实例节点,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。并且,该多个实例节点中的至少一个实例节点用于实现如上述第一方面、第四方面或第五方面提供的算力请求方法。In a twelfth aspect, a computing power requesting system is provided. The computing power requesting system includes: multiple instance nodes corresponding to multiple computing power service instances, each computing power service instance in the multiple computing power service instances. Used to provide target computing power services. Furthermore, at least one instance node among the plurality of instance nodes is used to implement the computing power request method provided in the above-mentioned first aspect, fourth aspect or fifth aspect.
可选地,该多个实例节点中的每个实例节点可以为其所对应的算力服务实例所属的服务节点;或者,该多个实例节点中的每个实例节点可以为与其所对应的算力服务实例所属的服务节点直连的路由节点。Optionally, each instance node among the plurality of instance nodes may be a service node to which its corresponding computing power service instance belongs; or, each instance node among the plurality of instance nodes may be a service node to which its corresponding computing power service instance belongs. The routing node directly connected to the service node to which the force service instance belongs.
可选地,该系统还可以包括:请求节点,该请求节点用于实现如上述第二方面提供的算力请求方法。其中,该请求节点可以为客户端设备,或者,与客户端设备连接的路由节点。Optionally, the system may also include: a request node, the request node being used to implement the computing power request method provided in the second aspect above. The requesting node may be a client device, or a routing node connected to the client device.
综上所述,本申请提供了一种算力请求方法、装置及系统。本申请提供的方案中,第一算力服务实例对应的第一节点接收到算力请求报文后,能够基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例。也即是,本申请提供的方法能够通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。进而,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, this application provides a computing power request method, device and system. In the solution provided by this application, after receiving the computing power request message, the first node corresponding to the first computing power service instance can determine whether to be served by the first computing power based on whether the first computing power service instance meets the service conditions. Instances provide target computing power services to client devices. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to centrally sense the computing power of a large number of computing power service instances. load, and select a matching computing service instance. That is to say, the method provided by this application can determine the computing service instance used to provide the target computing service through distributed decision-making, thereby effectively reducing the pressure of information release when centralized decision-making by routing nodes. Furthermore, it effectively reduces the performance requirements for routing nodes and improves the efficiency of providing computing power services.
附图说明Description of drawings
图1是本申请实施例提供的一种算力网络的结构示意图;Figure 1 is a schematic structural diagram of a computing power network provided by an embodiment of the present application;
图2是本申请实施例提供的另一种算力网络的结构示意图;Figure 2 is a schematic structural diagram of another computing power network provided by an embodiment of the present application;
图3是本申请实施例提供的一种SRv6报文的结构示意图;Figure 3 is a schematic structural diagram of an SRv6 message provided by an embodiment of the present application;
图4是本申请实施例提供的一种SID的结构示意图;Figure 4 is a schematic structural diagram of a SID provided by an embodiment of the present application;
图5是本申请实施例提供的一种SRv6报文的转发流程的示意图;Figure 5 is a schematic diagram of an SRv6 message forwarding process provided by an embodiment of the present application;
图6是本申请实施例提供的一种算力请求方法的流程图;Figure 6 is a flow chart of a computing power request method provided by an embodiment of the present application;
图7是本申请实施例提供的另一种算力请求方法的流程图;Figure 7 is a flow chart of another computing power request method provided by an embodiment of the present application;
图8是本申请实施例提供的又一种算力请求方法的流程图;Figure 8 is a flow chart of yet another computing power request method provided by an embodiment of the present application;
图9是本申请实施例提供的一种算力请求报文的转发示意图;Figure 9 is a schematic diagram of forwarding a computing power request message provided by an embodiment of this application;
图10是本申请实施例提供的再一种算力请求方法的流程图;Figure 10 is a flow chart of yet another computing power request method provided by an embodiment of the present application;
图11是本申请实施例提供的另一种算力请求报文的转发示意图;Figure 11 is a schematic diagram of forwarding another computing power request message provided by the embodiment of this application;
图12是本申请实施例提供的再一种算力请求方法的流程图;Figure 12 is a flow chart of yet another computing power request method provided by an embodiment of the present application;
图13是本申请实施例提供的一种第一节点的结构示意图;Figure 13 is a schematic structural diagram of a first node provided by an embodiment of the present application;
图14是本申请实施例提供的一种请求节点的结构示意图;Figure 14 is a schematic structural diagram of a request node provided by an embodiment of the present application;
图15是本申请实施例提供的一种算力请求设备的结构示意图;Figure 15 is a schematic structural diagram of a computing power request device provided by an embodiment of the present application;
图16是本申请实施例提供的另一种算力请求设备的结构示意图。Figure 16 is a schematic structural diagram of another computing power requesting device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图详细介绍本申请实施例提供的算力请求方法、装置及系统。The computing power request method, device and system provided by the embodiments of the present application will be introduced in detail below with reference to the accompanying drawings.
人工智能技术带来信息社会向着智能社会转变,智能社会的一个典型特征即物理世界和数字世界的深度融合,未来数字世界通过物联网及增强现实等技术提供的传感器和执行器与真实世界产生互动。交互方式上由传统社会人与人的交互,转变为数据与计算能力的交互,用户和应用使用互联网的首要需求可能不再是获得信息,而是获得计算资源、获得功能,核心的目的是获得计算的结果。Artificial intelligence technology brings about the transformation of the information society into a smart society. A typical feature of the smart society is the deep integration of the physical world and the digital world. In the future, the digital world will interact with the real world through sensors and actuators provided by technologies such as the Internet of Things and augmented reality. . The interaction mode has changed from the interaction between people in traditional society to the interaction between data and computing power. The primary demand of users and applications using the Internet may no longer be to obtain information, but to obtain computing resources and functions. The core purpose is to obtain The result of the calculation.
基于上述需求,算力网络应运而生。算力网络的技术组成,主要包括控制面的算网协同调度,数据面的网络融合感知,以及管理和服务面的算力资源编排等。算力网络体系的整体架构,应该具备统一纳管底层计算资源、存储资源和网络资源的能力,并能够将底层基础设施资源以统一的标准进行度量,抽象为信息要素加载在网络报文中,通过网络进行共享。Based on the above needs, the computing power network emerged as the times require. The technical components of the computing network mainly include computing network collaborative scheduling on the control plane, network fusion sensing on the data plane, and computing resource orchestration on the management and service plane. The overall architecture of the computing power network system should have the ability to uniformly manage underlying computing resources, storage resources and network resources, and be able to measure underlying infrastructure resources with unified standards, abstract them into information elements and load them into network messages. Share over the network.
算力网络能够实现边缘计算成网和边边协作,利用服务的多副本特性,实现用户的就近接入和服务的负载均衡,适应服务的动态性。并且,算力网络基于服务标识(serviceidentity,service ID)寻址,将计算能力状况和网络状况作为路由信息发布到网络,网络将算力请求报文路由到最合适的计算节点,实现用户体验最优、计算资源利用率最优以及网络效率最优。The computing power network can realize edge computing networking and edge-to-edge collaboration, using the multi-copy feature of services to achieve nearby access for users and load balancing of services, and adapt to the dynamics of services. Moreover, the computing power network is based on service identity (service ID) addressing and publishes computing power status and network status to the network as routing information. The network routes the computing power request message to the most appropriate computing node to achieve the best user experience. Optimal, optimal computing resource utilization and optimal network efficiency.
图1是本申请实施例提供的一种算力网络的结构示意图,如图1所示,该算力网络可以包括多个路由节点01和多个服务节点02。该多个路由节点01之间建立有通信连接,每个服务节点02与一个路由节点01之间建立有通信连接。其中,每个路由节点01可以为路由器或交换机等具有报文转发功能的设备,其也可以称为算力路由设备或CFN节点(node)。Figure 1 is a schematic structural diagram of a computing power network provided by an embodiment of the present application. As shown in Figure 1, the computing power network may include multiple routing nodes 01 and multiple service nodes 02. Communication connections are established between the multiple routing nodes 01 , and a communication connection is established between each service node 02 and one routing node 01 . Each routing node 01 may be a device with a packet forwarding function such as a router or a switch, which may also be called a computing power routing device or a CFN node.
每个服务节点02中部署有算力服务实例,该算力服务实例可以用于提供算力服务,即提供计算资源,以及功能或服务。每个服务节点02可以是单个物理服务器,也可以服务器集群,或者可以是云服务器。服务节点02也可以称为计算节点,或者多接入边缘计算(multi-access edge computing,MEC)节点。参考图1可以看出,该多个路由节点01可以位于CFN层,该CFN层可以由互联网协议(Internet protocol,IP)层承载(underlay)。A computing power service instance is deployed in each service node 02, and the computing power service instance can be used to provide computing power services, that is, to provide computing resources, functions or services. Each service node 02 can be a single physical server, a server cluster, or a cloud server. The service node 02 may also be called a computing node, or a multi-access edge computing (MEC) node. Referring to Figure 1, it can be seen that the multiple routing nodes 01 may be located at the CFN layer, and the CFN layer may be underlayed by the Internet protocol (Internet protocol, IP) layer.
继续参考图1,路由节点01还可以与客户端设备03连接,并用于将该客户端设备03发送的算力请求报文路由至匹配的服务节点02,以便该匹配的服务节点02为客户端设备03提供算力服务。其中,该客户端设备03也可以称为用户设备,其可以为家庭网关(residential gateway,RGW)、手机、笔记本电脑或者台式电脑等设备。Continuing to refer to Figure 1, the routing node 01 can also be connected to the client device 03, and is used to route the computing power request message sent by the client device 03 to the matching service node 02, so that the matching service node 02 is the client Device 03 provides computing power services. The client device 03 may also be called a user device, which may be a residential gateway (RGW), a mobile phone, a laptop or a desktop computer, or other devices.
可以理解的是,与客户端设备03直连的路由节点01也可以称为入(ingress)节点,与该匹配的服务节点02直连的路由节点01也可以称为出(egress)节点。该多个服务节点02根据部署位置的不同,可以划分为边缘节点、区域节点和中心节点等。并且,部署位置不同的服务节点02的性能可以不同,例如,中心节点的性能可以优于边缘节点。It can be understood that the routing node 01 directly connected to the client device 03 may also be called an ingress node, and the routing node 01 directly connected to the matching service node 02 may also be called an egress node. The plurality of service nodes 02 can be divided into edge nodes, regional nodes, center nodes, etc. according to different deployment locations. Moreover, the performance of service nodes 02 in different deployment locations may be different. For example, the performance of the central node may be better than that of the edge node.
相关技术中,路由节点01接收到某个客户端设备03发送的算力请求报文后,可以基于预先存储的算力路由表确定匹配的服务节点02,生成该客户端设备03的流表,并基于该流表转发客户端设备03的算力请求报文。参考图2,假设算力网络中的CFN节点1与客户端设备连接,CFN节点2与服务节点1连接,CFN节点3与服务节点2连接。该服务节点1的IP地址为IPA,服务节点2的IP地址为IPB,且该两个服务节点中部署的算力服务实例用于提供相同的算力服务。则CFN节点1转发客户端设备的算力请求报文的过程可以包括:In related technology, after routing node 01 receives a computing power request message sent by a certain client device 03, it can determine the matching service node 02 based on the pre-stored computing power routing table, and generate a flow table for the client device 03. And forward the computing power request message of client device 03 based on the flow table. Referring to Figure 2, assume that CFN node 1 in the computing power network is connected to the client device, CFN node 2 is connected to service node 1, and CFN node 3 is connected to service node 2. The IP address of service node 1 is IPA, the IP address of service node 2 is IPB, and the computing service instances deployed in the two service nodes are used to provide the same computing service. Then the process of CFN node 1 forwarding the computing power request message of the client device may include:
步骤1,CFN节点2和CFN节点3分别通过边界网关协议(border gateway protocol,BGP)协议向CFN节点1通告算力负载与网络位置的关联关系。其中,算力负载也可以称为算力状态,网络位置可以是服务节点的IP地址。例如,CFN节点2通告的关联关系包括服务节点1所提供的算力服务的标识S1,服务节点1的算力负载,以及服务节点1的IP地址。CFN节点3通告的关联关系包括服务节点2所提供的算力服务的标识S1,服务节点2的算力负载,以及服务节点2的IP地址。Step 1: CFN node 2 and CFN node 3 respectively notify CFN node 1 of the association between the computing power load and the network location through the border gateway protocol (BGP) protocol. Among them, the computing power load can also be called the computing power status, and the network location can be the IP address of the service node. For example, the association relationship advertised by CFN node 2 includes the identifier S1 of the computing power service provided by service node 1, the computing power load of service node 1, and the IP address of service node 1. The association relationship advertised by CFN node 3 includes the identifier S1 of the computing power service provided by service node 2, the computing power load of service node 2, and the IP address of service node 2.
其中,算力服务的标识也可以称为服务标识。每个服务节点的算力负载可以采用下述参数中的至少一个参数表征:处理器使用率,内存使用率,在线连接数,网络时延和融合开销(metric)等。该处理器使用率可以包括中央处理器(central processing unit,CPU)使用率,和/或,图形处理器(graphics processing unit,GPU)使用率等。Among them, the identity of the computing power service can also be called a service identity. The computing power load of each service node can be characterized by at least one of the following parameters: processor usage, memory usage, number of online connections, network delay and integration overhead (metric), etc. The processor usage may include a central processing unit (CPU) usage, and/or a graphics processing unit (GPU) usage, etc.
步骤2,CFN节点1根据接收到的关联关系建立算力路由表。该算力路由表中可以维护有同一算力服务对应的不同服务节点,以及不同服务节点的算力负载。例如,该CFN节点1建立的算力路由表可以如表1所示。其中,服务标识为S1的算力服务对应有两个服务节点,该两个服务节点的IP地址分别为IPA和IPB。每个服务节点的算力负载可以包括CPU使用率、GPU使用率、在线连接数、网络时延和融合开销。Step 2: CFN node 1 establishes a computing power routing table based on the received association relationship. The computing power routing table can maintain different service nodes corresponding to the same computing power service, as well as the computing power loads of different service nodes. For example, the computing power routing table established by CFN node 1 can be as shown in Table 1. Among them, the computing power service with the service identifier S1 corresponds to two service nodes, and the IP addresses of the two service nodes are IPA and IPB respectively. The computing power load of each service node can include CPU usage, GPU usage, number of online connections, network delay and integration overhead.
表1Table 1
步骤3,需要访问远端网络服务的客户端设备向CFN节点1(即头端CFN节点)发送算力请求报文。该算力请求报文的目的地址(destination address,DA)为所需请求的算力服务的标识。例如,参考图2,该算力请求报文的目的地址为S1。Step 3: The client device that needs to access remote network services sends a computing power request message to CFN node 1 (that is, the head-end CFN node). The destination address (DA) of the computing power request message is the identification of the requested computing power service. For example, referring to Figure 2, the destination address of the computing power request message is S1.
步骤4,CFN节点1基于算力路由表以及算力选择策略(例如优先选择轻载的服务节点)确定匹配的服务节点,将算力请求报文的目的地址更新为该服务节点的IP地址,并将算力请求报文转发至该服务节点。并且,CFN节点1可以建立客户端设备到该服务节点的流表以指导后续报文转发。其中,该流表具有老化机制,老化后需要CFN节点1根据各个服务节点的算力负载重新选择匹配的服务节点。Step 4: CFN node 1 determines the matching service node based on the computing power routing table and the computing power selection strategy (for example, giving priority to lightly loaded service nodes), and updates the destination address of the computing power request message to the IP address of the service node. And forward the computing power request message to the service node. Furthermore, CFN node 1 can establish a flow table from the client device to the service node to guide subsequent packet forwarding. Among them, the flow table has an aging mechanism. After aging, the CFN node 1 needs to reselect matching service nodes based on the computing power load of each service node.
示例的,假设CFN节点1确定出的匹配的服务节点为服务节点1,则CFN节点1可以将算力请求报文的目的地址更新为IPA,并转发至服务节点1。并且,CFN节点1可以建立如表2所示的流表。该流表中包括客户端设备的五元组信息,匹配的服务节点1的IP地址IPA,下一跳地址,以及出接口。其中,五元组信息中包括源IP,源端口,服务标识,目的端口,协议类型。下一跳地址为与该服务节点1连接的CFN节点2的IP地址:IP1。出接口为SRv6隧道。For example, assuming that the matching service node determined by CFN node 1 is service node 1, CFN node 1 can update the destination address of the computing power request message to IPA and forward it to service node 1. Moreover, CFN node 1 can establish a flow table as shown in Table 2. The flow table includes the five-tuple information of the client device, the matching IP address IPA of service node 1, the next hop address, and the outgoing interface. Among them, the five-tuple information includes source IP, source port, service identifier, destination port, and protocol type. The next hop address is the IP address of CFN node 2 connected to the service node 1: IP1. The outbound interface is an SRv6 tunnel.
表2Table 2
步骤5,CFN节点1根据流表中服务节点的IP地址所对应的下一跳转发算力请求报文。例如,参考表2,CFN节点1可以对算力请求报文进行隧道封装转发。Step 5: CFN node 1 forwards the computing power request message according to the next hop corresponding to the IP address of the service node in the flow table. For example, referring to Table 2, CFN node 1 can tunnel-encapsulate and forward the computing power request message.
步骤6,CFN节点2解封装隧道,并根据算力请求报文的目的地址,将该算力请求报文路由至对应的服务节点1。其中,CFN节点2即为传输网络的尾节点。Step 6: CFN node 2 decapsulates the tunnel and routes the computing power request message to the corresponding service node 1 according to the destination address of the computing power request message. Among them, CFN node 2 is the tail node of the transmission network.
步骤7,CFN节点1后续接收到客户端设备发送的算力任务报文后,可以直接根据表2所示的流表转发算力任务报文,而无需再执行上述步骤4。Step 7: After CFN node 1 subsequently receives the computing power task message sent by the client device, it can directly forward the computing power task message according to the flow table shown in Table 2, without performing the above step 4.
但是,上述服务请求方法需要CFN节点感知并维护大量服务节点的算力负载,导致CFN节点的控制面的压力较大。并且,针对每个客户端设备和每个业务流,CFN节点均需维护对应的一个流表,导致CFN节点中所需维护的表项数量较多,负担较重。此外,当网络震荡导致流表丢失时,需要CFN节点重新选择匹配的服务节点。若CFN节点前后选择的服务节点不一致,则前一个服务节点中未完成的任务需要由重新选择的服务节点重新开始计算,浪费算力资源。However, the above service request method requires the CFN node to sense and maintain the computing power load of a large number of service nodes, resulting in greater pressure on the control plane of the CFN node. Moreover, for each client device and each service flow, the CFN node needs to maintain a corresponding flow table, resulting in a large number of table entries to be maintained in the CFN node and a heavy burden. In addition, when network flapping causes the flow table to be lost, the CFN node needs to reselect a matching service node. If the service nodes selected before and after the CFN node are inconsistent, the unfinished tasks in the previous service node need to be restarted by the reselected service node, which wastes computing resources.
本申请实施例提供了一种算力请求方法,算力服务实例对应的实例节点(如服务节点或与服务节点直连的路由节点)在接收到算力请求报文后,直接判断其对应的算力服务实例是否满足服务条件,进而确定是否由该算力服务实例向客户端设备提供算力服务。也即是,本申请实施例提供的方法能够由各个实例节点通过分布式决策的方式,确定用于向客户端设备提供算力服务的算力服务实例。相比于由路由节点集中式决策的方式,该分布式决策的方式无需路由节点感知大量服务节点中部署的算力服务实例的算力负载,且无需路由节点维护大量的流表。由此,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。The embodiment of this application provides a computing power request method. After receiving the computing power request message, the instance node corresponding to the computing power service instance (such as a service node or a routing node directly connected to the service node) directly determines its corresponding Whether the computing power service instance meets the service conditions, and then determines whether the computing power service instance provides computing power services to the client device. That is to say, the method provided by the embodiments of the present application enables each instance node to determine the computing power service instance used to provide computing power services to the client device through distributed decision-making. Compared with the centralized decision-making method of routing nodes, this distributed decision-making method does not require routing nodes to sense the computing power load of computing service instances deployed in a large number of service nodes, and does not require routing nodes to maintain a large number of flow tables. As a result, the performance requirements for routing nodes are effectively reduced and the efficiency of providing computing power services is improved.
并且,在本申请实施例中,该算力请求报文可以为插入有SRH的SRv6报文,该SRH的段列表中包括用于提供同一算力服务的多个算力服务实例的段标识。也即是,本申请实施例提供的方案可以与SRv6技术结合,以实现算力请求报文的转发。下文对SRv6报文的转发机制进行说明。图3是本申请实施例提供的一种SRv6报文的结构示意图,如图3所示,SRv6报文包括:IPv6头、SRH和载荷(payload)。其中,IPv6头包括如下字段:版本(version)、流分类(traffic class,TC)、流标签(flow label)、载荷长度(payload length)、下一头部(nextheader)、跳数限制(hop limit)、源地址(source address,SA)和DA。Moreover, in this embodiment of the present application, the computing power request message may be an SRv6 message with an SRH inserted, and the segment list of the SRH includes segment identifiers of multiple computing power service instances used to provide the same computing power service. That is to say, the solution provided by the embodiments of this application can be combined with SRv6 technology to realize the forwarding of computing power request messages. The following describes the forwarding mechanism of SRv6 messages. Figure 3 is a schematic structural diagram of an SRv6 message provided by an embodiment of the present application. As shown in Figure 3, the SRv6 message includes: IPv6 header, SRH and payload. Among them, the IPv6 header includes the following fields: version (version), traffic class (TC), flow label (flow label), payload length (payload length), next header (nextheader), hop limit (hop limit) ), source address (source address, SA) and DA.
SRH包括如下字段:下一头部、扩展头长度(header extension length,Hdr ExtLen)、路由头类型(routing type)、SL、最后条目(last entry)、标记(flags)、标签(tag)、段列表以及可选TLV。其中,该段列表包括段列表[0]至段列表[n]共n+1个标识字段(n可以为大于或等于1的整数),每个标识字段用于携带报文转发路径中一个节点的SID,该SID可以为节点的IPv6地址。该SRH中各字段的长度和含义可以参考表3。SRH includes the following fields: next header, header extension length (Hdr ExtLen), routing header type (routing type), SL, last entry, flags, tag, segment List and optional TLV. Among them, the segment list includes segment list [0] to segment list [n], a total of n+1 identification fields (n can be an integer greater than or equal to 1), and each identification field is used to carry a node in the message forwarding path. SID, which can be the IPv6 address of the node. Please refer to Table 3 for the length and meaning of each field in the SRH.
表3table 3
图4是本申请实施例提供的一种SID的结构示意图。如图4所示,每个SID可以包括定位符(locator)字段、功能(function)字段和参数(arguments)字段。其中,定位符字段用于标识某个网络域以及该网络域中的某个节点。功能字段用于标识在相应的节点上执行的动作,也称为行为(behavior)。参数字段用于携带执行该动作所需的参数。Figure 4 is a schematic structural diagram of an SID provided by an embodiment of the present application. As shown in Figure 4, each SID may include a locator field, a function field and an arguments field. Among them, the locator field is used to identify a certain network domain and a certain node in the network domain. The function field is used to identify the action performed on the corresponding node, also called behavior. The parameter field is used to carry the parameters required to perform the action.
在报文转发的过程中,节点(例如路由节点)可以基于SL字段指示的SID更新报文的DA字段的值。该SRH中的内容即相当于计算机程序,段列表[0]至段列表[n]相当于计算机程序中的多个指令,且该计算机程序中第一个要执行的指令是段列表[n]。SRH中的SL字段相当于计算机程序的指针,该指针始终指向当前正在执行的指令。在报文转发之前,可以将SL字段的值初始化为n,每执行完一个指令,可以将SL字段的值减1(即执行SL--),从而使得SL字段指向下一条要执行的指令。通过上述方式,即可实现报文的端到端转发。During the packet forwarding process, a node (such as a routing node) may update the value of the DA field of the packet based on the SID indicated by the SL field. The content in the SRH is equivalent to a computer program. Segment list [0] to segment list [n] are equivalent to multiple instructions in the computer program, and the first instruction to be executed in the computer program is segment list [n] . The SL field in SRH is equivalent to the pointer of the computer program, which always points to the currently executing instruction. Before the message is forwarded, the value of the SL field can be initialized to n. After each instruction is executed, the value of the SL field can be decremented by 1 (that is, SL-- is executed), so that the SL field points to the next instruction to be executed. Through the above method, end-to-end forwarding of messages can be achieved.
示例的,参考图5,假设SRv6报文的转发路径为:节点A→节点B→节点C→节点D→节点I。其中,节点A、节点B、节点D和节点I均支持SRv6技术,而节点C不支持SRv6技术。则节点A作为入(ingress)节点,可以在SRv6报文中插入SRH,该SRH中封装有段列表<I::,D::,B::>。其中,I::为节点I的SID,D::为节点D的SID,B::为节点B的SID。在转发报文之前,节点A可以将SL字段的值初始化为2。然后,节点A可以将SL字段指示的段列表中的SID,即段列表[2]中的B::拷贝至IPv6头中的DA字段。之后,节点A即可将SRv6报文转发到节点B。For example, referring to Figure 5, assume that the forwarding path of the SRv6 message is: Node A→Node B→Node C→Node D→Node I. Among them, node A, node B, node D and node I all support SRv6 technology, while node C does not support SRv6 technology. Then node A, as the ingress node, can insert SRH into the SRv6 message. The SRH is encapsulated with the segment list <I::, D::, B::>. Among them, I:: is the SID of node I, D:: is the SID of node D, and B:: is the SID of node B. Before forwarding the message, node A can initialize the value of the SL field to 2. Then, node A can copy the SID in the segment list indicated by the SL field, that is, B:: in segment list [2], to the DA field in the IPv6 header. Afterwards, node A can forward the SRv6 packet to node B.
节点B收到SRv6报文后,可以根据IPv6头中的DA字段的值查找本地SID表。由于该DA字段的值能够命中节点B的本地SID表,因此节点B可以将SL字段的值减1,并将更新后的SL字段指示的段列表中的SID,即段列表[1]中的D::拷贝至IPv6头中的DA字段。之后,节点B即可将SRv6报文转发到节点C。After receiving the SRv6 message, Node B can search the local SID table based on the value of the DA field in the IPv6 header. Since the value of the DA field can hit the local SID table of node B, node B can decrement the value of the SL field by 1 and add the SID in the segment list indicated by the updated SL field, that is, the SID in the segment list [1] D::Copied to the DA field in the IPv6 header. Afterwards, node B can forward the SRv6 packet to node C.
节点C收到SRv6报文后,由于该节点C不支持SRv6技术,因此无需处理SRH,而可以直接基于该IPv6头中的DA查询路由表,并将SRv6报文转发到节点D。After node C receives the SRv6 message, since node C does not support SRv6 technology, it does not need to process SRH. Instead, it can directly query the routing table based on the DA in the IPv6 header and forward the SRv6 message to node D.
节点D收到SRv6报文后,可以根据IPv6头中的DA查找本地SID表。由于该DA能够命中节点D的本地SID表,因此节点D可以将SL字段的值减1,并将更新后的SL字段指示的段列表中的SID,即段列表[0]中的I::拷贝至IPv6头中的DA。由于更新后的SL字段的值为0,因此节点D可以将该SRH弹出,并将弹出SRH的SRv6报文转发至节点I。After receiving the SRv6 message, node D can search the local SID table based on the DA in the IPv6 header. Since the DA can hit the local SID table of node D, node D can decrement the value of the SL field by 1 and add the SID in the segment list indicated by the updated SL field, that is, I:: in segment list [0]. Copied to the DA in the IPv6 header. Since the value of the updated SL field is 0, node D can eject the SRH and forward the SRv6 message of the ejected SRH to node I.
下文对本申请实施例提供的算力请求方法进行介绍。该方法可以应用于诸如图1所示的算力网络中的第一节点,该第一节点对应第一算力服务实例,该第一算力服务实例用于提供目标算力服务。例如,该第一节点可以是部署有该第一算力服务实例的服务节点,或者可以是与该服务节点直连的路由节点。参考图6,该方法包括:The computing power request method provided by the embodiment of this application is introduced below. This method can be applied to a first node in a computing power network such as that shown in Figure 1. The first node corresponds to a first computing power service instance, and the first computing power service instance is used to provide a target computing power service. For example, the first node may be a service node where the first computing power service instance is deployed, or may be a routing node directly connected to the service node. Referring to Figure 6, the method includes:
步骤101、接收来自客户端设备的算力请求报文,该算力请求报文用于请求目标算力服务。Step 101: Receive a computing power request message from the client device. The computing power request message is used to request the target computing power service.
在本申请实施例中,第一节点能够接收到来自客户端设备的算力请求报文,该算力请求报文用于请求目标算力服务。其中,该算力请求报文可以是客户端设备生成,并发送至第一节点的。或者,该算力请求报文可以是与客户端设备直连的路由节点生成,并转发至第一节点的。In this embodiment of the present application, the first node can receive a computing power request message from a client device, and the computing power request message is used to request a target computing power service. The computing power request message may be generated by the client device and sent to the first node. Alternatively, the computing power request message may be generated by a routing node directly connected to the client device and forwarded to the first node.
步骤102、基于第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。Step 102: Based on whether the first computing power service instance satisfies the service conditions, determine whether the first computing power service instance provides the target computing power service to the client device.
第一节点接收到算力请求报文后,可以检测第一算力服务实例是否满足服务条件,并可以基于检测结果,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。例如,第一节点若检测到第一算力服务实例满足服务条件,则可以确定由该第一算力服务实例向客户端设备提供目标算力服务,并可以向客户端设备发送算力响应报文。第一节点若检测到第一算力服务实例不满足服务条件,则可以确定不由该第一算力服务实例向客户端设备提供目标算力服务,并可以继续向其他实例节点转发该算力请求报文。After receiving the computing power request message, the first node can detect whether the first computing power service instance meets the service conditions, and can determine whether the first computing power service instance provides the target computing power service to the client device based on the detection result. . For example, if the first node detects that the first computing power service instance meets the service conditions, it can determine that the first computing power service instance provides the target computing power service to the client device, and can send a computing power response message to the client device. arts. If the first node detects that the first computing power service instance does not meet the service conditions, it can determine that the first computing power service instance does not provide the target computing power service to the client device, and can continue to forward the computing power request to other instance nodes. message.
可以理解的是,该算力请求报文的段列表中可以包括多个段标识(SID),该多个段标识分别指示多个算力服务实例。也即是,该多个段标识中的每个段标识可以指示一个算力服务实例,即能够标识一个算力服务实例在算力网络中的唯一位置,并且不同段标识指示的算力服务实例不同。并且,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。该算力请求报文的目的地址字段可以包括第一段标识,该第一段标识属于该多个段标识。It can be understood that the segment list of the computing power request message may include multiple segment identifiers (SIDs), and the multiple segment identifiers respectively indicate multiple computing power service instances. That is to say, each segment identifier in the multiple segment identifiers can indicate a computing power service instance, that is, it can identify the unique position of a computing power service instance in the computing power network, and the computing power service instances indicated by different segment identifiers different. Moreover, each computing power service instance among the plurality of computing power service instances is used to provide a target computing power service. The destination address field of the computing power request message may include a first segment identifier, and the first segment identifier belongs to the multiple segment identifiers.
第一节点接收到算力请求报文后,可以先检测该第一段标识是否与第一算力服务实例的段标识匹配。若第一段标识与第一算力服务实例的段标识匹配,则第一节点可以继续检测该第一算力服务实例是否满足服务条件。若第一段标识不与第一算力服务实例的段标识匹配,则第一节点可以基于该第一段标识转发该算力请求报文,而无需检测第一算力服务实例是否满足服务条件。After receiving the computing power request message, the first node may first detect whether the first segment identifier matches the segment identifier of the first computing power service instance. If the first segment identifier matches the segment identifier of the first computing power service instance, the first node may continue to detect whether the first computing power service instance meets the service conditions. If the first segment identifier does not match the segment identifier of the first computing power service instance, the first node can forward the computing power request message based on the first segment identifier without detecting whether the first computing power service instance meets the service conditions. .
综上所述,本申请实施例提供了一种算力请求方法,第一算力服务实例对应的第一节点接收到算力请求报文后,能够基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例。也即是,本申请实施例提供的方法能够通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。进而,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, the embodiments of this application provide a computing power request method. After the first node corresponding to the first computing power service instance receives the computing power request message, it can based on whether the first computing power service instance satisfies the service requirement. condition to determine whether the first computing power service instance provides the target computing power service to the client device. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to centrally sense the computing power of a large number of computing power service instances. load, and select a matching computing service instance. That is to say, the method provided by the embodiments of the present application can determine the computing power service instance used to provide the target computing power service through distributed decision-making, thereby effectively reducing the pressure of information release when centralized decision-making by routing nodes. Furthermore, it effectively reduces the performance requirements for routing nodes and improves the efficiency of providing computing power services.
本申请实施例提供了另一种算力请求方法,该方法可以应用于诸如图1所示的算力网络中的请求节点。该请求节点可以是客户端设备,或者可以是与客户端设备直连的路由节点。参考图7,该方法包括:The embodiment of the present application provides another computing power request method, which can be applied to requesting nodes in the computing power network such as shown in Figure 1. The requesting node may be a client device, or may be a routing node directly connected to the client device. Referring to Figure 7, the method includes:
步骤201、生成算力请求报文,该算力请求报文的段列表中包括多个段标识。Step 201: Generate a computing power request message. The segment list of the computing power request message includes multiple segment identifiers.
在本申请实施例中,请求节点可以基于所需请求的目标算力服务,确定与该目标算力服务对应的多个段标识。之后,请求节点即可在算力请求报文中封装SRH,该SRH的段列表中包括该多个段标识。该多个段标识分别指示多个算力服务实例,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。In this embodiment of the present application, the requesting node may determine multiple segment identifiers corresponding to the target computing power service based on the requested target computing power service. Afterwards, the requesting node can encapsulate the SRH in the computing power request message, and the segment list of the SRH includes the multiple segment identifiers. The multiple segment identifiers respectively indicate multiple computing power service instances, and each of the multiple computing power service instances is used to provide a target computing power service.
步骤202、根据该段列表转发该算力请求报文。Step 202: Forward the computing power request message according to the segment list.
请求节点生成算力请求报文后,即可向算力网络中的路由节点转发该算力请求报文,以使该多个段标识指示的算力服务实例中满足服务条件的至少一个算力服务实例向客户端设备提供目标算力服务。其中,每个段标识指示的一个算力服务实例均对应一个实例节点,该实例节点可以是算力服务实例所属的服务节点,也可以是该服务节点所直连的路由节点。每个实例节点接收到算力请求报文后,可以基于其对应的算力服务实例是否满足服务条件,确定是否由其对应的算力服务实例向客户端设备提供该目标算力服务。After the requesting node generates the computing power request message, it can forward the computing power request message to the routing node in the computing power network, so that at least one computing power among the computing power service instances indicated by the multiple segment identifiers meets the service conditions. The service instance provides target computing power services to client devices. Among them, a computing power service instance indicated by each segment identifier corresponds to an instance node. The instance node can be a service node to which the computing power service instance belongs, or a routing node directly connected to the service node. After each instance node receives the computing power request message, it can determine whether its corresponding computing power service instance provides the target computing power service to the client device based on whether its corresponding computing power service instance meets the service conditions.
综上所述,本申请实施例提供了一种算力请求方法,请求节点生成的算力请求报文中包括段列表,并可以根据该段列表转发该算力请求报文。由于该段列表中的每个段标识指示一个用于提供目标算力服务的算力服务实例,因此该算力请求报文能够沿段列表指示的转发路径依次转发至各个算力服务实例对应的实例节点。进而,可以由接收到算力请求报文的实例节点基于其对应的算力服务实例是否满足服务条件,并确定是否由该算力服务实例向客户端设备提供该目标算力服务。由此,无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例,从而有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, embodiments of this application provide a computing power request method. The computing power request message generated by the requesting node includes a segment list, and the computing power request message can be forwarded based on the segment list. Since each segment identifier in the segment list indicates a computing power service instance used to provide the target computing power service, the computing power request message can be forwarded to the corresponding computing power service instance along the forwarding path indicated by the segment list. Instance node. Furthermore, the instance node that receives the computing power request message can determine whether the computing power service instance provides the target computing power service to the client device based on whether its corresponding computing power service instance meets the service conditions. As a result, routing nodes in the computing network no longer need to centrally sense the computing load of a large number of computing service instances and select matching computing service instances, thereby effectively reducing the performance requirements for routing nodes and improving the provision of computing power. Efficiency in service.
图8是本申请实施例提供的又一种算力请求方法的流程图,该方法以请求节点为与客户端设备直连的第一路由节点,并以第一节点为与第一算力服务实例所属的服务节点直连的第二路由节点为例进行说明。参考图8,该方法包括:Figure 8 is a flow chart of another computing power request method provided by an embodiment of the present application. In this method, the requesting node is the first routing node directly connected to the client device, and the first node is used as the first computing power service. The second routing node directly connected to the service node to which the instance belongs is used as an example for explanation. Referring to Figure 8, the method includes:
步骤301、第一路由节点接收算力网络中其他路由节点通告的与目标算力服务的标识对应的多个段标识。Step 301: The first routing node receives multiple segment identifiers corresponding to the identifiers of the target computing services advertised by other routing nodes in the computing power network.
在本申请实施例中,算力网络中的每个服务节点中可以部署有至少一个算力服务实例,且每个服务节点可以与一个路由节点连接。其中,每个算力服务实例可以提供一种算力服务,且不同算力服务实例所提供的算力服务可以相同,也可以不同。In this embodiment of the present application, at least one computing service instance can be deployed in each service node in the computing network, and each service node can be connected to a routing node. Each computing power service instance can provide one computing power service, and the computing power services provided by different computing power service instances can be the same or different.
可以理解的是,算力网络中的一个路由节点可以与一个或多个服务节点连接,或者,也可以不与服务节点连接,而仅作为报文的转发节点。其中,连接有服务节点的每个路由节点可以获取其所直连的服务节点中部署的各个算力服务实例(即路由节点对应的各个算力服务实例)的段标识,以及各个算力服务实例所提供的算力服务的标识。例如,路由节点可以接收服务节点发送的算力服务实例的段标识和算力服务的标识,或者可以接收控制器下发的算力服务实例的段标识和算力服务的标识。It can be understood that a routing node in the computing power network can be connected to one or more service nodes, or it can not be connected to a service node and only serve as a message forwarding node. Among them, each routing node connected to a service node can obtain the segment identifier of each computing power service instance deployed in the directly connected service node (that is, each computing power service instance corresponding to the routing node), as well as the segment identifier of each computing power service instance. The identification of the computing power service provided. For example, the routing node may receive the segment identifier of the computing service instance and the identifier of the computing service sent by the service node, or may receive the segment identifier of the computing service instance and the identifier of the computing service issued by the controller.
可选地,每个算力服务的标识可以是其所对应的各个算力服务实例的段标识的公共前缀。或者,算力服务的标识与其所对应的段标识也可以是相互独立的IPv6地址。Optionally, the identifier of each computing power service may be the common prefix of the segment identifier of each corresponding computing power service instance. Alternatively, the identifier of the computing power service and its corresponding segment identifier can also be independent IPv6 addresses.
在本申请实施例中,连接有服务节点的每个路由节点可以向算力网络中的其他路由节点通告其获取到的各个算力服务实例的段标识与算力服务的标识的对应关系。例如,路由节点之间可以通过控制面协议(例如BGP)通告该算力服务实例的段标识与算力服务的标识的对应关系。基于此,算力网络中的第一路由节点即可接收到其他路由节点通告的与目标算力服务的标识对应的多个段标识。并且,第一路由节点可以存储该目标算力服务的标识与该多个段标识的对应关系。In this embodiment of the present application, each routing node connected to a service node can notify other routing nodes in the computing power network of the obtained correspondence between the segment identifier of each computing power service instance and the identity of the computing power service. For example, routing nodes may notify each other through a control plane protocol (such as BGP) of the corresponding relationship between the segment identifier of the computing power service instance and the identifier of the computing power service. Based on this, the first routing node in the computing power network can receive multiple segment identifiers corresponding to the identifiers of the target computing power services advertised by other routing nodes. Furthermore, the first routing node may store the corresponding relationship between the identifier of the target computing service and the plurality of segment identifiers.
示例的,参考图9,假设路由节点1、路由节点2和路由节点3所对应的算力服务实例的段标识分别为SID1、SID2和SID3,且该三个算力服务实例均用于提供目标算力服务,该目标算力服务的标识为S1。相应的,第一路由节点可以接收到路由节点1、路由节点2和路由节点3所通告的与目标算力服务的标识S1对应的段标识SID1、SID2和SID3,并存储该标识S1与段标识SID1、SID2和SID3的对应关系。For example, referring to Figure 9, assume that the segment identifiers of the computing service instances corresponding to routing node 1, routing node 2 and routing node 3 are SID1, SID2 and SID3 respectively, and these three computing service instances are all used to provide the target Computing power service, the identifier of the target computing power service is S1. Correspondingly, the first routing node can receive the segment identifiers SID1, SID2 and SID3 corresponding to the identifier S1 of the target computing service advertised by routing node 1, routing node 2 and routing node 3, and store the identifier S1 and the segment identifier. The corresponding relationship between SID1, SID2 and SID3.
可以理解的是,连接有服务节点的每个路由节点可以向算力网络中的其他所有路由节点均通告算力服务实例的段标识与算力服务的标识的对应关系。或者,连接有服务节点的每个路由节点也可以仅向算力网络中的部分路由节点通告该对应关系。其中,该部分路由节点可以是与客户端设备连接的路由节点。It can be understood that each routing node connected to a service node can notify all other routing nodes in the computing power network of the corresponding relationship between the segment identifier of the computing power service instance and the identifier of the computing power service. Alternatively, each routing node connected to the service node can also advertise the corresponding relationship to only some routing nodes in the computing power network. The part of the routing nodes may be routing nodes connected to the client device.
还可以理解的是,第一路由节点还可以通过其他方式获取与目标算力服务的标识对应的多个段标识。例如,第一路由节点可以接收控制器下发的与目标算力服务的标识对应的多个段标识。或者,第一路由节点中可以静态配置有与目标算力服务的标识对应的多个段标识。It can also be understood that the first routing node can also obtain multiple segment identifiers corresponding to the identifier of the target computing service through other methods. For example, the first routing node may receive multiple segment identifiers corresponding to the identifier of the target computing service issued by the controller. Alternatively, the first routing node may be statically configured with multiple segment identifiers corresponding to the identifier of the target computing service.
步骤302、客户端设备向第一路由节点发送初始请求报文。Step 302: The client device sends an initial request message to the first routing node.
在本申请实施例中,客户端设备若需要获取目标算力服务,则可以向其所直连的第一路由节点发送初始请求报文。该初始请求报文中包括该目标算力服务的标识。例如,该初始请求报文的目的地址字段可以包括该目标算力服务的标识。并且,该目标算力服务的标识可以是IPv6地址。In this embodiment of the present application, if the client device needs to obtain the target computing power service, it can send an initial request message to the first routing node to which it is directly connected. The initial request message includes the identification of the target computing service. For example, the destination address field of the initial request message may include the identification of the target computing service. Moreover, the identifier of the target computing power service may be an IPv6 address.
可以理解的是,算力网络中不同的算力服务可以由不同的标识进行区分,且每个算力服务可以由多个算力服务实例提供。客户端设备所请求的目标算力服务的标识可以是该客户端设备从DNS服务器获取到的。例如,客户端设备可以向DNS服务器发送针对目标算力服务的域名请求,DNS服务器可以响应于该域名请求,向客户端设备下发目标算力服务的标识。It can be understood that different computing power services in the computing power network can be distinguished by different identifiers, and each computing power service can be provided by multiple computing power service instances. The identity of the target computing service requested by the client device may be obtained by the client device from the DNS server. For example, the client device can send a domain name request for the target computing power service to the DNS server, and the DNS server can respond to the domain name request and issue the identification of the target computing power service to the client device.
示例的,如图9所示,假设客户端设备的IP地址为C0,该目标算力服务的标识为S1,则客户端设备可以向第一路由节点发送初始请求报文。该初始请求报文中的源地址(SA)为C0,目的地址(DA)为S1。For example, as shown in Figure 9, assuming that the IP address of the client device is C0 and the identifier of the target computing service is S1, the client device can send an initial request message to the first routing node. The source address (SA) in the initial request message is C0, and the destination address (DA) is S1.
还可以理解的是,该初始请求报文可以是客户端设备针对该目标算力服务发送的首个请求报文(即首包),该请求报文可以请求与提供目标算力服务的算力服务实例创建会话。客户端设备与提供目标算力服务的某个算力服务实例创建会话后,即可通过该会话获取目标算力服务。示例的,该请求报文可以是传输控制协议(transmission controlprotocol,TCP)连接的请求报文,或者可以是快速UDP互联网连接协议(quick UDPInternet connections,QUIC)连接的请求报文。当然,该请求报文还可以是其他用于在网络层以上(例如应用层和传输层等)建立会话的首包,本申请实施例对此不做限定。It can also be understood that the initial request message may be the first request message (i.e., the first packet) sent by the client device for the target computing power service, and the request message may request and provide computing power for the target computing power service. The service instance creates a session. After the client device creates a session with a computing power service instance that provides the target computing power service, it can obtain the target computing power service through the session. For example, the request message may be a request message for a transmission control protocol (transmission control protocol, TCP) connection, or may be a request message for a quick UDP Internet connection protocol (quick UDP Internet connections, QUIC) connection. Of course, the request message may also be other first packets used to establish a session above the network layer (such as the application layer and transport layer, etc.), which is not limited in the embodiments of the present application.
还可以理解的是,在上述步骤302之前,算力服务实例所对应的实例节点可以通过路由协议向第一路由节点发布目标算力服务的标识的路由。第一路由节点进而可以通过路由协议向客户端设备及其所在的网络发布该目标算力服务的标识的路由。由此,客户端设备即可基于学习的路由,向第一路由节点发送初始请求报文。其中,该路由协议可以是BGP或内部网关协议(interior gateway protocol,IGP)等。It can also be understood that before the above step 302, the instance node corresponding to the computing power service instance may publish the route identified by the target computing power service to the first routing node through the routing protocol. The first routing node can then publish the route identified by the target computing power service to the client device and the network where it is located through the routing protocol. Therefore, the client device can send the initial request message to the first routing node based on the learned route. The routing protocol may be BGP or interior gateway protocol (interior gateway protocol, IGP), etc.
步骤303、第一路由节点在该初始请求报文中封装段列表,得到算力请求报文。Step 303: The first routing node encapsulates the segment list in the initial request message to obtain a computing power request message.
在本申请实施例中,第一路由节点接收到客户端设备发送的初始请求报文后,可以读取该初始请求报文的目的地址中的目标算力服务的标识,并确定与该目标算力服务的标识对应的多个段标识。之后,第一路由节点可以在该初始请求报文中封装SRH,得到算力请求报文。其中,该SRH的段列表中包括与该目标算力服务的标识对应的多个段标识。基于此,算力请求报文即可通过其携带的多个段标识,指示出用于提供目标算力服务的多个算力服务实例。In this embodiment of the present application, after receiving the initial request message sent by the client device, the first routing node can read the identification of the target computing service in the destination address of the initial request message, and determine the identity of the target computing service. Multiple segment identifiers corresponding to the identifier of the force service. Afterwards, the first routing node can encapsulate SRH in the initial request message to obtain the computing power request message. The segment list of the SRH includes multiple segment identifiers corresponding to the identifier of the target computing power service. Based on this, the computing power request message can indicate multiple computing power service instances used to provide the target computing power service through the multiple segment identifiers it carries.
示例的,参考图9,第一路由节点可以在初始请求报文中封装SRH,该SRH的段列表中包括与目标算力服务的标识S1对应的3个段标识SID1、SID2和SID3。For example, referring to Figure 9, the first routing node may encapsulate the SRH in the initial request message. The segment list of the SRH includes three segment identifiers SID1, SID2 and SID3 corresponding to the identifier S1 of the target computing service.
可选地,第一路由节点还可以获取其与该多个算力服务实例所对应的实例节点之间的网络性能参数,并可以将获取到的网络性能参数也封装至算力请求报文。例如,参考图9,第一路由节点可以将该网络性能参数封装至SRH的扩展TLV中。由此,可以便于接收到算力请求报文的实例节点基于该网络性能参数,确定是否由其对应的算力服务实例向客户端设备提供目标算力服务。Optionally, the first routing node may also obtain network performance parameters between it and the instance nodes corresponding to the multiple computing power service instances, and may also encapsulate the obtained network performance parameters into the computing power request message. For example, referring to FIG. 9, the first routing node may encapsulate the network performance parameter into the extended TLV of the SRH. This makes it easy for the instance node that receives the computing power request message to determine whether its corresponding computing power service instance provides the target computing power service to the client device based on the network performance parameters.
示例的,该第一路由节点可以通过双向主动测量协议(two-way activemeasurement protocol,TWAMP)或随流检测等方式,检测得到其与各个实例节点之间的网络性能参数。其中,随流检测可以是指带内流信息测量(in-situ flow informationtelemetry,IFIT)。该网络性能参数也可以称为服务级别协议(service level agreement,SLA)消息,该网络性能参数可以包括下述参数中的至少一种:时延、丢包率和抖动等。For example, the first routing node can detect and obtain network performance parameters between it and each instance node through two-way active measurement protocol (TWAMP) or flow detection. The flow-following detection may refer to in-situ flow information telemetry (IFIT). The network performance parameter may also be called a service level agreement (service level agreement, SLA) message, and the network performance parameter may include at least one of the following parameters: delay, packet loss rate, jitter, etc.
作为第一种可能的示例,该多个段标识在段列表中的排列顺序可以是第一路由节点随机确定的。也即是,第一路由节点确定出多个段标识后,可以将该多个段标识随机封装至算力请求报文的段列表中。As a first possible example, the order in which the multiple segment identifiers are arranged in the segment list may be randomly determined by the first routing node. That is, after the first routing node determines multiple segment identifiers, it can randomly encapsulate the multiple segment identifiers into the segment list of the computing power request message.
作为第二种可能的示例,该第一路由节点在获取到其与各个实例节点之间的网络性能参数后,可以在段列表中按照网络性能由高到低的顺序依次排列各个段标识。也即是,若第一路由节点与某个段标识指示的算力服务实例所对应的实例节点之间的网络性能越好,则该段标识在段列表中的排列位置就越靠前。相应的,该段标识对应的实例节点即可越早接收到该算力请求报文。As a second possible example, after obtaining the network performance parameters between the first routing node and each instance node, the first routing node may arrange the segment identifiers in the segment list in order from high to low network performance. That is, if the network performance between the first routing node and the instance node corresponding to the computing service instance indicated by a certain segment identifier is better, the segment identifier will be ranked higher in the segment list. Correspondingly, the instance node corresponding to the segment identifier can receive the computing power request message sooner.
在该第二种示例中,由于与第一路由节点之间的网络性能越好的实例节点能够越早接收到该算力请求报文,因此可以优先由网络性能较好的实例节点检测是否由其对应的算力服务实例提供目标算力服务。由于实例节点在检测是否提供目标算力服务时,还会参考其与第一路由节点之间的网络性能,因此可以确保各个实例节点能够尽快确定出用于提供目标算力服务的算力服务实例。进而,可以有效提高向客户端设备提供目标算力服务的效率。In this second example, since the instance node with better network performance between the first routing node and the first routing node can receive the computing power request message earlier, the instance node with better network performance can be prioritized to detect whether the request message is sent by Its corresponding computing power service instance provides the target computing power service. Since the instance node will also refer to the network performance between it and the first routing node when detecting whether to provide the target computing power service, it can be ensured that each instance node can determine the computing power service instance used to provide the target computing power service as soon as possible. . Furthermore, the efficiency of providing target computing power services to client devices can be effectively improved.
可选地,对于第一路由节点能够获取到网络性能参数的场景,该第一路由节点还可以基于该网络性能参数,将满足封装条件的多个段标识封装至算力请求报文中。其中,该封装条件可以包括:第一路由节点与段标识指示的算力服务实例所对应的实例节点之间的网络性能满足通信条件。也即是,第一路由节点可以将网络性能较好的实例节点所对应的段标识封装至算力请求报文,而网络性能较差的实例节点所对应的段标识则无需封装。Optionally, for a scenario where the first routing node can obtain the network performance parameters, the first routing node can also encapsulate multiple segment identifiers that meet the encapsulation conditions into the computing power request message based on the network performance parameters. The encapsulation condition may include: the network performance between the first routing node and the instance node corresponding to the computing power service instance indicated by the segment identifier meets the communication condition. That is to say, the first routing node can encapsulate the segment identifier corresponding to the instance node with better network performance into the computing power request message, while the segment identifier corresponding to the instance node with poor network performance does not need to be encapsulated.
由于第一路由节点能够提前筛选掉网络性能较差的实例节点,因此可以确保仅由网络性能较好的实例节点确定是否由其对应的算力服务实例提供目标算力服务。进而,不仅可以确保确定出满足服务条件的算力服务实例的效率,而且可以确保为客户端设备提供目标算力服务时的可靠性。Since the first routing node can filter out instance nodes with poor network performance in advance, it can ensure that only instance nodes with better network performance determine whether their corresponding computing power service instance provides the target computing power service. Furthermore, not only the efficiency of determining the computing power service instance that meets the service conditions can be ensured, but also the reliability of providing the target computing power service to the client device can be ensured.
例如,假设第一路由节点基于其他路由节点的通告,获取到了与目标算力服务的标识对应的多个备选段标识。则第一路由节点可以检测其与每个备选段标识对应的实例节点之间的网络性能,并可以基于该网络性能从多个备选段标识中筛选出满足封装条件的多个段标识。For example, assume that the first routing node obtains multiple candidate segment identifiers corresponding to the identifiers of the target computing power services based on notifications from other routing nodes. Then the first routing node can detect the network performance between it and the instance node corresponding to each candidate segment identifier, and can filter out multiple segment identifiers that meet the encapsulation conditions from the multiple candidate segment identifiers based on the network performance.
其中,该通信条件可以包括与网络性能参数中的各个参数一一对应的参数值范围。例如,若网络性能参数包括时延,则该通信条件可以包括时延范围,且该时延范围可以为小于某个时延阈值的范围。若网络性能参数包括丢包率,则该通信条件可以包括丢包率范围,且该丢包率范围可以为小于某个丢包率阈值的范围。若网络性能参数包括抖动,则该通信条件可以包括抖动范围,且该抖动范围可以为小于某个抖动阈值的范围。The communication condition may include a parameter value range that corresponds one-to-one to each parameter in the network performance parameters. For example, if the network performance parameter includes delay, the communication condition may include a delay range, and the delay range may be a range smaller than a certain delay threshold. If the network performance parameter includes a packet loss rate, the communication condition may include a packet loss rate range, and the packet loss rate range may be a range smaller than a certain packet loss rate threshold. If the network performance parameter includes jitter, the communication condition may include a jitter range, and the jitter range may be a range smaller than a certain jitter threshold.
步骤304、第一路由节点根据该段列表转发算力请求报文。Step 304: The first routing node forwards the computing power request message according to the segment list.
第一路由节点生成算力请求报文后,即可根据该段列表转发算力请求报文。例如,该第一路由节点可以将段列表中的首个段标识拷贝至算力请求报文的目的地址字段,并将该算力请求报文转发至该首个段标识对应的实例节点。也即是,该第一路由节点可以基于SRv6报文的转发机制转发该算力请求报文。After the first routing node generates the computing power request message, it can forward the computing power request message according to the segment list. For example, the first routing node may copy the first segment identifier in the segment list to the destination address field of the computing power request message, and forward the computing power request message to the instance node corresponding to the first segment identifier. That is, the first routing node can forward the computing power request message based on the forwarding mechanism of the SRv6 message.
示例的,如图9所示,第一路由节点可以将算力请求报文的目的地址字段的值更新为首个段标识:SID1,并将该算力请求报文转发至SID1对应的实例节点,即路由节点1。For example, as shown in Figure 9, the first routing node can update the value of the destination address field of the computing power request message to the first segment identifier: SID1, and forward the computing power request message to the instance node corresponding to SID1, That is routing node 1.
步骤305、第二路由节点在确定第一段标识与第一算力服务实例的段标识匹配后,检测该第一算力服务实例是否满足服务条件。Step 305: After determining that the first segment identifier matches the segment identifier of the first computing power service instance, the second routing node detects whether the first computing power service instance satisfies the service condition.
该第二路由节点可以是段列表中任一段标识指示的算力服务实例所对应的实例节点。也即是,该第二路由节点可以是段列表指示的转发路径中接收到算力请求报文的任一实例节点。并且,该第二路由节点与用于提供目标算力服务的第一算力服务实例对应,即第二路由节点与第一算力服务实例所属的服务节点直连。The second routing node may be an instance node corresponding to the computing power service instance indicated by any segment identifier in the segment list. That is, the second routing node may be any instance node that receives the computing power request message in the forwarding path indicated by the segment list. Furthermore, the second routing node corresponds to the first computing power service instance used to provide the target computing power service, that is, the second routing node is directly connected to the service node to which the first computing power service instance belongs.
在本申请实施例中,第二路由节点接收到的算力请求报文中的目的地址字段可以包括段列表中的第一段标识。第二路由节点可以先检测该第一段标识是否与第一算力服务实例的段标识匹配。若两者不匹配,则第二路由节点可以继续根据该段列表转发算力请求报文,而无需检测该第一算力服务实例是否满足服务条件。若两者匹配,则第二路由节点可以继续检测该第一算力服务实例是否满足服务条件。若第一算力服务实例不满足服务条件,则第二路由节点可以执行步骤306;若第一算力服务实例满足服务条件,则第二路由节点可以执行步骤308。In this embodiment of the present application, the destination address field in the computing power request message received by the second routing node may include the first segment identifier in the segment list. The second routing node may first detect whether the first segment identifier matches the segment identifier of the first computing power service instance. If the two do not match, the second routing node can continue to forward the computing power request message according to the segment list without checking whether the first computing power service instance meets the service conditions. If the two match, the second routing node can continue to detect whether the first computing power service instance meets the service conditions. If the first computing power service instance does not meet the service conditions, the second routing node can perform step 306; if the first computing power service instance meets the service conditions, the second routing node can perform step 308.
可选地,该服务条件可以包括:第一算力服务实例的算力负载小于第二负载阈值。该算力负载可以采用下述参数中的至少一种表征:处理器使用率,内存使用率,在线连接数,网络时延和融合开销等。其中,该处理器使用率可以包括CPU使用率,和/或,GPU使用率。在线连接数也可以称为会话数,其是指算力服务实例与各个客户端设备之间所建立的会话的总数。网络时延是指算力服务实例与第一路由节点之间的时延。融合开销是指基于算力负载中的其他参数(例如处理器使用率,内存使用率,在线连接数,和/或网络时延)计算得到的综合度量值,且该综合度量值可以是服务节点或第二路由节点计算得到的。可选的,上述算力服务实例可以指算力服务实例对应的服务节点。Optionally, the service condition may include: the computing power load of the first computing power service instance is less than the second load threshold. The computing power load can be characterized by at least one of the following parameters: processor usage, memory usage, number of online connections, network delay and integration overhead, etc. The processor usage may include CPU usage and/or GPU usage. The number of online connections can also be called the number of sessions, which refers to the total number of sessions established between the computing service instance and each client device. Network delay refers to the delay between the computing service instance and the first routing node. Fusion overhead refers to a comprehensive metric value calculated based on other parameters in the computing load (such as processor usage, memory usage, number of online connections, and/or network delay), and the comprehensive metric value can be a service node Or calculated by the second routing node. Optionally, the above computing power service instance may refer to the service node corresponding to the computing power service instance.
该第二负载阈值可以包括与算力负载的各个参数一一对应的多个阈值。例如,该第二负载阈值可以包括下述阈值中的至少一种:处理器使用率阈值,内存使用率阈值,在线连接数阈值,网络时延阈值和融合开销阈值等。相应的,算力服务实例的算力负载小于第二负载阈值可以是指:算力负载中的每个参数均小于其对应的阈值,或者,算力负载中小于对应阈值的参数的数量大于数量阈值。基于上述服务条件,可以确保为客户端设备提供目标算力服务的算力服务实例的算力负载较小,进而可以确保提供目标算力服务时的可靠性。The second load threshold may include multiple thresholds corresponding to each parameter of the computing power load. For example, the second load threshold may include at least one of the following thresholds: processor usage threshold, memory usage threshold, online connection threshold, network delay threshold, convergence overhead threshold, etc. Correspondingly, the computing power load of the computing power service instance is less than the second load threshold, which may mean: each parameter in the computing power load is less than its corresponding threshold, or the number of parameters in the computing power load that is less than the corresponding threshold is greater than the number. threshold. Based on the above service conditions, it can be ensured that the computing power load of the computing power service instance that provides the target computing power service to the client device is small, thereby ensuring the reliability of providing the target computing power service.
可选地,该服务条件还可以包括:该第一路由节点与第一节点(即第二路由节点)之间的网络性能满足第二通信条件。该第二通信条件可以包括与网络性能参数中的各个参数一一对应的参数值范围。例如,该第二通信条件可以包括时延范围,丢包率范围,和/或抖动范围。并且,该第二通信条件与上述步骤303中所述的通信条件可以相同,也可以不同。Optionally, the service condition may also include: network performance between the first routing node and the first node (ie, the second routing node) meets the second communication condition. The second communication condition may include a parameter value range corresponding to each parameter in the network performance parameters. For example, the second communication condition may include a delay range, a packet loss rate range, and/or a jitter range. Moreover, the second communication condition may be the same as the communication condition described in step 303, or may be different.
可以理解的是,第二路由节点可以基于算力请求报文中封装的网络性能参数,确定其与第一路由节点之间的网络性能。或者,若该算力请求报文中未封装网络性能参数,则第二路由节点也可以直接检测其与第一路由节点之间的网络性能。例如,第二路由节点可以通过TWAMP或随流检测等方式进行检测。It can be understood that the second routing node can determine the network performance between it and the first routing node based on the network performance parameters encapsulated in the computing power request message. Alternatively, if the network performance parameter is not encapsulated in the computing power request message, the second routing node can also directly detect the network performance between it and the first routing node. For example, the second routing node can be detected through TWAMP or flow detection.
步骤306、第二路由节点检测第一段标识是否为段列表中的最后一个段标识。Step 306: The second routing node detects whether the first segment identifier is the last segment identifier in the segment list.
在本申请实施例中,第二路由节点若检测到第一算力服务实例不满足服务条件,则可以继续检测第一段标识是否为段列表中的最后一个段标识。若第一段标识不为段列表中的最后一个段标识,则第二路由节点可以确定能够继续根据该段列表向后续的段标识所对应的实例节点转发算力请求报文,因此可以执行步骤307。若第一段标识为段列表中的最后一个段标识,则第二路由节点可以确定其为段列表指示的转发路径中的尾节点,因此可以执行步骤308。In this embodiment of the present application, if the second routing node detects that the first computing power service instance does not meet the service conditions, it can continue to detect whether the first segment identifier is the last segment identifier in the segment list. If the first segment identifier is not the last segment identifier in the segment list, the second routing node can determine that it can continue to forward the computing power request message to the instance node corresponding to the subsequent segment identifier according to the segment list, and therefore can perform the steps 307. If the first segment identifier is the last segment identifier in the segment list, the second routing node can determine that it is the tail node in the forwarding path indicated by the segment list, and therefore step 308 can be performed.
步骤307、第二路由节点确定不由第一算力服务实例提供目标算力服务,并根据段列表转发算力请求报文。Step 307: The second routing node determines that the target computing power service is not provided by the first computing power service instance, and forwards the computing power request message according to the segment list.
在本申请实施例中,若第一算力服务实例不满足服务条件,且第一段标识不为段列表中的最后一个段标识,则第二路由节点可以确定不由该第一算力服务实例提供目标算力服务,并可以根据该段列表继续转发算力请求报文。例如,第二路由节点可以将该算力请求报文的目的地址字段的值更新为段列表中的第二段标识,并将该算力请求报文转发至第二节点。In the embodiment of this application, if the first computing power service instance does not meet the service conditions and the first segment identifier is not the last segment identifier in the segment list, the second routing node can determine that the first computing power service instance is not used. Provides target computing power services and can continue to forward computing power request messages based on the segment list. For example, the second routing node may update the value of the destination address field of the computing power request message to the second segment identifier in the segment list, and forward the computing power request message to the second node.
其中,该第二节点是与第二段标识指示的第二算力服务实例对应的实例节点,且该第二段标识为段列表中与第一段标识相邻的下一个段标识。也即是,第二路由节点可以基于SRv6报文的转发机制转发该算力请求报文。由此,可以便于第二节点继续确定第二算力服务实例是否满足服务条件,进而确定是否由该第二算力服务实例提供目标算力服务。Wherein, the second node is an instance node corresponding to the second computing power service instance indicated by the second segment identifier, and the second segment identifier is the next segment identifier adjacent to the first segment identifier in the segment list. That is, the second routing node can forward the computing power request message based on the forwarding mechanism of the SRv6 message. This can facilitate the second node to continue to determine whether the second computing power service instance satisfies the service conditions, and further determine whether the second computing power service instance provides the target computing power service.
示例的,如图9所示,假设路由节点1确定SID1指示的第一算力服务实例不满足服务条件,则由于段列表中与SID1相邻的下一个段标识为SID2,因此路由节点1可以通过SR模块将算力请求报文的目的地址字段的值更新为SID2。之后,路由节点1即可通过SR模块将该算力请求报文转发至SID2对应的第二节点,即路由节点2。其中,SR模块可以包括支持SRv6转发的软件模块和硬件模块。For example, as shown in Figure 9, assuming that routing node 1 determines that the first computing power service instance indicated by SID1 does not meet the service conditions, then since the next segment adjacent to SID1 in the segment list is identified as SID2, routing node 1 can Update the value of the destination address field of the computing power request message to SID2 through the SR module. Afterwards, routing node 1 can forward the computing power request message to the second node corresponding to SID2, that is, routing node 2, through the SR module. Among them, the SR module may include software modules and hardware modules that support SRv6 forwarding.
若路由节点2确定SID2指示的第二算力服务实例也不满足服务条件,则由于段列表中与SID2相邻的下一个段标识为SID3,因此路由节点2可以通过SR模块将算力请求报文的目的地址字段的值更新为SID3。之后,路由节点2即可通过SR模块将该算力请求报文转发至SID3对应的实例节点,即路由节点3。If routing node 2 determines that the second computing power service instance indicated by SID2 also does not meet the service conditions, since the next segment adjacent to SID2 in the segment list is identified as SID3, routing node 2 can report the computing power request through the SR module. The value of the destination address field of the message is updated to SID3. After that, routing node 2 can forward the computing power request message to the instance node corresponding to SID3, that is, routing node 3, through the SR module.
可选地,第二路由节点在根据该段列表转发算力请求报文之前,还可以在该算力请求报文中封装第一算力服务实例的算力负载。例如,第二路由节点可以在算力请求报文的SRH的扩展TLV中封装该算力负载,或者可以在算力请求报文中的其他扩展头中封装该算力负载。并且,为了避免增加算力请求报文的数据量,第二路由节点可以封装算力负载中的融合开销。Optionally, before forwarding the computing power request message according to the segment list, the second routing node may also encapsulate the computing power load of the first computing power service instance in the computing power request message. For example, the second routing node may encapsulate the computing power load in the extended TLV of the SRH of the computing power request message, or may encapsulate the computing power load in other extension headers in the computing power request message. Moreover, in order to avoid increasing the data volume of the computing power request message, the second routing node can encapsulate the integration overhead in the computing power load.
通过在算力请求报文中封装算力负载,可以便于最后一个段标识对应的实例节点确定各个算力服务实例的算力负载,并基于各个算力服务实例的算力负载选择用于向客户端设备提供目标算力服务的算力服务实例。By encapsulating the computing power load in the computing power request message, it is convenient for the instance node corresponding to the last segment identifier to determine the computing power load of each computing power service instance, and select the computing power load based on the computing power load of each computing power service instance to provide the client with the computing power load. The terminal device provides the computing power service instance of the target computing power service.
步骤308、第二路由节点确定由第一算力服务实例提供目标算力服务,并向客户端设备发送算力响应报文。Step 308: The second routing node determines that the first computing power service instance provides the target computing power service, and sends a computing power response message to the client device.
在本申请实施例中,第二路由节点若检测到第一算力服务实例满足服务条件,则可以确定由该第一算力服务实例提供目标算力服务,并可以向客户端设备发送算力响应报文。或者,第二路由节点若检测到第一算力服务实例不满足服务条件,且该第一段标识为段列表中的最后一个段标识,则也可以确定由该第一算力服务实例提供目标算力服务,并可以向客户端设备发送算力响应报文。In this embodiment of the present application, if the second routing node detects that the first computing power service instance satisfies the service conditions, it can determine that the first computing power service instance provides the target computing power service, and can send the computing power to the client device. response message. Alternatively, if the second routing node detects that the first computing power service instance does not meet the service conditions and the first segment identifier is the last segment identifier in the segment list, it may also determine that the first computing power service instance provides the target Computing power service, and can send computing power response messages to client devices.
可选地,该算力响应报文中可以包括第一段标识。例如,该算力响应报文的源地址字段或扩展报文头中包括该第一段标识。其中,该扩展报文头可以是IPv6扩展头,例如可以是目的选项扩展头(destination option header,DOH),或者可以是路由扩展头(如SRH)。该DOH的扩展选项(option)中可以携带第一段标识。Optionally, the computing power response message may include the first segment identification. For example, the source address field or extended header of the computing power response message includes the first segment identifier. The extension header may be an IPv6 extension header, such as a destination option header (DOH), or a routing extension header (such as SRH). The extended option of the DOH can carry the first segment identifier.
示例的,参考图9,若第二路由节点为路由节点3,且路由节点3确定SID3指示的第一算力服务实例满足服务条件,则可以确定由该SID3指示的第一算力服务实例提供目标算力服务。并且,路由节点3可以向客户端设备发送算力响应报文。该算力响应报文的目的地址字段的值可以为客户端设备的IP地址C0,源地址字段的值可以为第一段标识SID3。For example, referring to Figure 9, if the second routing node is routing node 3, and routing node 3 determines that the first computing power service instance indicated by SID3 meets the service conditions, it can be determined that the first computing power service instance indicated by SID3 provides Target computing service. Moreover, routing node 3 can send a computing power response message to the client device. The value of the destination address field of the computing power response message may be the IP address C0 of the client device, and the value of the source address field may be the first segment identifier SID3.
可以理解的是,上述步骤306也可以在步骤305之前执行,即第二路由节点接收到算力请求报文,并确定第一段标识与第一算力服务实例的段标识匹配后,可以先检测第一段标识是否为段列表中的最后一个段标识。若第一段标识不为段列表中的最后一个段标识,则第二路由节点可以继续检测第一算力服务实例是否满足服务条件。若第一段标识为段列表中的最后一个段标识,则第二路由节点可以直接执行步骤308,而无需再检测第一算力服务实例是否满足服务条件。It can be understood that the above step 306 can also be executed before step 305, that is, after the second routing node receives the computing power request message and determines that the first segment identifier matches the segment identifier of the first computing power service instance, it can first Checks whether the first segment ID is the last segment ID in the segment list. If the first segment identifier is not the last segment identifier in the segment list, the second routing node can continue to detect whether the first computing power service instance meets the service conditions. If the first segment identifier is the last segment identifier in the segment list, the second routing node can directly perform step 308 without checking whether the first computing power service instance meets the service conditions.
步骤309、客户端设备向该第二路由节点发送算力任务报文,该算力任务报文的目的地址字段包括第一段标识。Step 309: The client device sends a computing power task message to the second routing node, and the destination address field of the computing power task message includes the first segment identifier.
客户端设备接收到第二路由节点发送的算力响应报文后,可以确定能够向该第一算力服务实例获取目标算力服务。因此,客户端设备可以向第二路由节点发送算力任务报文,该算力任务报文的目的地址字段包括第一段标识。第二路由节点可以将该算力任务报文转发至第一算力服务实例所属的服务节点,该第一算力服务实例进而可以为客户端设备提供目标算力服务。After receiving the computing power response message sent by the second routing node, the client device may determine that the target computing power service can be obtained from the first computing power service instance. Therefore, the client device may send the computing power task message to the second routing node, and the destination address field of the computing power task message includes the first segment identification. The second routing node can forward the computing power task message to the service node to which the first computing power service instance belongs, and the first computing power service instance can then provide the target computing power service for the client device.
可以理解的是,若第二路由节点在上述步骤305或步骤306之后执行了步骤307,则客户端设备可以接收到其他算力服务实例对应的实例节点(例如路由节点)发送的算力响应报文。相应的,在该步骤309中,客户端设备可以向其他算力服务实例对应的实例节点发送算力任务报文。It can be understood that if the second routing node performs step 307 after the above-mentioned step 305 or step 306, the client device may receive the computing power response message sent by the instance node (such as the routing node) corresponding to the other computing power service instance. arts. Correspondingly, in step 309, the client device may send a computing power task message to instance nodes corresponding to other computing power service instances.
还可以理解的是,客户端设备接收到算力响应报文后,可以记录目标算力服务的标识与该算力响应报文中携带的目标段标识(例如第一段标识)的对应关系。也即是,客户端设备可以学习该目标段标识。之后,客户端设备若需要获取目标算力服务,则可以直接基于该对应关系,向目标段标识指示的目标算力服务实例发送算力任务报文。It can also be understood that after receiving the computing power response message, the client device can record the correspondence between the identifier of the target computing power service and the target segment identifier (eg, the first segment identifier) carried in the computing power response message. That is, the client device can learn the target segment identification. Afterwards, if the client device needs to obtain the target computing power service, it can directly send the computing power task message to the target computing power service instance indicated by the target segment identifier based on the corresponding relationship.
可选地,若目标算力服务的标识为其所对应的多个段标识的公共前缀,则客户端设备在接收到算力响应报文后,还可以先检测该目标段标识的前缀是否与目标算力服务的标识匹配。若两者匹配,则客户端设备可以记录该目标算力服务的标识与该目标段标识的对应关系。若两者不匹配,则客户端设备无需学习该目标段标识。由此,可以实现对目标段标识的校验,以确保客户端设备记录的对应关系的可靠性。Optionally, if the identifier of the target computing power service is the common prefix of multiple segment identifiers corresponding to it, then after receiving the computing power response message, the client device can also first detect whether the prefix of the target segment identifier matches the Matches the identifier of the target computing service. If the two match, the client device can record the correspondence between the identifier of the target computing service and the identifier of the target segment. If the two do not match, the client device does not need to learn the target segment ID. Thus, the target segment identifier can be verified to ensure the reliability of the corresponding relationship recorded by the client device.
还可以理解的是,对于第一段标识为段列表中的最后一个段标识的场景,上述步骤306和步骤308是以第二路由节点直接向客户端设备发送算力响应报文为例进行的说明。作为一种可能的实现方式,该第二路由节点检测到第一段标识为段列表中的最后一个段标识后,还可以将算力请求报文的目的地址字段的值更新为段列表中的第三段标识,得到服务指示。之后,第二路由节点可以将该服务指示转发至第三节点。It can also be understood that, for the scenario where the first segment identifier is the last segment identifier in the segment list, the above steps 306 and 308 are performed using the second routing node to directly send a computing power response message to the client device as an example. illustrate. As a possible implementation, after the second routing node detects that the first segment identifier is the last segment identifier in the segment list, it may also update the value of the destination address field of the computing power request message to the value in the segment list. The third segment identifies the service instructions. The second routing node may then forward the service indication to the third node.
其中,该第三节点对应第三段标识所指示的第三算力服务实例,且该服务指示用于指示该第三算力服务实例向客户端设备提供目标算力服务。也即是,第二路由节点若检测第一段标识为段列表中的最后一个段标识,则还可以通过该服务指示,强制指示第三算力服务实例提供目标算力服务。其中,该服务指示可以包括SRH,且该SRH中可以包括强制服务的标志位,例如可以在标记字段携带该标志位。或者,该服务指示可以是将算力请求报文中的SRH弹出后得到的。The third node corresponds to the third computing power service instance indicated by the third segment identifier, and the service indication is used to instruct the third computing power service instance to provide the target computing power service to the client device. That is, if the second routing node detects that the first segment identifier is the last segment identifier in the segment list, it can also forcefully instruct the third computing power service instance to provide the target computing power service through the service indication. The service indication may include an SRH, and the SRH may include a flag bit for mandatory service. For example, the flag bit may be carried in a flag field. Alternatively, the service indication can be obtained by popping the SRH in the computing power request message.
作为第一种可能的示例,该第三算力服务实例可以是从该多个段标识指示的多个算力服务实例中随机选取的。例如,第二路由节点可以从段列表中随机选取一个段标识作为第三段标识,该第三段标识指示的算力服务实例即为第三算力服务实例。As a first possible example, the third computing power service instance may be randomly selected from multiple computing power service instances indicated by the multiple segment identifiers. For example, the second routing node may randomly select a segment identifier from the segment list as the third segment identifier, and the computing power service instance indicated by the third segment identifier is the third computing power service instance.
作为第二种可能的示例,该第三算力服务实例可以满足下述条件中的至少一种:第三算力服务实例的算力负载小于第一负载阈值;第三算力服务实例对应的第三节点与第一路由节点之间的网络性能满足第一通信条件。As a second possible example, the third computing power service instance may satisfy at least one of the following conditions: the computing power load of the third computing power service instance is less than the first load threshold; the computing power load corresponding to the third computing power service instance Network performance between the third node and the first routing node meets the first communication condition.
在该第二种示例中,第二路由节点接收到算力请求报文中可以包括:第一段标识之前的各个段标识指示的算力服务实例的算力负载,和/或,段列表中各个段标识指示的算力服务实例所对应的实例节点与第一路由节点之间的网络性能参数。第二路由节点可以基于多个算力服务实例的算力负载,和/或,多个实例节点与第一路由节点之间的网络性能,确定出满足上述条件的第三算力服务实例。In this second example, the computing power request message received by the second routing node may include: the computing power load of the computing power service instance indicated by each segment identifier before the first segment identifier, and/or, in the segment list Network performance parameters between the instance node corresponding to the computing power service instance indicated by each segment identifier and the first routing node. The second routing node may determine the third computing power service instance that satisfies the above conditions based on the computing power load of the multiple computing power service instances and/or the network performance between the multiple instance nodes and the first routing node.
上述条件中的第一通信条件可以包括与网络性能参数中的各个参数一一对应的参数值范围。例如,该第一通信条件可以包括时延范围,丢包率范围,和/或抖动范围等。The first communication condition among the above conditions may include a parameter value range corresponding to each parameter in the network performance parameters. For example, the first communication condition may include a delay range, a packet loss rate range, and/or a jitter range, etc.
上述条件中的第一负载阈值可以是第二路由节点中预先配置的负载值。或者,该第一负载阈值可以是基于多个段标识指示的算力服务实例的算力负载确定的。例如,该第一负载阈值可以是该多个算力服务实例的算力负载的均值、中值或下四分位数等。或者,该第一负载阈值可以是该多个算力服务实例的算力负载的次小值,相应的,该第三算力服务实例所属的服务节点即为多个算力服务实例中算力负载最小的服务节点。The first load threshold in the above condition may be a preconfigured load value in the second routing node. Alternatively, the first load threshold may be determined based on the computing power load of the computing power service instance indicated by multiple segment identifiers. For example, the first load threshold may be the mean, median or lower quartile of the computing power loads of the multiple computing power service instances. Alternatively, the first load threshold may be the second smallest value of the computing power load of the multiple computing power service instances. Correspondingly, the service node to which the third computing power service instance belongs is the computing power of the multiple computing power service instances. The service node with the smallest load.
可以理解的是,若第二路由节点确定出算力负载小于第一负载阈值的多个备选的算力服务实例,则可以从该多个备选的算力服务实例中确定出第三算力服务实例。例如,第二路由节点可以从多个备选的算力服务实例中随机选取一个算力服务实例作为第三算力服务实例,或者第二路由节点可以选取与第一路由节点之间的网络性能最优的算力服务实例作为第三算力服务实例。It can be understood that if the second routing node determines multiple alternative computing power service instances whose computing power load is less than the first load threshold, the third computing power service instance can be determined from the multiple alternative computing power service instances. force service instance. For example, the second routing node can randomly select a computing power service instance from multiple alternative computing power service instances as the third computing power service instance, or the second routing node can select a computing power service instance that is in compliance with the network performance of the first routing node. The optimal computing power service instance is used as the third computing power service instance.
上述第二种示例由于能够基于各个算力服务实例的算力负载,和/或,各个实例节点与第一路由节点之间的网络性能,选取出满足条件的第三算力服务实例,因此可以确保该第三算力服务实例能够为客户端设备提供较为可靠的目标算力服务。The above second example can select a third computing power service instance that meets the conditions based on the computing power load of each computing power service instance and/or the network performance between each instance node and the first routing node. Ensure that the third computing power service instance can provide a relatively reliable target computing power service for the client device.
综上所述,本申请实施例提供了一种算力请求方法,第一算力服务实例对应的第一节点接收到算力请求报文后,能够基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例。也即是,本申请实施例提供的方法能够由各个实例节点通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。进而,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, the embodiments of this application provide a computing power request method. After the first node corresponding to the first computing power service instance receives the computing power request message, it can based on whether the first computing power service instance satisfies the service requirements. condition to determine whether the first computing power service instance provides the target computing power service to the client device. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to centrally sense the computing power of a large number of computing power service instances. load, and select a matching computing service instance. That is to say, the method provided by the embodiments of this application can allow each instance node to determine the computing power service instance used to provide the target computing power service through distributed decision-making, thereby effectively reducing the information release during centralized decision-making by routing nodes. pressure. Furthermore, it effectively reduces the performance requirements for routing nodes and improves the efficiency of providing computing power services.
图10是本申请实施例提供的再一种算力请求的流程图,该方法以请求节点为客户端设备,并以第一节点为第一算力服务实例所属的第一服务节点为例进行说明。参考图10,该方法可以包括:Figure 10 is a flow chart of yet another computing power request provided by an embodiment of the present application. This method takes the requesting node as a client device and the first node as the first service node to which the first computing power service instance belongs. illustrate. Referring to Figure 10, the method may include:
步骤401、客户端设备向DNS服务器发送针对目标算力服务的域名请求。Step 401: The client device sends a domain name request for the target computing service to the DNS server.
如图11所示,该域名请求可以是目标算力服务的统一资源定位系统(uniformresource locator,URL)。As shown in Figure 11, the domain name request may be the uniform resource locator (URL) of the target computing service.
步骤402、DNS服务器向客户端设备发送与目标算力服务对应的多个段标识。Step 402: The DNS server sends multiple segment identifiers corresponding to the target computing power service to the client device.
在本申请实施例中,DNS服务器接收到客户端设备发送的域名请求后,可以基于该域名请求,向客户端设备发送与目标算力服务对应的多个段标识。可选地,DNS服务器还可以向客户端设备发送该目标算力服务的标识。In this embodiment of the present application, after receiving the domain name request sent by the client device, the DNS server can send multiple segment identifiers corresponding to the target computing power service to the client device based on the domain name request. Optionally, the DNS server can also send the identification of the target computing service to the client device.
示例的,参考图11,DNS服务器可以向客户端设备发送与目标算力服务对应的3个段标识SID1、SID2和SID3。For example, referring to Figure 11, the DNS server may send three segment identifiers SID1, SID2, and SID3 corresponding to the target computing power service to the client device.
可以理解的是,客户端设备除了可以通过DNS请求获取与目标算力服务对应的多个段标识,还可以通过其他方式获取该多个段标识。例如,客户端设备可以接收控制器下发的与目标算力服务对应的多个段标识,或者,客户端设备中可以预先静态配置有该多个段标识。It can be understood that, in addition to obtaining multiple segment identifiers corresponding to the target computing power service through DNS requests, the client device can also obtain the multiple segment identifiers through other methods. For example, the client device may receive multiple segment identifiers corresponding to the target computing power service issued by the controller, or the client device may be statically configured with the multiple segment identifiers in advance.
步骤403、客户端设备将该多个段标识封装至算力请求报文。Step 403: The client device encapsulates the multiple segment identifiers into the computing power request message.
客户端设备接收到DNS服务器发送的与目标算力服务对应的多个段标识后,可以记录目标算力服务与该多个段标识的对应关系。当客户端设备需要获取目标算力服务时,可以生成算力请求报文。该算力请求报文中封装有SRH,该SRH的段列表中包括该多个段标识。例如,如图11所示,该算力请求报文的段列表中记录有SID1、SID2和SID3共3个段标识。After receiving the multiple segment identifiers corresponding to the target computing power service sent by the DNS server, the client device may record the corresponding relationship between the target computing power service and the multiple segment identifiers. When the client device needs to obtain the target computing power service, it can generate a computing power request message. The computing power request message is encapsulated with SRH, and the segment list of the SRH includes the multiple segment identifiers. For example, as shown in Figure 11, the segment list of the computing power request message contains three segment identifiers, SID1, SID2, and SID3.
步骤404、客户端设备根据该段列表转发该算力请求报文。Step 404: The client device forwards the computing power request message according to the segment list.
在本申请实施例中,客户端设备可以基于SRv6报文的转发机制,根据该段列表转发该算力请求报文。例如,客户端设备可以将段列表中的首个段标识拷贝至算力请求报文的目的地址字段,并向该首个段标识对应的实例节点转发该算力请求报文。In this embodiment of the present application, the client device can forward the computing power request message according to the segment list based on the forwarding mechanism of the SRv6 message. For example, the client device can copy the first segment identifier in the segment list to the destination address field of the computing power request message, and forward the computing power request message to the instance node corresponding to the first segment identifier.
示例的,参考图11,假设段列表中的首个段标识为SID1,则客户端设备发出的算力请求报文的目的地址字段的值即为SID1。假设服务节点1中部署的算力服务实例的段标识为SID1,则客户端设备可以通过路由节点将该算力请求报文转发至服务节点1。For example, referring to Figure 11, assuming that the first segment identifier in the segment list is SID1, the value of the destination address field of the computing power request message sent by the client device is SID1. Assume that the segment identifier of the computing power service instance deployed in service node 1 is SID1, then the client device can forward the computing power request message to service node 1 through the routing node.
步骤405、第一服务节点在确定第一段标识与第一算力服务实例的段标识匹配后,确定该第一算力服务实例是否满足服务条件。Step 405: After determining that the first segment identifier matches the segment identifier of the first computing power service instance, the first service node determines whether the first computing power service instance satisfies the service conditions.
其中,该第一服务节点可以是段列表中多个段标识指示的多个算力服务实例所属的服务节点中的任一服务节点,且该第一服务节点中部署有用于提供目标算力服务的第一算力服务实例。第一服务节点接收到算力请求报文后,若该算力请求报文的目的地址字段中包括第一段标识,则可以检测该第一段标识是否与第一算力服务实例的段标识匹配。Wherein, the first service node may be any service node among the service nodes to which multiple computing power service instances indicated by multiple segment identifiers in the segment list belong, and the first service node is deployed with a device for providing the target computing power service. The first computing power service instance. After the first service node receives the computing power request message, if the destination address field of the computing power request message includes the first segment identifier, it can detect whether the first segment identifier is the same as the segment identifier of the first computing power service instance. match.
若两者不匹配,则第一服务节点可以继续根据该段列表转发算力请求报文,而无需检测该第一算力服务实例是否满足服务条件。若两者匹配,则第一服务节点可以继续检测该第一算力服务实例是否满足服务条件。若第一算力服务实例不满足服务条件,则第一服务节点可以执行步骤406;若第一算力服务实例满足服务条件,则第一服务节点可以执行步骤408。其中,该服务条件的内容可以参考上述步骤305中的相关描述,此处不再赘述。If the two do not match, the first service node can continue to forward the computing power request message according to the segment list without checking whether the first computing power service instance meets the service conditions. If the two match, the first service node can continue to detect whether the first computing power service instance meets the service conditions. If the first computing power service instance does not meet the service conditions, the first service node can perform step 406; if the first computing power service instance meets the service conditions, the first service node can perform step 408. For the content of the service conditions, please refer to the relevant description in step 305 above, which will not be described again here.
示例的,参考图11,假设第一服务节点为服务节点1,且服务节点1中部署的第一算力服务实例的段标识为SID1。若服务节点1接收到的算力请求报文中目的地址字段中的第一段标识也为SID1,则该服务节点1可以检测第一算力服务实例是否满足服务条件。For example, referring to Figure 11, assume that the first service node is service node 1, and the segment identifier of the first computing power service instance deployed in service node 1 is SID1. If the first segment identifier in the destination address field of the computing power request message received by the service node 1 is also SID1, the service node 1 can detect whether the first computing power service instance meets the service conditions.
步骤406、第一服务节点检测第一段标识是否为段列表中的最后一个段标识。Step 406: The first service node detects whether the first segment identifier is the last segment identifier in the segment list.
第一服务节点若检测到第一算力服务实例不满足服务条件,则可以继续检测第一段标识是否为段列表中的最后一个段标识。若第一段标识不为段列表中的最后一个段标识,则第一服务节点可以确定能够继续根据该段列表向后续的段标识对应的实例节点转发算力请求报文,因此可以执行步骤407。若第一服务节点为段列表中的最后一个段标识,则第一服务节点可以确定其为段列表指示的转发路径中的尾节点,因此可以执行步骤408。If the first service node detects that the first computing power service instance does not meet the service conditions, it can continue to detect whether the first segment identifier is the last segment identifier in the segment list. If the first segment identifier is not the last segment identifier in the segment list, the first service node can determine that it can continue to forward the computing power request message to the instance node corresponding to the subsequent segment identifier according to the segment list, so step 407 can be performed. . If the first service node is the last segment identifier in the segment list, the first service node can determine that it is the tail node in the forwarding path indicated by the segment list, and therefore step 408 can be performed.
步骤407、第一服务节点确定不由第一算力服务实例提供目标算力服务,并根据段列表转发算力请求报文。Step 407: The first service node determines that the target computing power service is not provided by the first computing power service instance, and forwards the computing power request message according to the segment list.
若第一算力服务实例不满足服务条件,且第一段标识不为段列表中的最后一个段标识,则第一服务节点可以确定不由第一算力服务实例提供目标算力服务,并可以根据该段列表继续转发算力请求报文。例如,第一服务节点可以将该算力请求报文的目的地址字段的值更新为段列表中的第二段标识,并将该算力请求报文转发至第二节点。其中,该第二节点可以是第二段标识指示的第二算力服务实例所属的服务节点,且该第二段标识为段列表中与第一段标识相邻的下一个段标识。If the first computing power service instance does not meet the service conditions, and the first segment identifier is not the last segment identifier in the segment list, the first service node can determine that the target computing power service is not provided by the first computing power service instance, and can Continue to forward the computing power request message according to this segment list. For example, the first service node may update the value of the destination address field of the computing power request message to the second segment identifier in the segment list, and forward the computing power request message to the second node. The second node may be a service node to which the second computing power service instance indicated by the second segment identifier belongs, and the second segment identifier is the next segment identifier adjacent to the first segment identifier in the segment list.
示例的,如图11所示,假设服务节点1确定SID1指示的第一算力服务实例不满足服务条件,则由于段列表中与SID1相邻的下一个段标识为SID2,因此服务节点1可以通过SR模块将算力请求报文的目的地址字段的值更新为SID2。之后,服务节点1即可通过SR模块将该算力请求报文转发至SID2对应的第二节点,即服务节点2。For example, as shown in Figure 11, assuming that service node 1 determines that the first computing power service instance indicated by SID1 does not meet the service conditions, since the next segment adjacent to SID1 in the segment list is identified as SID2, service node 1 can Update the value of the destination address field of the computing power request message to SID2 through the SR module. Afterwards, service node 1 can forward the computing power request message to the second node corresponding to SID2, that is, service node 2, through the SR module.
若服务节点2确定SID2指示的第二算力服务实例也不满足服务条件,则由于段列表中与SID2相邻的下一个段标识为SID3,因此服务节点2可以通过SR模块将算力请求报文的目的地址字段的值更新为SID3。之后,服务节点2即可通过SR模块将该算力请求报文转发至SID3对应的实例节点,即服务节点3。If the service node 2 determines that the second computing power service instance indicated by SID2 does not meet the service conditions, since the next segment adjacent to SID2 in the segment list is identified as SID3, the service node 2 can report the computing power request through the SR module. The value of the destination address field of the message is updated to SID3. Afterwards, service node 2 can forward the computing power request message to the instance node corresponding to SID3, that is, service node 3, through the SR module.
步骤408、第一服务节点确定由第一算力服务实例提供目标算力服务,并向客户端设备发送算力响应报文。Step 408: The first service node determines that the first computing power service instance provides the target computing power service, and sends a computing power response message to the client device.
第一服务节点若检测到第一算力服务实例满足服务条件,则可以确定由该第一算力服务实例提供目标算力服务,并可以向客户端设备发送算力响应报文。或者,第一服务节点若检测到第一算力服务实例不满足服务条件,且该第一段标识为段列表中的最后一个段标识,则也可以确定由该第一算力服务实例提供目标算力服务,并向客户端设备发送算力响应报文。该步骤408的实现过程可以参考上述步骤308的相关描述。If the first service node detects that the first computing power service instance meets the service conditions, it can determine that the first computing power service instance provides the target computing power service, and can send a computing power response message to the client device. Alternatively, if the first service node detects that the first computing power service instance does not meet the service conditions and the first segment identifier is the last segment identifier in the segment list, it may also determine that the first computing power service instance provides the target computing power service and sends a computing power response message to the client device. For the implementation process of step 408, please refer to the relevant description of step 308 above.
示例的,参考图11,若第一服务节点为服务节点3,且服务节点3接收到的算力请求报文中的目的地址字段的值为SID3。若服务节点3确定该SID3指示的第一算力服务实例满足服务条件,则可以确定由该SID3指示的第一算力服务实例提供目标算力服务。并且,服务节点3可以向客户端设备发送算力响应报文。该算力响应报文的目的地址字段的值可以为客户端设备的IP地址C0,源地址字段的值可以是第一段标识SID3。For example, referring to Figure 11, if the first service node is service node 3, and the value of the destination address field in the computing power request message received by service node 3 is SID3. If the service node 3 determines that the first computing power service instance indicated by SID3 satisfies the service conditions, it may determine that the first computing power service instance indicated by SID3 provides the target computing power service. Moreover, the service node 3 can send a computing power response message to the client device. The value of the destination address field of the computing power response message may be the IP address C0 of the client device, and the value of the source address field may be the first segment identifier SID3.
步骤409、客户端设备向第一服务节点发送算力任务报文,该算力任务报文的目的地址字段包括第一段标识。Step 409: The client device sends a computing power task message to the first service node, and the destination address field of the computing power task message includes the first segment identifier.
客户端设备接收到第一服务节点发送的算力响应报文后,可以确定能够向该第一服务节点中部署的第一算力服务实例获取目标算力服务。因此,客户端设备可以记录目标算力服务的标识与该算力响应报文中携带的目标段标识(例如第一段标识)的对应关系。之后,客户端设备即可向该第一服务节点发送算力任务报文,该算力任务报文的目的地址字段包括第一段标识。该步骤409的实现过程可以参考上述步骤309的相关描述。After receiving the computing power response message sent by the first service node, the client device may determine that the target computing power service can be obtained from the first computing power service instance deployed in the first service node. Therefore, the client device can record the corresponding relationship between the identifier of the target computing power service and the target segment identifier (eg, the first segment identifier) carried in the computing power response message. Afterwards, the client device can send a computing power task message to the first service node, and the destination address field of the computing power task message includes the first segment identifier. For the implementation process of step 409, please refer to the relevant description of step 309 above.
对于算力服务实例对应的实例节点(例如第一节点)为服务节点的场景,参考图11可以看出,该服务节点中既包括算力服务模块,也包括支持SRv6技术的SR模块。也即是,该服务节点既可以通过算力服务实例向客户端设备提供算力服务,也能够通过SRv6报文的转发机制转发算力请求报文。For the scenario where the instance node (for example, the first node) corresponding to the computing power service instance is a service node, referring to Figure 11, it can be seen that the service node includes both the computing power service module and the SR module that supports SRv6 technology. That is to say, the service node can not only provide computing power services to client devices through computing power service instances, but also forward computing power request messages through the SRv6 message forwarding mechanism.
对比图8和图10所示的实施例可知,若算力服务实例对应的实例节点(例如第一节点)为服务节点,则服务节点可以基于自身部署的第一算力服务实例的算力负载确定第一算力服务实例是否满足服务条件。由此,算力网络中的路由节点将无需再感知算力服务实例的算力负载,仅需转发报文即可,从而有效降低了路由节点的控制面压力。Comparing the embodiments shown in Figure 8 and Figure 10, it can be seen that if the instance node (for example, the first node) corresponding to the computing power service instance is a service node, the service node can be based on the computing power load of the first computing power service instance deployed by itself. Determine whether the first computing power service instance meets the service conditions. As a result, routing nodes in the computing power network no longer need to sense the computing power load of the computing power service instance and only need to forward messages, thus effectively reducing the control surface pressure of routing nodes.
若算力服务实例对应的实例节点为与服务节点直连的路由节点(例如第二路由节点),则该路由节点可以仅感知其所直连的服务节点中部署的算力服务实例的算力负载,且服务节点无需解析SRH并根据段列表转发报文。由此,可以在有效降低路由节点的控制面压力的情况下,与传统的服务节点的功能有效兼容,即不会增加对服务节点的性能要求。If the instance node corresponding to the computing power service instance is a routing node (such as a second routing node) directly connected to the service node, the routing node can only sense the computing power of the computing power service instance deployed in the service node to which it is directly connected. load, and the service node does not need to parse the SRH and forward the message according to the segment list. As a result, the control surface pressure of the routing node can be effectively reduced while being effectively compatible with the functions of the traditional service node, that is, the performance requirements for the service node will not be increased.
对比图8和图10所示的实施例还可知,若请求节点为与客户端设备直连的路由节点(例如第一路由节点),则客户端设备无需感知并封装与目标算力服务的标识对应的多个段标识,也无需支持SRv6技术。由此,有效降低了对客户端设备的性能要求。Comparing the embodiments shown in Figure 8 and Figure 10, it can also be seen that if the requesting node is a routing node directly connected to the client device (for example, the first routing node), the client device does not need to sense and encapsulate the identity of the target computing service. The corresponding multiple segment identifiers do not need to support SRv6 technology. As a result, the performance requirements for client devices are effectively reduced.
若请求节点为客户端设备,则与客户端设备直连的路由节点无需感知并封装与目标算力服务的标识对应的多个段标识,而仅需基于SRv6报文的转发机制转发报文即可。由此,有效降低了对路由节点的性能要求。If the requesting node is a client device, the routing node directly connected to the client device does not need to sense and encapsulate multiple segment identifiers corresponding to the identifier of the target computing service, but only needs to forward the message based on the forwarding mechanism of the SRv6 message. Can. As a result, the performance requirements for routing nodes are effectively reduced.
上述步骤309和步骤409均是以客户端设备学习算力响应报文中的目标段标识为例进行的说明。可以理解的是,若客户端设备不具备该学习能力,则也可以由客户端设备与服务节点之间的某个路由节点(例如第一路由节点)学习该算力响应报文中的目标段标识。也即是,可以由该路由节点记录目标算力服务的标识与目标段标识的对应关系,该对应关系也可以称为流表。该路由节点接收到客户端设备发送的算力请求报文或算力任务报文后,可以根据该报文中的目标算力服务的标识查找流表,并可以根据查找到的目标段标识转发报文。The above steps 309 and 409 are explained by taking the client device learning the target segment identifier in the computing power response message as an example. It can be understood that if the client device does not have the learning capability, a routing node (such as the first routing node) between the client device and the service node can also learn the target segment in the computing power response message. logo. That is, the routing node can record the correspondence between the identification of the target computing power service and the identification of the target segment. This correspondence can also be called a flow table. After receiving the computing power request message or computing power task message sent by the client device, the routing node can search the flow table according to the identification of the target computing power service in the message, and can forward it according to the found target segment identification. message.
可选地,在本申请实施例中,客户端设备或第一路由节点生成的算力请求报文中还可以封装有目标算力服务的标识。例如,参考图9和图11,算力请求报文的段列表中的最后一个段标识字段,即segment list[0],可以携带该目标算力服务的标识(如S1)。由此,可以便于当算力请求报文中的目的地址字段被更新为该目标算力服务的标识时,算力网络中的路由节点或服务节点对该算力请求报文的源地址和目的地址进行校验时能够校验通过,以避免校验失败而丢弃该算力请求报文。其中,路由节点或服务节点所采用的校验方法可以为TCP或UDP校验和校验和(checksum)。Optionally, in this embodiment of the present application, the identity of the target computing power service may also be encapsulated in the computing power request message generated by the client device or the first routing node. For example, referring to Figures 9 and 11, the last segment identification field in the segment list of the computing power request message, that is, segment list[0], can carry the identification of the target computing power service (such as S1). This makes it easier for the routing node or service node in the computing power network to determine the source address and destination of the computing power request message when the destination address field in the computing power request message is updated to identify the target computing power service. The address can pass the verification during verification to avoid discarding the computing power request message due to verification failure. The verification method used by the routing node or service node may be TCP or UDP verification and checksum.
可以理解的是,在图8所示的实施例中,也可以由客户端设备基于步骤401至步骤403所示的方法生成算力请求报文。在图10所示的实施例中,也可以由第一路由节点基于步骤301至步骤303所示的方法生成算力请求报文。也即是,该请求节点和第一节点可以根据应用场景的需求灵活组合,本申请实施例对此不做限定。It can be understood that in the embodiment shown in FIG. 8 , the client device can also generate a computing power request message based on the method shown in steps 401 to 403 . In the embodiment shown in FIG. 10 , the first routing node may also generate a computing power request message based on the method shown in steps 301 to 303 . That is to say, the requesting node and the first node can be flexibly combined according to the requirements of the application scenario, which is not limited in the embodiments of the present application.
还可以理解的是,本申请实施例提供的算力请求方法的步骤先后顺序可以进行适当调整,步骤也可以根据情况进行相应增减。例如,上述步骤301可以根据情况删除。或者,上述步骤306可以根据情况删除,或可以在步骤305之前执行。又或者,上述步骤401和步骤402可以根据情况删除。再或者,上述步骤406可以根据情况删除,或可以在步骤405之前执行。It can also be understood that the sequence of steps in the computing power request method provided by the embodiments of the present application can be appropriately adjusted, and the steps can also be increased or decreased accordingly according to the situation. For example, the above step 301 can be deleted according to the situation. Alternatively, the above step 306 may be deleted according to circumstances, or may be performed before step 305. Or, the above steps 401 and 402 can be deleted according to the situation. Or, the above step 406 can be deleted according to the situation, or can be performed before step 405.
综上所述,本申请实施例提供了一种算力请求方法,第一算力服务实例对应的第一节点接收到算力请求报文后,能够基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例。也即是,本申请实施例提供的方法能够由各个实例节点通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。进而,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, the embodiments of this application provide a computing power request method. After the first node corresponding to the first computing power service instance receives the computing power request message, it can based on whether the first computing power service instance satisfies the service requirement. condition to determine whether the first computing power service instance provides the target computing power service to the client device. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to centrally sense the computing power of a large number of computing power service instances. load, and select a matching computing service instance. That is to say, the method provided by the embodiments of this application can allow each instance node to determine the computing power service instance used to provide the target computing power service through distributed decision-making, thereby effectively reducing the information release during centralized decision-making by routing nodes. pressure. Furthermore, it effectively reduces the performance requirements for routing nodes and improves the efficiency of providing computing power services.
图12是本申请实施例提供的再一种算力请求方法的流程图。如图12所示,该方法包括:Figure 12 is a flow chart of yet another computing power request method provided by an embodiment of the present application. As shown in Figure 12, the method includes:
步骤S1、实例节点接收来自客户端设备的算力请求报文。Step S1: The instance node receives the computing power request message from the client device.
该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例,其中每个算力服务实例用于提供目标算力服务。该实例节点与该多个算力服务实例中的一个算力服务实例对应,且该实例节点对应的算力服务实例可以是段列表中除最后一个段标识之外的任一段标识指示的算力服务实例。可选地,该实例节点可以是其对应的算力服务实例所属的服务节点,或者可以是与该服务节点直连的路由节点。The segment list of the computing power request message includes multiple segment identifiers, and the multiple segment identifiers respectively indicate multiple computing power service instances, where each computing power service instance is used to provide the target computing power service. The instance node corresponds to one computing power service instance among the multiple computing power service instances, and the computing power service instance corresponding to the instance node can be the computing power indicated by any segment identifier in the segment list except the last segment identifier. Service instance. Optionally, the instance node may be a service node to which its corresponding computing power service instance belongs, or may be a routing node directly connected to the service node.
步骤S2、实例节点在该算力请求报文中封装其对应的算力服务实例的算力负载。Step S2: The instance node encapsulates the computing power load of its corresponding computing power service instance in the computing power request message.
在本申请实施例中,实例节点接收到算力请求报文后,可以将其对应的算力服务实例的算力负载封装至该算力请求报文。例如,实例节点可以将算力负载(如融合开销)封装至算力请求报文的SRH的扩展TLV中。In this embodiment of the present application, after receiving the computing power request message, the instance node can encapsulate the computing power load of its corresponding computing power service instance into the computing power request message. For example, the instance node can encapsulate the computing power load (such as the integration overhead) into the extended TLV of the SRH of the computing power request message.
步骤S3、实例节点根据段列表转发该算力请求报文。Step S3: The instance node forwards the computing power request message according to the segment list.
实例节点可以按照SRv6报文的转发机制,根据段列表转发该算力请求报文。可以理解的是,该段列表中除最后一个段标识之外的每个段标识指示的算力服务实例所对应的实例节点均可以执行上述步骤S1至步骤S3所示的方法。The instance node can forward the computing power request message according to the segment list according to the SRv6 message forwarding mechanism. It can be understood that the instance node corresponding to the computing service instance indicated by each segment identifier except the last segment identifier in the segment list can execute the method shown in steps S1 to S3 above.
步骤S4、尾实例节点基于多个算力服务实例的算力负载,向目标算力服务实例所对应的实例节点发送服务指示。Step S4: The tail instance node sends a service instruction to the instance node corresponding to the target computing service instance based on the computing power load of the multiple computing power service instances.
该尾实例节点是与段列表中最后一个段标识指示的算力服务实例对应的实例节点。基于上述步骤S1至步骤S3可知,尾实例节点接收到的算力请求报文中还包括:该多个算力服务实例中除尾实例节点对应的算力服务实例之外的其他算力服务实例的算力负载。因此,尾实例节点能够基于该多个算力服务实例的算力负载,向目标算力服务实例所对应的实例节点发送服务指示,以使该目标算力服务实例向客户端设备提供目标算力服务。The tail instance node is the instance node corresponding to the computing service instance indicated by the last segment identifier in the segment list. Based on the above steps S1 to S3, it can be known that the computing power request message received by the tail instance node also includes: other computing power service instances among the multiple computing power service instances except the computing power service instance corresponding to the tail instance node. computing power load. Therefore, the tail instance node can send a service instruction to the instance node corresponding to the target computing power service instance based on the computing power load of the multiple computing power service instances, so that the target computing power service instance provides the target computing power to the client device. Serve.
其中,该目标算力服务实例可以满足:目标算力服务实例的算力负载小于第一负载阈值。Wherein, the target computing power service instance can satisfy: the computing power load of the target computing power service instance is less than the first load threshold.
可选地,该目标算力服务实例还可以满足:目标算力服务实例对应的实例节点与第一路由节点之间的网络性能满足第一通信条件。其中,该第一路由节点为与客户端设备连接的路由节点。Optionally, the target computing power service instance may also satisfy: the network performance between the instance node corresponding to the target computing power service instance and the first routing node meets the first communication condition. Wherein, the first routing node is a routing node connected to the client device.
可以理解的是,该算力请求报文中还可以包括:第一路由节点与该多个算力服务实例对应的实例节点之间的网络性能参数,尾实例节点可以基于该网络性能参数确定满足第一通信条件的目标算力服务实例。It can be understood that the computing power request message may also include: network performance parameters between the first routing node and the instance nodes corresponding to the multiple computing power service instances, and the tail instance node may determine based on the network performance parameters that satisfy The target computing power service instance of the first communication condition.
其中,该网络性能参数可以是由第一路由节点检测并封装至算力请求报文的。或者,该多个算力服务实例中,除尾实例节点对应的算力服务实例之外的其他算力服务实例所对应的实例节点可以检测并封装其与第一路由节点之间的网络性能参数。The network performance parameter may be detected by the first routing node and encapsulated into the computing power request message. Alternatively, among the multiple computing power service instances, the instance nodes corresponding to other computing power service instances except the computing power service instance corresponding to the tail instance node can detect and encapsulate the network performance parameters between them and the first routing node. .
还可以理解的是,上述步骤S2可以根据情况删除,相应的,在上述步骤S4中,尾实例节点可以基于其他方式从多个算力服务实例中确定目标算力服务实例。例如,尾实例节点可以从多个算力服务实例中随机选取一个算力服务实例作为目标算力服务实例。或者,尾实例节点可以基于网络性能参数确定目标算力服务实例。It can also be understood that the above-mentioned step S2 can be deleted according to the situation. Correspondingly, in the above-mentioned step S4, the tail instance node can determine the target computing power service instance from multiple computing power service instances based on other methods. For example, the tail instance node can randomly select a computing power service instance from multiple computing power service instances as the target computing power service instance. Alternatively, the tail instance node can determine the target computing service instance based on network performance parameters.
关于上述第一负载阈值和第一通信条件的解释可以参考图8所示实施例中步骤309之后的相关描述,此处不再赘述。For explanations about the above-mentioned first load threshold and first communication condition, reference may be made to the relevant description after step 309 in the embodiment shown in FIG. 8 , which will not be described again here.
图13是本申请实施例提供的一种第一节点的结构示意图。该第一节点可以应用于诸如图1所示的算力网络,且该第一节点对应第一算力服务实例,该第一算力服务实例用于提供目标算力服务。例如,该第一节点可以是部署有第一算力服务实例的服务节点,或者可以是与该服务节点直连的路由节点。并且,该第一节点可以实现上述方法实施例中由第一节点或第二路由节点执行的步骤。参考图13,该第一节点包括:Figure 13 is a schematic structural diagram of a first node provided by an embodiment of the present application. The first node can be applied to a computing power network such as that shown in Figure 1, and the first node corresponds to a first computing power service instance, and the first computing power service instance is used to provide a target computing power service. For example, the first node may be a service node where the first computing power service instance is deployed, or may be a routing node directly connected to the service node. Moreover, the first node can implement the steps performed by the first node or the second routing node in the above method embodiment. Referring to Figure 13, the first node includes:
接收模块501,用于接收来自客户端设备的算力请求报文,该算力请求报文用于请求目标算力服务。该接收模块501的功能实现可以参考上述方法实施例中步骤101、步骤304和步骤404的相关描述。The receiving module 501 is configured to receive a computing power request message from a client device, where the computing power request message is used to request a target computing power service. For the functional implementation of the receiving module 501, please refer to the relevant descriptions of step 101, step 304 and step 404 in the above method embodiment.
确定模块502,用于基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。该确定模块502的功能实现可以参考上述方法实施例中步骤102、步骤305和步骤405的相关描述。The determination module 502 is configured to determine whether the first computing power service instance provides the target computing power service to the client device based on whether the first computing power service instance satisfies the service condition. For the functional implementation of the determination module 502, please refer to the relevant descriptions of step 102, step 305 and step 405 in the above method embodiment.
可选地,该算力请求报文的段列表中包括多个段标识;该多个段标识分别指示多个算力服务实例,且该多个算力服务实例分别用于提供目标算力服务,该多个算力服务实例包括该第一算力服务实例。Optionally, the segment list of the computing power request message includes multiple segment identifiers; the multiple segment identifiers respectively indicate multiple computing power service instances, and the multiple computing power service instances are respectively used to provide target computing power services. , the plurality of computing power service instances include the first computing power service instance.
可选地,如图13所示,该第一节点还可以包括发送模块503。Optionally, as shown in Figure 13, the first node may also include a sending module 503.
该确定模块502,可以用于基于第一算力服务实例满足该服务条件,确定由该第一算力服务实例向客户端设备提供该目标算力服务,并通过该发送模块503向客户端设备发送算力响应报文;或者,基于第一算力服务实例不满足服务条件,确定不由该第一算力服务实例向客户端设备提供目标算力服务,并通过该发送模块503根据该段列表转发算力请求报文。The determination module 502 may be used to determine that the first computing power service instance provides the target computing power service to the client device based on the first computing power service instance satisfying the service condition, and send the target computing power service to the client device through the sending module 503. Send a computing power response message; or, based on the fact that the first computing power service instance does not meet the service conditions, determine that the first computing power service instance does not provide the target computing power service to the client device, and use the sending module 503 according to the segment list Forward the computing power request message.
其中,该确定模块502和发送模块503的功能实现还可以参考上述方法实施例中步骤307、步骤308、步骤407和步骤408的相关描述。The functional implementation of the determining module 502 and the sending module 503 may also refer to the relevant descriptions of steps 307, 308, 407 and 408 in the above method embodiment.
可选地,该算力请求报文的目的地址字段包括用于指示该第一算力服务实例的第一段标识,该多个段标识包括该第一段标识。该发送模块503可以用于:将该算力请求报文的目的地址字段的值更新为段列表中的第二段标识,并将该算力请求报文转发至第二节点。其中,该第二节点对应第二段标识指示的第二算力服务实例,该第二段标识为段列表中与第一段标识相邻的下一个段标识。Optionally, the destination address field of the computing power request message includes a first segment identifier used to indicate the first computing power service instance, and the plurality of segment identifiers include the first segment identifier. The sending module 503 may be configured to: update the value of the destination address field of the computing power request message to the second segment identifier in the segment list, and forward the computing power request message to the second node. The second node corresponds to the second computing power service instance indicated by the second segment identifier, and the second segment identifier is the next segment identifier adjacent to the first segment identifier in the segment list.
可选地,该发送模块503还可以用于在该算力请求报文中封装第一算力服务实例的算力负载。Optionally, the sending module 503 may also be configured to encapsulate the computing power load of the first computing power service instance in the computing power request message.
可选地,该算力请求报文的目的地址字段包括用于指示该第一算力服务实例的第一段标识,该多个段标识包括该第一段标识。该发送模块503可以用于:基于该第一段标识为段列表中的最后一个段标识,将算力请求报文的目的地址字段的值更新为该段列表中的第三段标识,得到服务指示;将该服务指示转发至第三节点,该第三节点对应第三段标识指示的第三算力服务实例,且该服务指示用于指示该第三算力服务实例向客户端设备提供目标算力服务。Optionally, the destination address field of the computing power request message includes a first segment identifier used to indicate the first computing power service instance, and the plurality of segment identifiers include the first segment identifier. The sending module 503 can be used to: based on the first segment identifier being the last segment identifier in the segment list, update the value of the destination address field of the computing power request message to the third segment identifier in the segment list to obtain service Instruction; forward the service instruction to a third node, the third node corresponds to the third computing power service instance indicated by the third segment identifier, and the service instruction is used to instruct the third computing power service instance to provide the target to the client device Computing service.
可选地,该第三算力服务实例是从多个算力服务实例中随机选取的;或者,该第三算力服务实例满足下述条件中的至少一种:第三算力服务实例的算力负载小于第一负载阈值;第三算力服务实例对应的第三节点与第一路由节点之间的网络性能满足第一通信条件,该第一路由节点为与客户端设备连接的路由节点。Optionally, the third computing power service instance is randomly selected from multiple computing power service instances; or, the third computing power service instance meets at least one of the following conditions: The computing power load is less than the first load threshold; the network performance between the third node corresponding to the third computing power service instance and the first routing node meets the first communication condition, and the first routing node is a routing node connected to the client device .
可选地,该确定模块502,可以用于基于该第一段标识为段列表中的最后一个段标识,确定由第一算力服务实例向客户端设备提供目标算力服务,并通过该发送模块503向客户端设备发送算力响应报文。该确定模块502的功能实现还可以参考上述方法实施例中步骤306和步骤406的相关描述。Optionally, the determining module 502 may be configured to determine, based on the first segment identifier being the last segment identifier in the segment list, to provide the target computing power service to the client device by the first computing power service instance, and send the Module 503 sends a computing power response message to the client device. For the functional implementation of the determination module 502, reference may also be made to the relevant descriptions of steps 306 and 406 in the above method embodiments.
可选地,该算力响应报文中可以包括该第一段标识。例如,算力响应报文的源地址字段或扩展报文头中包括该第一段标识。Optionally, the computing power response message may include the first segment identification. For example, the source address field or extended header of the computing power response message includes the first segment identifier.
可选地,该段列表还可以包括该目标算力服务的标识。Optionally, the segment list may also include the identification of the target computing power service.
可选地,该服务条件可以包括:第一算力服务实例的算力负载小于第二负载阈值。该服务条件还可以包括:客户端设备所连接的第一路由节点与第一节点之间的网络性能满足第二通信条件。Optionally, the service condition may include: the computing power load of the first computing power service instance is less than the second load threshold. The service condition may further include: network performance between the first routing node connected to the client device and the first node meets the second communication condition.
综上所述,本申请实施例提供了一种第一节点,该第一节点接收到算力请求报文后,能够基于其对应的第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备提供目标算力服务。由于第一节点能够直接判断其对应的第一算力服务实例是否满足服务条件,并确定是否提供目标算力服务,因此无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例。也即是,第一节点以及其他算力服务实例对应的节点能够通过分布式决策的方式确定用于提供目标算力服务的算力服务实例,从而有效减轻了由路由节点集中式决策时的信息发布的压力。进而,有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, the embodiments of the present application provide a first node. After receiving the computing power request message, the first node can determine whether to use the computing power request message based on whether the corresponding first computing power service instance satisfies the service conditions. The first computing power service instance provides the target computing power service to the client device. Since the first node can directly determine whether its corresponding first computing power service instance meets the service conditions and determine whether to provide the target computing power service, there is no need for routing nodes in the computing power network to centrally sense the computing power of a large number of computing power service instances. load, and select a matching computing service instance. That is to say, the first node and the nodes corresponding to other computing power service instances can determine the computing power service instance used to provide the target computing power service through distributed decision-making, thereby effectively reducing the information in the centralized decision-making process by the routing node. Pressure to publish. Furthermore, it effectively reduces the performance requirements for routing nodes and improves the efficiency of providing computing power services.
图14是本申请实施例提供的一种请求节点的结构示意图。该请求节点可以应用于诸如图1所示的算力网络,且该请求节点可以是客户端设备,或者可以是与客户端设备直连的路由节点。并且,该请求节点可以实现上述方法实施例中由请求节点、客户端设备或第一路由节点执行的步骤。参考图14,该请求节点包括:Figure 14 is a schematic structural diagram of a request node provided by an embodiment of the present application. The request node may be applied to a computing power network such as that shown in Figure 1, and the request node may be a client device, or may be a routing node directly connected to the client device. Moreover, the requesting node may implement the steps performed by the requesting node, the client device or the first routing node in the above method embodiment. Referring to Figure 14, the request node includes:
生成模块601,用于生成算力请求报文,该算力请求报文的段列表中包括多个段标识。该多个段标识分别指示多个算力服务实例,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。该生成模块601的功能实现可以参考上述方法实施例中步骤201和步骤403的相关描述。The generation module 601 is configured to generate a computing power request message, where the segment list of the computing power request message includes multiple segment identifiers. The multiple segment identifiers respectively indicate multiple computing power service instances, and each of the multiple computing power service instances is used to provide a target computing power service. For the functional implementation of the generation module 601, please refer to the relevant descriptions of step 201 and step 403 in the above method embodiment.
发送模块602,用于根据该段列表转发该算力请求报文,以使该多个算力服务实例中的满足服务条件的至少一个算力服务实例向客户端设备提供目标算力服务。该发送模块602的功能实现可以参考上述方法实施例中步骤202和步骤404的相关描述。The sending module 602 is configured to forward the computing power request message according to the segment list, so that at least one computing power service instance among the plurality of computing power service instances that meets the service condition provides the target computing power service to the client device. For the functional implementation of the sending module 602, please refer to the relevant descriptions of steps 202 and 404 in the above method embodiment.
可选地,对于该请求节点为客户端设备的场景,该生成模块601,可以用于基于该客户端设备所请求的目标算力服务,将与该目标算力服务对应的多个段标识封装至算力请求报文。Optionally, for a scenario where the requesting node is a client device, the generation module 601 may be configured to encapsulate multiple segment identifiers corresponding to the target computing service based on the target computing service requested by the client device. to the computing power request message.
可选地,该发送模块602,还可以用于向DNS服务器发送针对该目标算力服务的域名请求。并且,如图14所示,该客户端设备还可以包括接收模块603,用于接收该DNS服务器发送的与该目标算力服务对应的多个段标识。Optionally, the sending module 602 can also be used to send a domain name request for the target computing power service to the DNS server. Moreover, as shown in Figure 14, the client device may also include a receiving module 603, configured to receive multiple segment identifiers corresponding to the target computing power service sent by the DNS server.
其中,该发送模块602的功能实现还可以参考上述方法实施例中步骤401的相关描述。该接收模块603的功能实现可以参考上述方法实施例中步骤402的相关描述。The functional implementation of the sending module 602 may also refer to the relevant description of step 401 in the above method embodiment. For the functional implementation of the receiving module 603, please refer to the relevant description of step 402 in the above method embodiment.
可选地,对于该请求节点为与该户端设备连接的路由节点的场景,该生成模块601,可以用于接收客户端设备发送的初始请求报文,该初始请求报文包括目标算力服务的标识;在该初始请求报文中封装与该目标算力服务的标识对应的多个段标识,得到算力请求报文。该生成模块601的功能实现还可以参考上述方法实施例中步骤303的相关描述。Optionally, for the scenario where the requesting node is a routing node connected to the client device, the generating module 601 may be used to receive an initial request message sent by the client device, where the initial request message includes the target computing service. The identifier; encapsulate multiple segment identifiers corresponding to the identifier of the target computing power service in the initial request message to obtain the computing power request message. For the functional implementation of the generation module 601, you may also refer to the relevant description of step 303 in the above method embodiment.
可选地,该接收模块603,可以用于接收其他路由节点通告的与该目标算力服务的标识对应的多个段标识。Optionally, the receiving module 603 may be configured to receive multiple segment identifiers corresponding to the identifier of the target computing service advertised by other routing nodes.
可选地,该生成模块601,还可以用于将该路由节点与该多个段标识指示的算力服务实例所对应的实例节点之间的网络性能参数封装至算力请求报文。Optionally, the generation module 601 may also be configured to encapsulate network performance parameters between the routing node and the instance node corresponding to the computing power service instance indicated by the multiple segment identifiers into the computing power request message.
可选地,该算力请求报文包括SRH,该SRH中的扩展TLV字段包括该网络性能参数。Optionally, the computing power request message includes an SRH, and the extended TLV field in the SRH includes the network performance parameter.
可选地,该生成模块601,还可以用于从与该目标算力服务的标识对应的多个备选段标识中筛选出满足封装条件的多个段标识。其中,该封装条件包括:路由节点与段标识指示的算力服务实例所对应的实例节点之间的网络性能满足通信条件。Optionally, the generation module 601 may also be used to filter out multiple segment identifiers that meet the encapsulation conditions from multiple candidate segment identifiers corresponding to the identifier of the target computing power service. The encapsulation conditions include: the network performance between the routing node and the instance node corresponding to the computing power service instance indicated by the segment identifier meets the communication conditions.
可选地,该多个段标识按照网络性能由高到低的顺序依次排列。Optionally, the multiple segment identifiers are arranged in order from high to low network performance.
可选地,该接收模块603,还可以用于接收目标节点发送的算力响应报文,该算力响应报文中包括该目标节点对应的目标算力服务实例的目标段标识,该目标段标识属于该多个段标识。该接收模块603的功能实现还可以参考上述方法实施例中步骤308和步骤408的相关描述。Optionally, the receiving module 603 can also be used to receive a computing power response message sent by the target node. The computing power response message includes the target segment identifier of the target computing power service instance corresponding to the target node. The target segment The identifier belongs to the multiple segment identifiers. For the functional implementation of the receiving module 603, reference may also be made to the relevant descriptions of steps 308 and 408 in the above method embodiment.
该发送模块602,还可以用于向该目标节点发送算力任务报文,该算力任务报文的目的地址字段包括该目标段标识。该发送模块602的功能实现还可以参考上述方法实施例中步骤308和步骤408的相关描述。The sending module 602 may also be used to send a computing power task message to the target node, where the destination address field of the computing power task message includes the target segment identifier. For the functional implementation of the sending module 602, reference may also be made to the relevant descriptions of steps 308 and 408 in the above method embodiment.
可选地,该算力响应报文的源地址字段或扩展报文头中包括该目标段标识。Optionally, the source address field or extension header of the computing power response message includes the target segment identifier.
可选地,该目标算力服务的标识为该多个段标识的公共前缀。该发送模块602,还可以用于基于该目标段标识的前缀与目标算力服务的标识匹配,记录该目标算力服务与目标段标识的对应关系。Optionally, the identifier of the target computing power service is a common prefix of the multiple segment identifiers. The sending module 602 may also be configured to record the corresponding relationship between the target computing service and the target segment identifier based on the match between the prefix of the target segment identifier and the identifier of the target computing service.
可选地,该段列表中还可以包括该目标算力服务的标识。Optionally, the segment list may also include the identification of the target computing power service.
综上所述,本申请实施例提供了一种请求节点,该请求节点生成的算力请求报文中包括段列表,并可以根据该段列表转发该算力请求报文。由于该段列表中的每个段标识指示一个用于提供目标算力服务的算力服务实例,因此该算力请求报文能够沿段列表指示的转发路径依次转发至各个算力服务实例对应的实例节点。进而,可以由接收到算力请求报文的实例节点基于其对应的算力服务实例是否满足服务条件,并确定是否由该算力服务实例向客户端设备提供该目标算力服务。由此,无需再由算力网络中的路由节点集中感知大量算力服务实例的算力负载,并选择匹配的算力服务实例,从而有效降低了对路由节点的性能要求,提高了提供算力服务时的效率。To sum up, embodiments of the present application provide a requesting node. The computing power request message generated by the requesting node includes a segment list, and the computing power request message can be forwarded according to the segment list. Since each segment identifier in the segment list indicates a computing power service instance used to provide the target computing power service, the computing power request message can be forwarded to the corresponding computing power service instance along the forwarding path indicated by the segment list. Instance node. Furthermore, the instance node that receives the computing power request message can determine whether the computing power service instance provides the target computing power service to the client device based on whether its corresponding computing power service instance meets the service conditions. As a result, routing nodes in the computing network no longer need to centrally sense the computing load of a large number of computing service instances and select matching computing service instances, thereby effectively reducing the performance requirements for routing nodes and improving the provision of computing power. Efficiency in service.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的第一节点、请求节点及各模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and simplicity of description, the specific working processes of the first node, the request node and each module described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be repeated here. Repeat.
应理解的是,本申请实施例提供的第一节点和请求节点还可以用特定应用集成电路(application-specific integrated circuit,ASIC)实现,或可编程逻辑器件(programmable logic device,PLD)实现,上述PLD可以是复杂程序逻辑器件(complexprogrammable logical device,CPLD),现场可编程门阵列(field-programmable gatearray,FPGA),通用阵列逻辑(generic array logic,GAL)或其任意组合。此外,也可以通过软件实现上述方法实施例提供的算力请求方法,当通过软件实现上述方法实施例提供的算力请求方法时,该第一节点和请求节点中的各个功能模块也可以为软件模块。It should be understood that the first node and the requesting node provided by the embodiment of the present application can also be implemented by an application-specific integrated circuit (ASIC) or a programmable logic device (PLD), as mentioned above. PLD can be a complex programmable logical device (CPLD), a field-programmable gate array (FPGA), a general array logic (GAL), or any combination thereof. In addition, the computing power requesting method provided by the above method embodiment can also be implemented through software. When the computing power requesting method provided by the above method embodiment is implemented through software, each functional module in the first node and the requesting node can also be software. module.
图15是本申请实施例提供的一种算力请求设备的结构示意图。该算力请求设备可以应用于诸如图1所示的算力网络,且可以为该网络中的第一节点或请求节点。也即是,该算力请求设备可以是客户端设备、路由节点或服务节点。参考图15,该算力请求设备包括:处理器701、存储器702、网络接口703和总线704。Figure 15 is a schematic structural diagram of a computing power request device provided by an embodiment of the present application. The computing power requesting device can be applied to a computing power network such as that shown in Figure 1, and can be the first node or requesting node in the network. That is, the computing power requesting device may be a client device, a routing node or a service node. Referring to Figure 15, the computing power requesting device includes: a processor 701, a memory 702, a network interface 703 and a bus 704.
其中,存储器702中存储有计算机程序7021,计算机程序7021用于实现各种应用功能。处理器701用于执行该计算机程序7021以实现上述方法实施例提供的应用于第一节点、第二节点或第三节点的方法。例如,处理器701可以用于执行该计算机程序7021以实现上述图13或图14所示节点中各个模块的功能。Among them, the computer program 7021 is stored in the memory 702, and the computer program 7021 is used to implement various application functions. The processor 701 is configured to execute the computer program 7021 to implement the method provided by the above method embodiment and applied to the first node, the second node or the third node. For example, the processor 701 can be used to execute the computer program 7021 to implement the functions of each module in the node shown in Figure 13 or Figure 14.
处理器701可以是中央处理器(central processing unit,CPU),该处理器701还可以是其他通用处理器、数字信号处理器(digital signal processor,DSP)、ASIC、FPGA、图形处理器(graphics processing unit,GPU)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者是任何常规的处理器。The processor 701 may be a central processing unit (CPU). The processor 701 may also be other general-purpose processors, digital signal processors (DSP), ASICs, FPGAs, or graphics processing units. unit, GPU) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor can be a microprocessor or any conventional processor.
存储器702可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(read-only memory,ROM)、可编程只读存储器(programmable ROM,PROM)、可擦除可编程只读存储器(erasable PROM,EPROM)、电可擦除可编程只读存储器(electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(random access memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(static RAM,SRAM)、动态随机存取存储器(dynamic RAM,DRAM)、同步动态随机存取存储器(synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(double data date SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(synchlink DRAM,SLDRAM)和直接内存总线随机存取存储器(direct rambus RAM,DRRAM)。Memory 702 may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. Among them, the non-volatile memory can be read-only memory (ROM), programmable ROM (PROM), erasable programmable read-only memory (erasable PROM, EPROM), electrically removable memory. Erase programmable read-only memory (electrically EPROM, EEPROM) or flash memory. Volatile memory may be random access memory (RAM), which is used as an external cache. By way of illustration, but not limitation, many forms of RAM are available, such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data date SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchronous link dynamic random access memory (synchlink DRAM, SLDRAM) ) and direct memory bus random access memory (direct rambus RAM, DRRAM).
网络接口703可以为多个,且网络接口703用于实现与其他设备之间的通信连接(可以是有线或者无线)。其中,在本申请实施例中,网络接口703用于收发报文。其中,其他设备可以是终端、服务器、VM等设备或其它网络设备。There may be multiple network interfaces 703 , and the network interfaces 703 are used to implement communication connections with other devices (which may be wired or wireless). Among them, in this embodiment of the present application, the network interface 703 is used to send and receive messages. Among them, other devices may be terminals, servers, VMs and other devices or other network devices.
总线704用于连接处理器701、存储器702和网络接口703。并且,总线704除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都标为总线704。The bus 704 is used to connect the processor 701, the memory 702 and the network interface 703. Furthermore, in addition to the data bus, the bus 704 may also include a power bus, a control bus, a status signal bus, etc. However, for the sake of clarity, the various buses are labeled bus 704 in the figure.
若该算力请求设备为第一节点,则该第一节点对应第一算力服务实例,该第一算力服务实例用于提供目标算力服务。该处理器701用于通过网络接口703接收来自客户端设备的算力请求报文,该算力请求报文用于请求目标算力服务,以及用于基于该第一算力服务实例是否满足服务条件,确定是否由该第一算力服务实例向客户端设备所述目标算力服务。该处理器701的详细处理过程请参考上述图6、图8和图10所示方法实施例中由第一节点(例如第二路由节点或第一服务节点)执行的步骤,这里不再赘述。If the computing power requesting device is a first node, the first node corresponds to a first computing power service instance, and the first computing power service instance is used to provide the target computing power service. The processor 701 is configured to receive a computing power request message from the client device through the network interface 703. The computing power request message is used to request a target computing power service, and is used to determine whether the first computing power service instance satisfies the service requirement. condition to determine whether the first computing power service instance provides the target computing power service to the client device. For the detailed processing process of the processor 701, please refer to the steps performed by the first node (for example, the second routing node or the first service node) in the method embodiments shown in FIG. 6, FIG. 8, and FIG. 10, which will not be described again here.
若该算力请求设备为请求节点,则该处理器701用于生成算力请求报文,该算力请求报文的段列表中包括多个段标识,该多个段标识分别指示多个算力服务实例,该多个算力服务实例中的每个算力服务实例用于提供目标算力服务。该处理器701还用于根据该段列表,并通过网络接口703转发该算力请求报文,以使该多个算力服务实例中的满足服务条件的至少一个算力服务实例向客户端设备提供目标算力服务。该处理器701的详细处理过程请参考上述图7、图8和图10所示方法实施例中由请求节点(例如客户端设备或第一路由节点)执行的步骤,这里不再赘述。If the computing power requesting device is a requesting node, the processor 701 is configured to generate a computing power request message. The segment list of the computing power request message includes multiple segment identifiers, and the multiple segment identifiers respectively indicate multiple computing power requests. A computing power service instance, each of the multiple computing power service instances is used to provide a target computing power service. The processor 701 is also configured to forward the computing power request message through the network interface 703 according to the segment list, so that at least one computing power service instance among the multiple computing power service instances that meets the service conditions is forwarded to the client device. Provide target computing power services. For the detailed processing process of the processor 701, please refer to the steps performed by the requesting node (such as the client device or the first routing node) in the method embodiments shown in FIG. 7, FIG. 8 and FIG. 10, which will not be described again here.
图16是本申请实施例提供的另一种算力请求设备的结构示意图。该算力请求设备可以应用于诸如图1所示的算力网络,且可以为该网络中的第一节点或请求节点。也即是,该算力请求设备可以是客户端设备、路由节点或服务节点。如图16所示,该算力请求设备可以包括:主控板801和至少一个接口板(接口板也称为线卡或业务板),例如图16中示出了接口板802和接口板803。多个接口板的情况下算力请求设备还可以包括交换网板804,该交换网板804用于完成各接口板之间的数据交换。Figure 16 is a schematic structural diagram of another computing power requesting device provided by an embodiment of the present application. The computing power requesting device can be applied to a computing power network such as that shown in Figure 1, and can be the first node or requesting node in the network. That is, the computing power requesting device may be a client device, a routing node or a service node. As shown in Figure 16, the computing power requesting device may include: a main control board 801 and at least one interface board (the interface board is also called a line card or a service board). For example, Figure 16 shows the interface board 802 and the interface board 803. . In the case of multiple interface boards, the computing power requesting device may also include a switching network board 804, which is used to complete data exchange between the interface boards.
其中,主控板801也称为主处理单元(main processing unit,MPU)或路由处理卡(route processor card),主控板801用于完成系统管理、设备维护和协议处理等功能。主控板801上主要有3类功能单元:系统管理控制单元、系统时钟单元和系统维护单元。主控板801包括:中央处理器8011和存储器8012。The main control board 801 is also called a main processing unit (MPU) or a route processor card. The main control board 801 is used to complete functions such as system management, device maintenance, and protocol processing. There are three main types of functional units on the main control board 801: system management control unit, system clock unit and system maintenance unit. The main control board 801 includes: a central processing unit 8011 and a memory 8012.
接口板802和803也称为线路接口单元卡(line processing unit,LPU)、线卡(line card)或业务板,接口板用于提供各种业务接口,并实现报文的转发。其中,接口板所提供的业务接口可以包括:基于SONET/SDH的数据包(packet over SONET/SDH,POS)接口、千兆以太网(gigabit Ethernet,GE)接口和异步传输模式(asynchronous transfer mode,ATM)接口等。其中,SONET是指同步光纤网络(synchronous optical network),SDH是指同步数字体系(synchronous digital hierarchy)。主控板801、接口板802以及接口板803之间通过系统总线与系统背板相连以实现互通。如图16所示,接口板802上包括一个或多个中央处理器8021。中央处理器8021用于对接口板802进行控制管理并与主控板801上的中央处理器8011进行通信。接口板802上的存储器8024用于存储转发表项,网络处理器8022可以通过查找存储器8024中存储的转发表项进行报文的转发。存储器8024还可以用于存储程序代码。The interface boards 802 and 803 are also called line processing unit (LPU), line card (line card) or service boards. The interface boards are used to provide various service interfaces and implement packet forwarding. Among them, the service interfaces provided by the interface board may include: SONET/SDH-based packet (packet over SONET/SDH, POS) interface, Gigabit Ethernet (gigabit Ethernet, GE) interface and asynchronous transfer mode (asynchronous transfer mode), ATM) interface, etc. Among them, SONET refers to synchronous optical network (synchronous optical network), and SDH refers to synchronous digital hierarchy (synchronous digital hierarchy). The main control board 801, the interface board 802 and the interface board 803 are connected to the system backplane through a system bus to achieve intercommunication. As shown in Figure 16, the interface board 802 includes one or more central processors 8021. The central processor 8021 is used to control and manage the interface board 802 and communicate with the central processor 8011 on the main control board 801 . The memory 8024 on the interface board 802 is used to store forwarding entries, and the network processor 8022 can forward messages by searching for the forwarding entries stored in the memory 8024. Memory 8024 may also be used to store program code.
该接口板802还包括一个或多个物理接口卡8023,该一个或多个物理接口卡8023用于接收上一跳节点发送的报文,并根据中央处理器8021的指示向下一跳节点发送处理后的报文。The interface board 802 also includes one or more physical interface cards 8023. The one or more physical interface cards 8023 are used to receive messages sent by the previous hop node and send them to the next hop node according to the instructions of the central processor 8021. processed message.
此外,可以理解的是,图16中的接口板802中的中央处理器8021和/或网络处理器8022可以是专用硬件或芯片,如可以采用ASIC来实现上述功能,这种实现方式即为通常所说的转发面采用专用硬件或芯片处理的方式。在另外的实施方式中,所述中央处理器8021和/或网络处理器8022也可以采用通用的处理器,如通用的CPU来实现以上描述的功能。In addition, it can be understood that the central processor 8021 and/or the network processor 8022 in the interface board 802 in Figure 16 can be dedicated hardware or chips. For example, an ASIC can be used to implement the above functions. This implementation method is generally The forwarding plane is processed by dedicated hardware or chips. In other embodiments, the central processor 8021 and/or the network processor 8022 may also use a general-purpose processor, such as a general-purpose CPU, to implement the functions described above.
此外应理解的是,主控板801可能有一块或多块,有多块的时候可以包括主用主控板和备用主控板。接口板可能有一块或多块,该算力请求设备的数据处理能力越强,提供的接口板越多。如图16所示,算力请求设备包括接口板802和接口板803。当采用分布式的转发机制时,接口板803的结构与接口板802的结构基本相同,且接口板803上的操作与接口板802的操作基本相似,为了简洁,不再赘述。算力请求设备具有多块接口板的情况下,该多块接口板之间可以通过一块或多块交换网板804通信,且可以实现负荷分担和冗余备份,以提供大容量的数据交换和处理能力。In addition, it should be understood that there may be one or more main control boards 801, and when there are multiple main control boards, they may include a main main control board and a backup main control board. There may be one or more interface boards. The stronger the data processing capability of the device required by the computing power, the more interface boards are provided. As shown in Figure 16, the computing power requesting device includes an interface board 802 and an interface board 803. When a distributed forwarding mechanism is adopted, the structure of the interface board 803 is basically the same as that of the interface board 802, and the operations on the interface board 803 are basically similar to the operations on the interface board 802. For the sake of simplicity, they will not be described again. When the computing power requesting device has multiple interface boards, the multiple interface boards can communicate with each other through one or more switching network boards 804, and can realize load sharing and redundant backup to provide large-capacity data exchange and processing power.
在集中式转发架构下,该算力请求设备可以不需要交换网板804,接口板承担整个系统的业务数据的处理功能。所以,分布式架构的算力请求设备的数据接入和处理能力要大于集中式架构的算力请求设备。具体采用哪种架构,取决于具体的组网部署场景,此处不做限定。Under the centralized forwarding architecture, the computing power requesting device does not need the switching network board 804, and the interface board is responsible for processing the business data of the entire system. Therefore, the data access and processing capabilities of computing power requesting equipment in a distributed architecture are greater than those in a centralized architecture. The specific architecture used depends on the specific networking deployment scenario and is not limited here.
在本申请实施例中,存储器8012和存储器8024可以是ROM或可存储静态信息和指令的其它类型的静态存储设备,也可以是RAM或者可存储信息和指令的其它类型的动态存储设备,还可以是EEPROM、只读光盘(compact disc read-only Memory,CD-ROM)或其它光盘存储、光碟存储(包括压缩光碟、激光碟、光碟、数字通用光碟、蓝光光碟等)、磁盘或者其它磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其它介质,但不限于此。接口板802中的存储器8024可以是独立存在,并通过通信总线与中央处理器8021相连接;或者,存储器8024也可以和中央处理器8021集成在一起。主控板801中的存储器8012可以是独立存在,并通过通信总线与中央处理器8011相连接;或者,存储器8012也可以和中央处理器8011集成在一起。In this embodiment of the present application, the memory 8012 and the memory 8024 can be ROM or other types of static storage devices that can store static information and instructions, or RAM or other types of dynamic storage devices that can store information and instructions. They can also be It is EEPROM, compact disc read-only memory (CD-ROM) or other optical disc storage, optical disc storage (including compressed optical disc, laser disc, optical disc, digital versatile disc, Blu-ray disc, etc.), magnetic disk or other magnetic storage device , or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited thereto. The memory 8024 in the interface board 802 can exist independently and be connected to the central processor 8021 through a communication bus; or the memory 8024 can also be integrated with the central processor 8021. The memory 8012 in the main control board 801 can exist independently and be connected to the central processor 8011 through a communication bus; or the memory 8012 can also be integrated with the central processor 8011.
存储器8024中存储的程序代码,由中央处理器8021来控制执行,存储器8012存储的程序代码,由中央处理器8011来控制执行。该中央处理器8021和/或中央处理器8011可以通过执行程序代码来实现上述方法实施例所提供的由任一节点执行的方法。存储器8024和/或存储器8012存储的程序代码中可以包括一个或多个软件单元。这一个或多个软件单元可以为图13至图15中任一附图所示的功能模块。The program code stored in the memory 8024 is controlled and executed by the central processor 8021, and the program code stored in the memory 8012 is controlled and executed by the central processor 8011. The central processor 8021 and/or the central processor 8011 can implement the method executed by any node provided by the above method embodiments by executing program codes. One or more software units may be included in the program code stored in memory 8024 and/or memory 8012. This one or more software units may be functional modules shown in any of the figures in Figures 13 to 15.
在本申请实施例中,该物理接口卡8023,可以是使用任何收发器一类的装置,用于与其它设备或通信网络通信,如以太网,无线接入网(radio access network,RAN),无线局域网(wireless local area networks,WLAN)等。In this embodiment of the present application, the physical interface card 8023 can be a device using any transceiver, used to communicate with other devices or communication networks, such as Ethernet, radio access network (radio access network, RAN), Wireless local area networks (WLAN), etc.
可选地,图13至图15中任一附图所示的设备也可以采用图16所示的结构实现。Optionally, the device shown in any one of Figures 13 to 15 can also be implemented using the structure shown in Figure 16 .
本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当该指令在处理器运行时,使得处理器执行如上述方法实施例中由第一节点或请求节点执行的步骤。Embodiments of the present application also provide a computer-readable storage medium. The computer-readable storage medium stores instructions. When the instructions are run on the processor, the processor executes the steps performed by the first node or the processor in the above method embodiment. The steps to be performed by the requesting node.
本申请实施例还提供了一种包含指令的计算机程序产品,当该指令在处理器上运行时,使得处理器执行如上述方法实施例中由第一节点或请求节点执行的步骤。Embodiments of the present application also provide a computer program product containing instructions. When the instructions are run on a processor, they cause the processor to perform the steps performed by the first node or the requesting node in the above method embodiments.
本申请实施例还提供了一种算力请求系统,该算力请求系统包括多个实例节点,该多个实例节点中的至少一个实例节点可以用于实现上述方法实施例中由第一节点执行的方法。例如,参考图1,该算力请求系统中的路由节点01或者服务节点02可以用于实现上述方法实施例中由第一节点执行的方法。Embodiments of the present application also provide a computing power request system. The computing power request system includes multiple instance nodes. At least one instance node among the multiple instance nodes can be used to implement the method executed by the first node in the above method embodiment. Methods. For example, referring to Figure 1, the routing node 01 or the service node 02 in the computing power request system can be used to implement the method performed by the first node in the above method embodiment.
可选地,该多个实例节点中的每个实例节点可以为其所对应的算力服务实例所属的服务节点;或者,该多个节点中的每个实例节点为与其所对应的算力服务实例所属的服务节点直连的路由节点。Optionally, each instance node among the plurality of instance nodes may be a service node to which its corresponding computing power service instance belongs; or, each instance node among the plurality of nodes may be a service node to which its corresponding computing power service instance belongs. The routing node directly connected to the service node to which the instance belongs.
可选地,该系统还可以包括:请求节点,该请求节点可以用于实现上述方法实施例中由请求节点执行的方法。Optionally, the system may also include: a request node, which may be used to implement the method performed by the request node in the above method embodiment.
可选地,参考图1,该请求节点可以为客户端设备03,或者,该请求节点可以为与客户端设备03连接的路由节点。Optionally, referring to FIG. 1 , the requesting node may be the client device 03 , or the requesting node may be a routing node connected to the client device 03 .
其中,该实例节点的结构可以如图13、图15或图16所示,该请求节点的结构可以如图14、图15或图16所示。The structure of the instance node may be as shown in Figure 13, Figure 15 or Figure 16, and the structure of the request node may be as shown in Figure 14, Figure 15 or Figure 16.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps to implement the above embodiments can be completed by hardware, or can be completed by instructing relevant hardware through a program. The program can be stored in a computer-readable storage medium. The above-mentioned The storage media mentioned can be read-only memory, magnetic disks or optical disks, etc.
在本申请实施例中,术语“第一”、“第二”和“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性。本申请中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。In the embodiments of the present application, the terms "first", "second" and "third" are only used for description purposes and cannot be understood as indicating or implying relative importance. The term "and/or" in this application is just an association relationship describing related objects, indicating that there can be three relationships, for example, A and/or B, which can mean: A exists alone, A and B exist simultaneously, alone There are three situations B. In addition, the character "/" in this article generally indicates that the related objects are an "or" relationship.
以上所述仅为本申请的可选实施例,并不用以限制本申请,凡在本申请的构思和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。The above are only optional embodiments of the present application and are not intended to limit the present application. Any modifications, equivalent substitutions, improvements, etc. made within the concepts and principles of the present application shall be included in the protection of the present application. within the range.
Claims (39)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211004588 | 2022-08-22 | ||
| CN2022110045889 | 2022-08-22 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN117615017A true CN117615017A (en) | 2024-02-27 |
Family
ID=89944773
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202211195527.5A Pending CN117615017A (en) | 2022-08-22 | 2022-09-28 | Calculation force request method, device and system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN117615017A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119583427A (en) * | 2024-11-22 | 2025-03-07 | 烽火通信科技股份有限公司 | A service routing table management method and device |
| CN119892646A (en) * | 2024-11-28 | 2025-04-25 | 嵩山实验室 | Multimodal computing network system |
-
2022
- 2022-09-28 CN CN202211195527.5A patent/CN117615017A/en active Pending
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119583427A (en) * | 2024-11-22 | 2025-03-07 | 烽火通信科技股份有限公司 | A service routing table management method and device |
| CN119892646A (en) * | 2024-11-28 | 2025-04-25 | 嵩山实验室 | Multimodal computing network system |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11283707B2 (en) | Segment routing with fast reroute for container networking | |
| US12368669B2 (en) | Packet sending method, device, and system | |
| US11979322B2 (en) | Method and apparatus for providing service for traffic flow | |
| US10757231B2 (en) | Providing network efficiencies in forwarding packets among provider networks and applying segment routing policies | |
| US10333836B2 (en) | Convergence for EVPN multi-homed networks | |
| CN113207192B (en) | Message forwarding method and device | |
| US9294396B2 (en) | Port extender | |
| US11671483B2 (en) | In-band protocol-based in-network computation offload framework | |
| CN113132229B (en) | Segment identifier determination method and device | |
| US20220278925A1 (en) | Method for Delivering OAM Configuration Information and Control Node | |
| US11863454B2 (en) | Systems and methods for scalable validation of multiple paths in a network using segment routing | |
| US12040966B2 (en) | Path switching method, device, and system | |
| WO2022001835A1 (en) | Method and apparatus for sending message, and network device, system and storage medium | |
| CN108718269B (en) | Message processing method and device | |
| US12224939B2 (en) | Service packet forwarding method, apparatus, and computer storage medium | |
| CN117615017A (en) | Calculation force request method, device and system | |
| CN102946350B (en) | A kind of data transmission method based on priority and equipment | |
| CN117914820A (en) | Computing power routing method, device and system | |
| CN115242698A (en) | Message forwarding method, device, network equipment and storage medium | |
| WO2023005745A1 (en) | Message forwarding method, device and system, and computer-readable storage medium | |
| CN115473765A (en) | Message transmission method, device and system | |
| Nguyen et al. | Rpl border router redundancy in the internet of things | |
| WO2024001701A1 (en) | Data processing method, apparatus and system | |
| JP7273130B2 (en) | Communication method and device | |
| CN117955908B (en) | NDN network slicing method, device, equipment and medium based on SDN controller |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |