CN113904912B - Method and device for realizing high availability of service of cloud management platform - Google Patents
Method and device for realizing high availability of service of cloud management platform
- Publication number
- CN113904912B; CN202111487047.1A
- Authority
- CN
- China
- Prior art keywords
- cloud management
- management platform
- feature
- determining
- item
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/3006—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3051—Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3089—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
- G06F11/3093—Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Quality & Reliability (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Environmental & Geological Engineering (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention provides a method and a device for realizing high availability of a service of a cloud management platform. The method comprises: step 1: constructing a coping strategy library; step 2: monitoring the service state of a first cloud management platform to obtain monitoring data; step 3: determining a first abnormal item based on the monitoring data; step 4: determining, based on the coping strategy library, a first coping strategy corresponding to the first abnormal item, and executing it. Because the coping strategy is determined from the constructed coping strategy library, the system can respond to anomalies automatically and handle them accordingly. This avoids the situation in which users cannot use the cloud management platform because of anomalies caused by the insufficient experience of its maintenance personnel, and thereby achieves high availability of the services the platform provides.
Description
Technical Field
The invention relates to the technical field of cloud management platforms, in particular to a method and a device for realizing high availability of services of a cloud management platform.
Background
Currently, the various services provided by a cloud management platform (for example, centralized remote monitoring and unified data storage) bring convenience to users. However, some cloud management platforms are prone to anomalies (for example, downtime or hung nodes) for reasons such as the insufficient experience of their maintenance personnel. Users then cannot use the cloud management platform, the consequences are serious, and the availability of the cloud management platform service is low.
Therefore, a solution is needed.
Disclosure of Invention
One of the purposes of the invention is to provide a method and a device for realizing high availability of services of a cloud management platform. A coping strategy is determined based on a constructed coping strategy library, so that the system can respond to anomalies automatically and handle them accordingly. This avoids the problem that users cannot use the cloud management platform because of anomalies caused by the insufficient experience of its maintenance personnel, and achieves high availability of the services provided by the cloud management platform.
The method for realizing high availability of the service of the cloud management platform provided by the embodiment of the invention comprises the following steps:
step 1: constructing a coping strategy library;
step 2: monitoring the service state of the first cloud management platform to obtain monitoring data;
step 3: determining a first abnormal item based on the monitoring data;
step 4: determining, based on the coping strategy library, a first coping strategy corresponding to the first abnormal item, and executing it accordingly.
Preferably, step 1: constructing a coping strategy library, comprising:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies corresponding to the same second abnormal item and taking the second coping strategies as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target to which the third coping strategy belongs, and taking it as a second matching degree sum;
querying a preset effect value-matching degree sum-ranking score table, and determining the ranking score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the ranking score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence and taking them as fourth coping strategies;
combining the fourth coping strategies with the corresponding second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group in the blank database;
when all first control groups that need to be stored have been stored in the blank database, taking the blank database as the coping strategy library;
wherein n is a positive integer.
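The first half of step 1 (selecting the high-value attribute items and screening peer platforms by matching degree sum) could be sketched as follows. This is only an illustrative sketch: the `Platform` type, the attribute names, the all-or-nothing `match_degree` function and the threshold values passed in are assumptions, since the text does not specify how a matching degree is computed.

```python
from dataclasses import dataclass


@dataclass
class Platform:
    name: str
    attributes: dict  # attribute item name -> value


def match_degree(a, b) -> float:
    """Hypothetical per-item matching degree: 100 if equal, otherwise 0."""
    return 100.0 if a == b else 0.0


def select_key_items(value_degrees: dict, value_threshold: float) -> list:
    """Second attribute items: items whose value degree meets the threshold."""
    return [item for item, v in value_degrees.items() if v >= value_threshold]


def matching_degree_sum(first: Platform, other: Platform, key_items: list) -> float:
    """Sum the matching degrees over key items that the other platform also has."""
    return sum(
        match_degree(first.attributes[item], other.attributes[item])
        for item in key_items
        if item in other.attributes
    )


def screening_targets(first, others, value_degrees, value_threshold, sum_threshold):
    """Return (platform, matching degree sum) for every platform that passes."""
    key_items = select_key_items(value_degrees, value_threshold)
    result = []
    for platform in others:
        total = matching_degree_sum(first, platform, key_items)
        if total >= sum_threshold:
            result.append((platform, total))
    return result


# Example with assumed data: only "max_visitors" is a high-value item.
first = Platform("A", {"max_visitors": 100, "login": "wechat"})
others = [
    Platform("B", {"max_visitors": 100, "login": "weibo"}),
    Platform("C", {"max_visitors": 500, "login": "wechat"}),
]
value_degrees = {"max_visitors": 90, "login": 20}
targets = screening_targets(first, others, value_degrees, 85, 97)
```

In this example platform B differs from A only in a low-value item, so it is still kept as a screening target, while platform C is dropped.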
Preferably, the obtaining of the second attribute information of the plurality of second cloud management platforms includes:
acquiring a cooperative relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperative relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and simultaneously determining a first distance between the first position and the second position;
taking the first circle center as the center and the first distance as a first radius, forming a second cooperation circle within the first cooperation circle;
determining a third location of a plurality of third cloud management platforms within the second collaboration circle;
determining a second distance between the first location and the third location;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is smaller than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of negative items and negative values in one-to-one correspondence;
querying a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold value, second attribute information corresponding to the second cloud management platform is obtained, and if not, the second attribute information is not obtained;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of guarantors, guarantee types and guarantee values in one-to-one correspondence;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a guarantee score sum;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it.
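The cooperation circle screening above can be illustrated with a small geometric sketch. It assumes that positions are plain 2D coordinates and that a cooperation circle is given by a center and a radius; the text does not define either, so the `Circle` and `Peer` types, the field names and the sample numbers are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class Circle:
    center: tuple  # (x, y) of the circle center
    radius: float


@dataclass
class Peer:  # a candidate "third" cloud management platform
    name: str
    pos_in_first: tuple  # its position in the second platform's cooperation circle
    own_circle: Circle  # its own ("third") cooperation circle
    second_pos: Optional[tuple] = None  # second platform's position in own_circle


def fourth_platforms(first_circle: Circle, first_platform_pos: tuple, peers: list) -> list:
    """Select peers whose second ratio does not exceed their first ratio."""
    # Radius of the second cooperation circle: center to the first platform.
    first_distance = math.dist(first_circle.center, first_platform_pos)
    selected = []
    for p in peers:
        second_distance = math.dist(first_circle.center, p.pos_in_first)
        if second_distance > first_distance:
            continue  # outside the second cooperation circle
        if p.second_pos is None:
            continue  # second platform is not in this peer's circle
        third_distance = math.dist(p.own_circle.center, p.second_pos)
        if third_distance > p.own_circle.radius:
            continue  # position lies outside the third cooperation circle
        first_ratio = second_distance / first_circle.radius
        second_ratio = third_distance / p.own_circle.radius
        if second_ratio <= first_ratio:
            selected.append(p)
    return selected


def accept_active(negative_scores: list, max_negative_sum: float) -> bool:
    """Active cooperation: accept if the negative score sum is low enough."""
    return sum(negative_scores) <= max_negative_sum


def accept_passive(guarantee_scores: list, min_guarantee_sum: float) -> bool:
    """Passive cooperation: accept if the guarantee score sum is high enough."""
    return sum(guarantee_scores) >= min_guarantee_sum


first_circle = Circle((0.0, 0.0), 10.0)
peers = [
    Peer("P1", (3.0, 0.0), Circle((0.0, 0.0), 10.0), (2.0, 0.0)),
    Peer("P2", (4.0, 0.0), Circle((0.0, 0.0), 10.0), (5.0, 0.0)),
    Peer("P3", (8.0, 0.0), Circle((0.0, 0.0), 10.0), (1.0, 0.0)),
]
chosen = fourth_platforms(first_circle, (6.0, 0.0), peers)
```

Here P1 is kept because its second ratio (0.2) does not exceed its first ratio (0.3), P2 is rejected (0.5 against 0.4), and P3 lies outside the second cooperation circle.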
Preferably, step 3: determining a first anomaly item based on the monitoring data, comprising:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis on which to represent the corresponding first data item based on the generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset plurality of first data items lying in the selection direction from the second data item, and taking them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
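A minimal sketch of this anomaly determination is given below. Several simplifications are assumptions rather than part of the described method: features are plain strings that are already extracted, a trigger match is exact equality, the random combination of features is replaced by exhaustive pairing of the trigger feature with each neighbouring feature (for reproducibility), and the third matching degree is a Jaccard similarity. The table contents are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class DataItem:
    time: float  # generation time
    feature: str  # already-extracted feature (extraction itself is out of scope)


# Hypothetical preset tables.
TRIGGERS = {"cpu_spike"}  # trigger feature library
DIRECTIONS = {"cpu_spike": ["before"]}  # feature -> selection directions
ABNORMAL_COMBOS = {"cpu_spike": [frozenset({"cpu_spike", "conn_surge"})]}
ABNORMAL_ITEMS = {frozenset({"cpu_spike", "conn_surge"}): ["access overload"]}


def jaccard(a: frozenset, b: frozenset) -> float:
    """Assumed matching degree between two combined features."""
    return len(a & b) / len(a | b)


def determine_anomalies(items: list, k: int = 2, threshold: float = 1.0) -> list:
    """Return the abnormal items indicated by the data items."""
    timeline = sorted(items, key=lambda d: d.time)  # the time axis
    found = []
    for i, item in enumerate(timeline):
        if item.feature not in TRIGGERS:
            continue
        # Collect up to k neighbours in each selection direction.
        neighbours = []
        for direction in DIRECTIONS.get(item.feature, []):
            if direction == "before":
                neighbours += timeline[max(0, i - k):i]
            else:
                neighbours += timeline[i + 1:i + 1 + k]
        combos = [frozenset({item.feature, n.feature}) for n in neighbours]
        for abnormal in ABNORMAL_COMBOS.get(item.feature, []):
            if any(jaccard(c, abnormal) >= threshold for c in combos):
                for name in ABNORMAL_ITEMS.get(abnormal, []):
                    if name not in found:
                        found.append(name)
    return found


overloaded = determine_anomalies(
    [DataItem(3.0, "cpu_spike"), DataItem(1.0, "normal"), DataItem(2.0, "conn_surge")]
)
quiet = determine_anomalies([DataItem(1.0, "normal"), DataItem(2.0, "cpu_spike")])
```

The point of looking at neighbours on the time axis is that a single trigger feature is not treated as an anomaly by itself; only a matching combination is.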
Preferably, step 4: determining, based on the coping strategy library, a first coping strategy corresponding to the first abnormal item, and executing it accordingly, comprises:
matching the first abnormal item against the second abnormal items in the first control groups, and taking the first control group whose second abnormal item matches as a second control group;
extracting the third coping strategies in the second control group as the first coping strategies;
querying a preset coping strategy-priority table, and determining the priority corresponding to each first coping strategy;
and executing the first coping strategies in sequence based on the priority.
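The lookup and priority-ordered execution of step 4 might look like the sketch below. It assumes the coping strategy library is a mapping from abnormal item to strategy names, that a smaller number in the priority table means earlier execution, and that each strategy name maps to a callable; none of these representations is fixed by the text.

```python
def execute_strategies(anomaly: str, library: dict, priorities: dict, actions: dict) -> list:
    """Run the strategies stored for `anomaly` in priority order.

    Returns the names of the executed strategies, in execution order.
    Strategies missing from the priority table run last (assumption).
    """
    names = library.get(anomaly, [])
    ordered = sorted(names, key=lambda n: priorities.get(n, float("inf")))
    executed = []
    for name in ordered:
        actions[name]()  # perform the coping strategy
        executed.append(name)
    return executed


# Hypothetical library, priority table and actions.
log = []
library = {"access overload": ["scale_out", "rate_limit"]}
priorities = {"rate_limit": 1, "scale_out": 2}
actions = {
    "rate_limit": lambda: log.append("limiting late requests"),
    "scale_out": lambda: log.append("adding a node"),
}
order = execute_strategies("access overload", library, priorities, actions)
```

An abnormal item with no entry in the library simply yields an empty list here; how such a case should be handled is not stated in the text.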
The device for realizing high availability of the service of the cloud management platform provided by the embodiment of the invention comprises:
the construction module is used for constructing a coping strategy library;
the monitoring module is used for monitoring the service state of the first cloud management platform and acquiring monitoring data;
a determination module for determining a first abnormal item based on the monitoring data;
and the execution module is used for determining, based on the coping strategy library, a first coping strategy corresponding to the first abnormal item and executing it accordingly.
Preferably, the building module performs the following operations:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies corresponding to the same second abnormal item and taking the second coping strategies as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target to which the third coping strategy belongs, and taking it as a second matching degree sum;
querying a preset effect value-matching degree sum-ranking score table, and determining the ranking score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the ranking score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence and taking them as fourth coping strategies;
combining the fourth coping strategies with the corresponding second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group in the blank database;
when all first control groups that need to be stored have been stored in the blank database, taking the blank database as the coping strategy library;
wherein n is a positive integer.
Preferably, the building module performs the following operations:
acquiring a cooperative relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperative relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and simultaneously determining a first distance between the first position and the second position;
taking the first circle center as the center and the first distance as a first radius, forming a second cooperation circle within the first cooperation circle;
determining a third location of a plurality of third cloud management platforms within the second collaboration circle;
determining a second distance between the first location and the third location;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is smaller than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of negative items and negative values in one-to-one correspondence;
querying a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold value, second attribute information corresponding to the second cloud management platform is obtained, and if not, the second attribute information is not obtained;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of guarantors, guarantee types and guarantee values in one-to-one correspondence;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a guarantee score sum;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it.
Preferably, the determining module performs the following operations:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis on which to represent the corresponding first data item based on the generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset plurality of first data items lying in the selection direction from the second data item, and taking them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
Preferably, the execution module performs the following operations:
matching the first abnormal item against the second abnormal items in the first control groups, and taking the first control group whose second abnormal item matches as a second control group;
extracting the third coping strategies in the second control group as the first coping strategies;
querying a preset coping strategy-priority table, and determining the priority corresponding to each first coping strategy;
and executing the first coping strategies in sequence based on the priority.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
Fig. 1 is a flowchart of a method for implementing high availability of a service on a cloud management platform according to an embodiment of the present invention;
Fig. 2 is a flowchart of another method for implementing high availability of services for a cloud management platform according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of a device for implementing high availability of services by a cloud management platform in an embodiment of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.
The embodiment of the invention provides a method for realizing high availability of a service by a cloud management platform, as shown in fig. 1, the method comprises the following steps:
step 1: constructing a coping strategy library;
step 2: monitoring the service state of the first cloud management platform to obtain monitoring data;
step 3: determining a first abnormal item based on the monitoring data;
step 4: determining, based on the coping strategy library, a first coping strategy corresponding to the first abnormal item, and executing it accordingly.
The working principle and the beneficial effects of the technical scheme are as follows:
A coping strategy library (a database) is constructed, storing the coping strategies corresponding to different abnormal types. For example, the abnormal type is that too many people are visiting a certain service module of the cloud management platform, so that the load on the platform is too high; the coping strategy is to apply time-sharing rate limiting to users who request access later. The service state of the cloud management platform is monitored to obtain monitoring data; a first abnormal item (abnormal type) is determined based on the monitoring data; and a first coping strategy corresponding to the first abnormal item is determined based on the constructed coping strategy library and executed, thereby responding to the anomaly.
According to the embodiment of the invention, the coping strategy is determined based on the constructed coping strategy library, so that the system can respond to anomalies automatically and handle them accordingly. This avoids the problem that users cannot use the cloud management platform because of anomalies caused by the insufficient experience of its maintenance personnel, and achieves high availability of the services provided by the cloud management platform.
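As a rough illustration of the four steps, using the visitor-overload example from the description, the flow could be sketched as below. The dictionary-based platform state, the field names `visitors` and `max_visitors`, and the strategy label are assumptions made only for this sketch.

```python
from typing import Optional


def build_library() -> dict:
    """Step 1: a toy coping strategy library (abnormal type -> coping strategy)."""
    return {"too_many_visitors": "time_shared_rate_limiting"}


def monitor(platform: dict) -> dict:
    """Step 2: collect monitoring data from the (simulated) platform state."""
    return {"visitors": platform["visitors"], "max_visitors": platform["max_visitors"]}


def determine_anomaly(data: dict) -> Optional[str]:
    """Step 3: derive the abnormal item, if any, from the monitoring data."""
    if data["visitors"] > data["max_visitors"]:
        return "too_many_visitors"
    return None


def run_once(platform: dict, library: dict) -> Optional[str]:
    """Step 4: look up the coping strategy for the detected anomaly."""
    anomaly = determine_anomaly(monitor(platform))
    if anomaly is None:
        return None
    return library.get(anomaly)


library = build_library()
busy = run_once({"visitors": 150, "max_visitors": 100}, library)
calm = run_once({"visitors": 50, "max_visitors": 100}, library)
```

A real deployment would execute the returned strategy rather than just naming it; the sketch stops at the lookup to keep the four steps visible.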
The embodiment of the invention provides a method for realizing high availability of services of a cloud management platform, in which step 1: constructing a coping strategy library, comprises:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies corresponding to the same second abnormal item and taking the second coping strategies as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target to which the third coping strategy belongs, and taking it as a second matching degree sum;
querying a preset effect value-matching degree sum-ranking score table, and determining the ranking score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the ranking score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence and taking them as fourth coping strategies;
combining the fourth coping strategies with the corresponding second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group in the blank database;
when all first control groups that need to be stored have been stored in the blank database, taking the blank database as the coping strategy library;
wherein n is a positive integer.
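The ranking and top-n selection at the end of this list can be sketched as follows. The preset lookup table is replaced here by a simple hypothetical scoring function that grows with both the effect value and the matching degree sum, and the sort is assumed to be in descending order of ranking score; the record layout and sample values are likewise assumptions.

```python
from collections import defaultdict


def ranking_score(effect_value: float, matching_sum: float) -> float:
    """Hypothetical stand-in for the preset table: grows with both inputs."""
    return effect_value + matching_sum


def build_strategy_library(records, n: int = 1) -> dict:
    """Group strategies by abnormal item, rank them and keep the first n.

    records: iterable of (abnormal_item, strategy, effect_value, matching_sum).
    """
    by_item = defaultdict(list)
    for abnormal_item, strategy, effect, matching_sum in records:
        by_item[abnormal_item].append((ranking_score(effect, matching_sum), strategy))

    library = {}  # plays the role of the blank database
    for abnormal_item, scored in by_item.items():
        scored.sort(key=lambda pair: pair[0], reverse=True)  # best score first
        library[abnormal_item] = [strategy for _, strategy in scored[:n]]
    return library


records = [
    ("access overload", "rate_limit", 80, 100),
    ("access overload", "scale_out", 90, 97),
    ("node down", "restart_node", 70, 98),
]
top1 = build_strategy_library(records, n=1)
top2 = build_strategy_library(records, n=2)
```

With n = 1 each abnormal item keeps only its best-ranked strategy; a larger n keeps alternatives for later selection.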
The working principle and the beneficial effects of the technical scheme are as follows:
The preset value degree threshold is, for example, 85; the preset matching degree sum threshold is, for example, 97; the preset effect value-matching degree sum-ranking score table is a lookup table containing different effect values, different matching degree sums and the corresponding ranking scores, in which a larger effect value and a larger matching degree sum give a larger ranking score; the preset blank database is a database with no content in it;
when the coping strategy library is constructed, coping strategies which are historically performed against some abnormal execution by other second cloud management platforms can be obtained, but different cloud platforms have different attribute information, and the coping strategies cannot be directly obtained to construct the coping strategy library; therefore, first attribute information of the first cloud management platform is obtained, where the first attribute information includes a plurality of first attribute items [ for example: maximum number of visitors allowed, registration mode, etc.); when acquiring the coping strategy, some attribute items belong to irrelevant important items, and do not influence the reference use of the coping strategy, such as: the registration modes include WeChat login, microblog login and the like, the value degree is low, and some attribute items belong to important items, such as: the maximum number of allowed visitors is 100; therefore, the value degree of the first attribute item is obtained, and the important second attribute item and a plurality of fourth attribute items in the second attribute information of the second cloud management platform are screened out [ for example: maximum allowed number of visitors, registration mode and the like, wherein the matching is more consistent, namely the first matching degree sum is higher, and the first matching degree sum is used as a screening target; the second cloud management platform records the effect values of coping strategies adopted for coping with different abnormal types each time, and the effect value is larger, the coping strategy effect is better; in addition, the higher the first matching degree sum is, the more the coping strategy corresponding to the second cloud management platform can be adopted; therefore, based on the effect value and the first matching degree sum, determining a ranking score, and ranking the third corresponding strategy to obtain a 
strategy sequence; selecting the front n [ generally 1 ] in the strategy sequence, and if a user wants to select a coping strategy at the later stage, the user can set the coping strategy as a constant larger than 1 ] to serve as a fourth coping strategy; combining the fourth corresponding strategy and the corresponding second abnormal item to obtain a first control group, and storing the first control group in a blank database;
the embodiment of the invention is based on the historical execution coping strategy record of the second cloud management platform, constructs the coping strategy library, and has higher applicability under the trend of data sharing (big data); simultaneously, before obtaining, carry out attribute matching, reply strategy sequencing selects the reply strategy that can be suitable for self cloud management platform and effect preferred, has promoted the accurate nature of reply strategy selection, sets up rationally and very intelligent.
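As a non-limiting illustration, the sorting and top-n selection described above can be sketched as follows. The record type, the function names and the additive scoring function are assumptions made for this sketch; the embodiment itself only requires a preset lookup table whose sorting score grows with both the effect value and the matching degree sum.

```python
from collections import defaultdict
from typing import NamedTuple


class StrategyRecord(NamedTuple):
    """One exception handling record of a screening target (hypothetical type)."""
    abnormal_item: str    # second abnormal item
    strategy: str         # second coping strategy
    effect_value: float   # first effect value
    matching_sum: float   # first matching degree sum of the supplying platform


def sorting_score(effect_value: float, matching_sum: float) -> float:
    # Stand-in for the preset table (assumption): any function that is
    # monotone increasing in both arguments would serve the same role.
    return effect_value + matching_sum


def build_strategy_library(records, n=1):
    """Group records by abnormal item, rank them, and keep the first n."""
    by_item = defaultdict(list)
    for rec in records:
        by_item[rec.abnormal_item].append(rec)

    library = {}  # plays the role of the initially blank database
    for item, recs in by_item.items():
        ranked = sorted(
            recs,
            key=lambda r: sorting_score(r.effect_value, r.matching_sum),
            reverse=True,
        )
        # Each (item, strategy list) entry corresponds to the control groups.
        library[item] = [r.strategy for r in ranked[:n]]
    return library


records = [
    StrategyRecord("too_many_visitors", "time_shared_rate_limit", 90, 98),
    StrategyRecord("too_many_visitors", "restart_service", 70, 99),
    StrategyRecord("disk_nearly_full", "purge_old_logs", 80, 97),
]
library = build_strategy_library(records, n=1)
```

With n set to 1 only the best-ranked strategy per abnormal item is kept; a larger n keeps several candidates so that a user can choose among them later.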
The embodiment of the invention provides a method for realizing high availability of services of a cloud management platform, wherein acquiring second attribute information of a plurality of second cloud management platforms comprises:
acquiring a cooperation relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperation relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and determining a first distance between the first position and the second position;
constructing a second cooperation circle within the first cooperation circle, centered on the first circle center and with the first distance as its first radius;
determining third positions of a plurality of third cloud management platforms within the second cooperation circle;
determining a second distance between the first position and the third position;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is less than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of one-to-one corresponding negative items and negative values;
querying a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of one-to-one corresponding guarantors, guarantee types and guarantee values;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a guarantee score sum;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset difference threshold is, for example, 0.7; the preset negative item-negative value-negative score table is a lookup table containing the negative scores corresponding to different negative items and different negative values, where the more serious the negative item and the larger the negative value, the higher the negative score, for example: the negative item is that an uploaded coping strategy is untrue, the negative value is 8, and the negative score is 95; the preset negative score sum threshold is, for example, 755; the preset guarantor-guarantee type-guarantee value-guarantee score table is a lookup table containing the guarantee scores corresponding to different guarantors, different guarantee types and different guarantee values, for example: the guarantor is a senior manager, the guarantee type is a monetary guarantee of a certain amount, and the larger the amount, the larger the guarantee value; if the guarantee value is 95, the guarantee score is 99; the preset guarantee score sum threshold is, for example, 850;
the cooperation relationships under which different cloud management platforms share coping strategies are divided into active cooperation [for example: cloud management platform A actively provides coping strategies to cloud management platform B, so the cooperation relationship of cloud management platform A relative to cloud management platform B is active cooperation] and passive cooperation [for example: cloud management platform A and cloud management platform B belong to the same company, and the company designates cloud management platform A to provide coping strategies to cloud management platform B, so the cooperation relationship of cloud management platform A relative to cloud management platform B is passive cooperation]; each actively cooperating cloud management platform corresponds to a cooperation circle, which is specifically: a circle of fixed radius in which every cloud management platform that the platform actively cooperates with has a corresponding position; the more stable the cooperation relationship between such a platform and the owner of the circle and the longer the cooperation time, the closer its position is to the circle center; first, the second cooperation circle is constructed: a third cloud management platform inside the second cooperation circle has cooperated with the second cloud management platform for longer than the first cloud management platform has, so its evaluation of the second cloud management platform is more valuable; then the first ratio and the second ratio are determined: because the radii of the different cooperation circles are fixed, a second ratio no larger than the first ratio indicates that the active cooperation relationship between the fourth cloud management platform and the second cloud management platform is more stable, so the evaluation of the second cloud management platform by the fourth cloud management platform is more valuable; negative evaluation information of the fourth cloud management platform is obtained, which contains different negative items [for example: an uploaded coping strategy is untrue] and the corresponding negative values, where the larger the negative value, the more severe the negative item; the negative score sum is determined, and the larger the negative score sum, the more bad records the second cloud management platform has, in which case its second attribute information is not acquired; when the cooperation relationship is passive cooperation, the first cloud management platform and the second cloud management platform belong to the same company, so the guarantee information of the second cloud management platform is obtained and the guarantee score sum is determined; the larger the guarantee score sum, the higher the cost to the second cloud management platform of producing bad records, so bad records are unlikely to be produced and acquisition is permitted;
the embodiment of the invention first determines the cooperation relationship between the second cloud management platform and the first cloud management platform and applies a different screening strategy for each cooperation relationship; second attribute information is acquired only from second cloud management platforms that pass the screening, and for those that do not pass, neither the second attribute information nor the corresponding coping strategy records are acquired, which safeguards the quality of the acquired data.
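As a non-limiting illustration, the two screening branches can be sketched as a single admission check. The table contents reuse the example values given above (negative value 8 mapping to negative score 95, guarantee value 95 mapping to guarantee score 99, thresholds 755 and 850); the key names, the function name and the dictionary representation of the preset tables are assumptions made for this sketch.

```python
# Hypothetical preset tables, populated only with the example entries from the text.
NEGATIVE_SCORE_TABLE = {("untrue_strategy_upload", 8): 95}
GUARANTEE_SCORE_TABLE = {("senior_manager", "monetary", 95): 99}

NEGATIVE_SUM_THRESHOLD = 755   # example value from the description
GUARANTEE_SUM_THRESHOLD = 850  # example value from the description


def may_acquire(relationship, negative_evaluations=(), guarantees=()):
    """Decide whether second attribute information may be acquired.

    relationship         -- "active" or "passive"
    negative_evaluations -- (negative item, negative value) pairs collected
                            from the fourth cloud management platforms
    guarantees           -- (guarantor, guarantee type, guarantee value) triples
    """
    if relationship == "active":
        negative_sum = sum(
            NEGATIVE_SCORE_TABLE[(item, value)]
            for item, value in negative_evaluations
        )
        # Few bad records: acquisition is allowed.
        return negative_sum <= NEGATIVE_SUM_THRESHOLD
    if relationship == "passive":
        guarantee_sum = sum(
            GUARANTEE_SCORE_TABLE[(guarantor, kind, value)]
            for guarantor, kind, value in guarantees
        )
        # Strong guarantees: acquisition is allowed.
        return guarantee_sum >= GUARANTEE_SUM_THRESHOLD
    raise ValueError(f"unknown cooperation relationship: {relationship!r}")


complaint = ("untrue_strategy_upload", 8)
pledge = ("senior_manager", "monetary", 95)
active_ok = may_acquire("active", negative_evaluations=[complaint] * 7)
passive_ok = may_acquire("passive", guarantees=[pledge] * 9)
```

Note that the two branches compare in opposite directions: a negative score sum must stay at or below its threshold, whereas a guarantee score sum must reach or exceed its threshold.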
The embodiment of the invention provides a method for realizing high availability of services of a cloud management platform, wherein determining a first abnormal item based on the monitoring data comprises:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis, and representing the corresponding first data items on the time axis based on the generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset number of first data items lying in the selection direction from the second data item, and taking them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset trigger feature library is a database storing a number of trigger features, for example: the partial features obtained by splitting a certain abnormal feature; the preset first threshold is, for example, 87; the preset feature-selection direction table is a lookup table containing the selection directions corresponding to different features, for example: if a feature is the earlier part of an abnormal feature, the anomaly is confirmed only if the later part of that abnormal feature appears afterwards, so the selection direction is the rear direction (from the start point toward the end point of the time axis, i.e. later in time); the preset number is, for example, 5; the preset feature-abnormal combination feature table is a lookup table storing the abnormal combination features corresponding to different features, for example: if a feature is the earlier part of an abnormal feature, that abnormal feature is the abnormal combination feature corresponding to it; the preset second threshold is, for example, 88; the preset abnormal combination feature-abnormal item table is a lookup table containing the abnormal items (anomaly types) corresponding to different abnormal combination features;
a plurality of newly generated first data items corresponding to the same monitoring type [for example: monitoring the number of visitors] are extracted [for example: the number of visitors is collected every 3 milliseconds, and each first data item is the number of visitors collected at one such interval]; feature extraction is performed on the first data item to obtain a first feature [for example: the number of visitors is 100]; the first feature is matched against a second feature in the trigger feature library [for example: the number of visitors is 99], and the matching degree exceeds the first threshold; if the maximum number of visitors of the cloud management platform is 150, the subsequent visitor counts need to be watched, so the selection direction is the rear direction; the fourth data items that appear later are selected, and feature extraction is performed on them to obtain fourth features [for example: the number of visitors is 120, the number of visitors is 130]; random combination yields the combined feature that the number of visitors keeps increasing and is about to reach the upper limit; this combined feature is matched against the corresponding abnormal combination feature, the matching degree exceeds the second threshold, and the anomaly is determined to be that the number of visitors is about to reach the upper limit;
by providing the trigger feature library, the embodiment of the invention improves the efficiency of finding partial features of an anomaly, avoids verifying all data items one by one, and reduces the occupation of system resources; the selection direction is determined automatically, so that further fourth data items are obtained for anomaly judgment, which is more intelligent.
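As a non-limiting illustration, the visitor-count example can be sketched as follows. This is a deliberate simplification: the matching degree function, the 80 percent cutoff for "about to reach the upper limit" and the rule that replaces the lookup against the feature-abnormal combination feature table are all assumptions made for this sketch, and only the example values 99, 87, 5 and 150 come from the description.

```python
TRIGGER_FEATURES = [99]   # hypothetical trigger feature library (visitor counts)
FIRST_THRESHOLD = 87      # example first threshold from the description
PRESET_NUMBER = 5         # example preset number of items to look at
MAX_VISITORS = 150        # example upper limit from the description


def matching_degree(a, b):
    """Toy similarity on a 0-100 scale (assumption, not the patented measure)."""
    return max(0.0, 100.0 - abs(a - b))


def detect_abnormal_item(data_items):
    """data_items: iterable of (generation_time, visitor_count) pairs."""
    timeline = sorted(data_items)  # the time axis, ordered by generation time
    for idx, (_, value) in enumerate(timeline):
        triggered = any(
            matching_degree(value, trigger) >= FIRST_THRESHOLD
            for trigger in TRIGGER_FEATURES
        )
        if not triggered:
            continue
        # Selection direction "rear": take the next PRESET_NUMBER items.
        later = [v for _, v in timeline[idx + 1: idx + 1 + PRESET_NUMBER]]
        combined = [value] + later
        rising = all(a < b for a, b in zip(combined, combined[1:]))
        # Rule standing in for matching against the abnormal combination feature.
        if later and rising and combined[-1] >= 0.8 * MAX_VISITORS:
            return "visitor count about to reach upper limit"
    return None


samples = [(0, 50), (3, 100), (6, 120), (9, 130)]
result = detect_abnormal_item(samples)
```

Only data items that match a trigger feature cause the later items to be inspected, which reflects the stated benefit of not verifying every data item one by one.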
The embodiment of the invention provides a method for realizing high availability of services of a cloud management platform, as shown in fig. 2, wherein step 4, determining a first coping strategy corresponding to the first abnormal item based on the coping strategy library and executing it, comprises:
step 401: determining the first control group whose second abnormal item corresponds to the first abnormal item, and taking it as a second control group;
step 402: extracting the third coping strategies in the second control group as the first coping strategies;
step 403: querying a preset coping strategy-priority table, and determining the priority corresponding to the first coping strategy;
step 404: executing the first coping strategies in sequence based on the priority.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset coping strategy-priority table is a lookup table containing the priorities corresponding to different coping strategies; because anomalies differ in severity, coping strategies differ in priority, and coping strategies with a higher priority need to be executed first;
when the first coping strategy corresponding to the first abnormal item is determined based on the coping strategy library, the first abnormal item only needs to be compared with the second abnormal items in the first control groups in the coping strategy library; if the first abnormal item is the same as a second abnormal item, the two correspond, and the corresponding third coping strategy is extracted as the first coping strategy; during execution, the coping strategies are executed in order of priority, which ensures that the more severe anomalies are handled first; the arrangement is reasonable.
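As a non-limiting illustration, steps 401 to 404 can be sketched as a lookup followed by a priority sort. The strategy names, the priority values and the convention that a larger number means a higher priority are assumptions made for this sketch; the function returns the execution order rather than executing anything.

```python
# Hypothetical coping strategy-priority table; larger number = higher priority.
PRIORITY_TABLE = {
    "time_shared_rate_limit": 3,
    "scale_out_service": 2,
    "notify_operator": 1,
}


def plan_execution(first_abnormal_items, control_groups, priority_table=PRIORITY_TABLE):
    """Return the coping strategies to run, highest priority first.

    control_groups -- (second abnormal item, coping strategy) pairs stored
                      in the coping strategy library
    """
    # Step 401: keep the control groups whose abnormal item was detected.
    second_groups = [g for g in control_groups if g[0] in first_abnormal_items]
    # Step 402: extract their coping strategies.
    first_strategies = [strategy for _, strategy in second_groups]
    # Steps 403-404: look up each priority and order the strategies by it.
    return sorted(first_strategies, key=lambda s: priority_table[s], reverse=True)


control_groups = [
    ("too_many_visitors", "notify_operator"),
    ("too_many_visitors", "time_shared_rate_limit"),
    ("disk_nearly_full", "scale_out_service"),
]
order = plan_execution({"too_many_visitors"}, control_groups)
```

Because `sorted` is stable, strategies with equal priority keep the order in which they appear in the library.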
An embodiment of the present invention provides an apparatus for implementing high availability of a service on a cloud management platform, as shown in fig. 3, including:
the construction module 1 is used for constructing a coping strategy library;
the monitoring module 2 is used for monitoring the service state of the first cloud management platform and acquiring monitoring data;
a determination module 3, configured to determine a first abnormal item based on the monitoring data;
and the execution module 4 is used for determining a first coping strategy corresponding to the first abnormal item based on the coping strategy library and executing it.
The working principle and the beneficial effects of the technical scheme are as follows:
a coping strategy library [a database] is constructed, which stores the coping strategies corresponding to different anomaly types, for example: the anomaly type is that too many users are accessing a certain service module of the cloud management platform so that the load on the cloud management platform is too high, and the coping strategy is to apply time-shared rate limiting to users who request access later; the service state of the cloud management platform is monitored to obtain monitoring data; a first abnormal item [anomaly type] is determined based on the monitoring data; the first coping strategy corresponding to the first abnormal item is determined based on the constructed coping strategy library and executed, thereby coping with the anomaly;
the embodiment of the invention determines the coping strategy based on the constructed coping strategy library, so that the system can cope with anomalies automatically and handle them accordingly; this avoids the situation in which users cannot use the cloud management platform because an anomaly persists owing to the insufficient experience of the platform's maintenance personnel, and thus realizes high availability of the services provided by the cloud management platform.
The embodiment of the invention provides a device for realizing high availability of services of a cloud management platform, wherein a construction module 1 executes the following operations:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies that correspond to the same second abnormal item, and taking them as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target corresponding to the third coping strategy, and taking it as a second matching degree sum;
querying a preset effect value-matching degree sum-sorting score table, and determining the sorting score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the sorting score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence, and taking them as fourth coping strategies;
combining the fourth coping strategy with the corresponding second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group in the blank database;
when all the first control groups that need to be stored have been stored in the blank database, taking the blank database as the coping strategy library;
wherein n is a positive integer.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset value degree threshold is, for example, 85; the preset matching degree sum threshold is, for example, 97; the preset effect value-matching degree sum-sorting score table is a lookup table containing the sorting scores corresponding to different effect values and different matching degree sums, where the larger the effect value and the larger the matching degree sum, the larger the sorting score; the preset blank database is a database with no content in it;
when the coping strategy library is constructed, coping strategies that other (second) cloud management platforms have historically executed against certain anomalies can be obtained; however, different cloud management platforms have different attribute information, so these coping strategies cannot be taken directly to construct the coping strategy library; therefore, first attribute information of the first cloud management platform is obtained, which includes a plurality of first attribute items [for example: maximum allowed number of visitors, registration mode, etc.]; when coping strategies are acquired, some attribute items are unimportant and do not affect whether a coping strategy can be used for reference, for example: the registration modes include WeChat login, microblog login and the like, whose value degree is low; other attribute items are important, for example: the maximum allowed number of visitors is 100; therefore, the value degree of each first attribute item is obtained, and the important second attribute items are screened out and matched with the plurality of fourth attribute items [for example: maximum allowed number of visitors, registration mode, etc.] in the second attribute information of the second cloud management platform; the more consistent the match, the higher the first matching degree sum, and a second cloud management platform whose first matching degree sum reaches the threshold is taken as a screening target; the second cloud management platform records the effect value of the coping strategy adopted each time it copes with a given anomaly type, and the larger the effect value, the better the effect of the coping strategy; in addition, the higher the first matching degree sum, the more suitable the coping strategies of the corresponding second cloud management platform are for adoption; therefore, a sorting score is determined based on the effect value and the first matching degree sum, and the third coping strategies are sorted to obtain a strategy sequence; the first n [generally 1; if the user wants to choose among coping strategies at a later stage, n can be set to a constant greater than 1] strategies in the strategy sequence are selected as fourth coping strategies; the fourth coping strategy is combined with the corresponding second abnormal item to obtain a first control group, which is stored in the blank database;
the embodiment of the invention constructs the coping strategy library based on the records of coping strategies historically executed by the second cloud management platforms, and is well suited to the trend of data sharing (big data); in addition, before acquisition, attribute matching and coping strategy sorting are performed so as to select coping strategies that are applicable to the platform's own cloud management platform and have a better effect, which improves the accuracy of coping strategy selection; the arrangement is reasonable and intelligent.
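As a non-limiting illustration, the attribute screening performed by the construction module 1 before any coping strategy is ranked can be sketched as follows. The per-attribute matching function, the attribute names and the value degrees are assumptions made for this sketch; the thresholds 85 and 97 are the example values given above.

```python
VALUE_THRESHOLD = 85       # example value degree threshold
MATCH_SUM_THRESHOLD = 97   # example matching degree sum threshold


def attribute_match(a, b):
    """Toy first matching degree on a 0-100 scale (assumption)."""
    if a == b:
        return 100.0
    numeric = isinstance(a, (int, float)) and isinstance(b, (int, float))
    if numeric and a > 0 and b > 0:
        return 100.0 * min(a, b) / max(a, b)
    return 0.0


def select_screening_targets(first_attrs, value_degree, candidates):
    """Return {platform name: first matching degree sum} for screening targets."""
    # Second attribute items: the first attribute items that are valuable enough.
    important = {
        name: value
        for name, value in first_attrs.items()
        if value_degree[name] >= VALUE_THRESHOLD
    }
    targets = {}
    for platform, second_attrs in candidates.items():
        match_sum = sum(
            attribute_match(value, second_attrs[name])
            for name, value in important.items()
            if name in second_attrs
        )
        if match_sum >= MATCH_SUM_THRESHOLD:
            targets[platform] = match_sum
    return targets


first_attrs = {"max_visitors": 100, "registration_mode": "wechat"}
value_degree = {"max_visitors": 95, "registration_mode": 20}
candidates = {
    "platform_b": {"max_visitors": 100, "registration_mode": "microblog"},
    "platform_c": {"max_visitors": 50, "registration_mode": "wechat"},
}
targets = select_screening_targets(first_attrs, value_degree, candidates)
```

In this example the registration mode is ignored because its value degree is low, so platform_b qualifies despite using a different login method, while platform_c is rejected because its visitor limit differs.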
The embodiment of the invention provides a device for realizing high availability of services of a cloud management platform, wherein a construction module 1 executes the following operations:
acquiring a cooperation relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperation relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and determining a first distance between the first position and the second position;
constructing a second cooperation circle within the first cooperation circle, centered on the first circle center and with the first distance as its first radius;
determining third positions of a plurality of third cloud management platforms within the second cooperation circle;
determining a second distance between the first position and the third position;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is less than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of one-to-one corresponding negative items and negative values;
querying a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of one-to-one corresponding guarantors, guarantee types and guarantee values;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a guarantee score sum;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring the second attribute information corresponding to the second cloud management platform, and otherwise not acquiring it.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset difference threshold specifically comprises: for example, 0.7; the preset negation item-negation value-negation scoring table is specifically as follows: the comparison table includes negative scores corresponding to different negative terms and different negative values, and the more serious the negative term is, the larger the negative value is, the higher the negative score is, for example: the negative terms are: the uploading of the coping strategy is not real, the negative value is 8, and the negative score is 95; the preset negative score and the threshold are specifically: for example, 755; the preset guarantor-guaranty type-guaranty value-guaranty rating table is specifically as follows: the comparison table contains the guarantee scores corresponding to different guarantees, different guarantee types and different guarantee values, such as: the guaranty is a high manager, the guaranty type is a money guaranty providing a certain guaranty, the larger the money is, the larger the guaranty value is, if the guaranty value is 95, the guaranty score is 99; the preset guarantee score and threshold are specifically: for example, 850;
the cooperation relationships under which different cloud management platforms share coping strategies are divided into active cooperation [for example: cloud management platform A actively provides coping strategies to cloud management platform B, so the cooperation relationship of cloud management platform A relative to cloud management platform B is active cooperation] and passive cooperation [for example: cloud management platform A and cloud management platform B belong to the same company, and the company designates cloud management platform A to provide coping strategies to cloud management platform B, so the cooperation relationship of cloud management platform A relative to cloud management platform B is passive cooperation]; each cloud management platform that cooperates actively corresponds to a cooperation circle, which is a circle of fixed radius in which every cloud management platform actively cooperating with that platform has a corresponding position; the more stable the cooperation relationship and the longer the cooperation time, the closer the position is to the center of the circle;
first, a second cooperation circle is constructed; the third cloud management platforms within the second cooperation circle have cooperated with the second cloud management platform for longer than the first cloud management platform has, so their evaluations of the second cloud management platform are more valuable; then the first ratio and the second ratio are determined; because the radii of different cooperation circles are fixed, a second ratio that is less than or equal to the first ratio indicates that the active cooperation relationship between the fourth cloud management platform and the second cloud management platform is more stable, so the evaluation of the second cloud management platform by the fourth cloud management platform is more valuable; negative evaluation information of the fourth cloud management platform is obtained, which contains different negative items [for example: an uploaded coping strategy is not genuine] and the corresponding negative values, where the larger the negative value, the more severe the negative item; the negative score sum is determined, and the larger the negative score sum, the more bad records the second cloud management platform has, in which case its second attribute information is not acquired; when the cooperation relationship is passive cooperation, the first cloud management platform and the second cloud management platform belong to the same company, so the guarantee information of the second cloud management platform is obtained and the guarantee score sum is determined; the larger the guarantee score sum, the higher the cost to the second cloud management platform of producing a bad record, so bad records are unlikely and acquisition is permitted;
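The ratio comparison that selects a fourth cloud management platform can be sketched as follows. This is only an illustrative sketch: the text does not say how positions inside a cooperation circle are represented, so two-dimensional coordinates and the function names `ratio_to_center` and `is_fourth_platform` are assumptions made for the example.

```python
import math

def ratio_to_center(center, position, radius_length):
    """Distance from a circle center to a position, as a fraction of the radius.

    Positions are assumed to be (x, y) tuples; this representation is hypothetical.
    """
    return math.dist(center, position) / radius_length

def is_fourth_platform(first_center, third_position, first_radius_length,
                       third_center, second_position, third_radius_length):
    """Return True if a third platform qualifies as a fourth platform.

    first ratio  = second distance / first length  (third platform in the first cooperation circle)
    second ratio = third distance / second length  (second platform in the third cooperation circle)
    The platform qualifies when the second ratio is less than or equal to the first ratio.
    """
    first_ratio = ratio_to_center(first_center, third_position, first_radius_length)
    second_ratio = ratio_to_center(third_center, second_position, third_radius_length)
    return second_ratio <= first_ratio
```

Because each ratio is normalized by its own circle's radius, the comparison does not depend on the absolute size of either cooperation circle.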
in the embodiment of the invention, the cooperation relationship between the second cloud management platform and the first cloud management platform is determined first, and a different screening strategy is applied for each kind of cooperation relationship; second attribute information is acquired only from second cloud management platforms that pass the screening, and is not acquired from those that fail it, so their coping strategy records are not acquired either, which largely ensures the quality of what is acquired.
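The two screening branches can be illustrated with a minimal sketch. The thresholds 755 and 850 and the single table rows are the examples quoted in the text; the dictionary layout, the relationship labels "active" and "passive", and all function names are assumptions made for illustration only.

```python
# Example thresholds quoted in the text; a real deployment would preset its own.
NEGATIVE_SCORE_SUM_THRESHOLD = 755
GUARANTEE_SCORE_SUM_THRESHOLD = 850

# Hypothetical lookup tables, seeded only with the example rows from the text.
NEGATIVE_SCORE_TABLE = {("coping strategy not genuine", 8): 95}
GUARANTEE_SCORE_TABLE = {("senior executive", "monetary guarantee", 95): 99}

def negative_score_sum(evaluations):
    """Sum the negative scores of (negative item, negative value) pairs."""
    return sum(NEGATIVE_SCORE_TABLE[(item, value)] for item, value in evaluations)

def guarantee_score_sum(guarantees):
    """Sum the guarantee scores of (guarantor, guarantee type, guarantee value) triples."""
    return sum(GUARANTEE_SCORE_TABLE[(g, t, v)] for g, t, v in guarantees)

def may_acquire_second_attributes(relationship, records):
    """Decide whether the second attribute information may be acquired."""
    if relationship == "active":
        # Few bad records: the negative score sum must not exceed its threshold.
        return negative_score_sum(records) <= NEGATIVE_SCORE_SUM_THRESHOLD
    if relationship == "passive":
        # Strong guarantees: the guarantee score sum must reach its threshold.
        return guarantee_score_sum(records) >= GUARANTEE_SCORE_SUM_THRESHOLD
    raise ValueError(f"unknown cooperation relationship: {relationship!r}")
```

Note that the two comparisons point in opposite directions: a low negative score sum and a high guarantee score sum both indicate a trustworthy platform.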
The embodiment of the invention provides a device for realizing high availability of services of a cloud management platform, wherein a determining module 3 executes the following operations:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis, and marking each first data item on the time axis based on its generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset number of first data items located in the selection direction from the second data item, and using them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset trigger feature library is a database storing a number of trigger features, for example the partial features obtained by splitting a certain abnormal feature; the preset first threshold is, for example, 87; the preset feature-selection direction table is a lookup table containing the selection directions corresponding to different features, for example: if a feature is the earlier part of an abnormal feature, the anomaly is confirmed once the later part of that abnormal feature appears afterwards, so the selection direction is the rear direction (toward later times, that is, from the starting point to the end point of the time axis); the preset number is, for example, 5; the preset feature-abnormal combination feature table is a lookup table storing the abnormal combination features corresponding to different features, for example: if a feature is the earlier part of an abnormal feature, that abnormal feature is its corresponding abnormal combination feature; the preset second threshold is, for example, 88; the preset abnormal combination feature-abnormal item table is a lookup table containing the abnormal items (abnormal types) corresponding to different abnormal combination features;
a plurality of newly generated first data items corresponding to the same monitoring type [for example: monitoring of the number of visitors] are extracted [for example: the number of visitors is acquired every 3 milliseconds, and each first data item is the number of visitors acquired at one such instant]; feature extraction is performed on the first data item to obtain a first feature [for example: the number of visitors is 100]; the first feature is matched against a second feature in the trigger feature library [for example: the number of visitors is 99], and the matching degree is greater than the first threshold; if the maximum number of visitors of the cloud management platform is 150, the subsequent number of visitors needs to be watched, so the rear direction is selected; the fourth data items appearing later are selected, and feature extraction is performed on them to obtain fourth features [for example: the number of visitors is 120, the number of visitors is 130]; random combination yields the combined feature that the number of visitors keeps increasing and is about to reach the upper limit; this combined feature is matched with the corresponding abnormal combination feature, the matching degree is greater than the second threshold, and the abnormal item is determined to be that the number of visitors is about to reach the upper limit;
the embodiment of the invention provides a trigger feature library, which improves the efficiency of finding local features of an anomaly: the data items do not all need to be verified one by one, which reduces the occupation of system resources; the selection direction is determined automatically and further fourth data items are obtained for anomaly judgment, which makes the method more intelligent.
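The visitor-count example can be traced with a simplified sketch. The text does not define how a matching degree is computed, so the formulas in `match_degree` and `rising_degree` are assumptions, as are all names. The random combination of features is replaced here by a single ordered combination of the trigger item and the items after it, which is a deliberate simplification and not the combination procedure described above. The values 87, 88, 5 and 99 are the examples from the text.

```python
FIRST_THRESHOLD = 87      # example first threshold from the text
SECOND_THRESHOLD = 88     # example second threshold from the text
PRESET_NUMBER = 5         # example preset number of items to select
TRIGGER_VALUES = [99]     # hypothetical trigger feature library (visitor counts)

def match_degree(value, trigger):
    """Assumed matching degree on a 0-100 scale: closeness of value to trigger."""
    return max(0.0, 100.0 * (1 - abs(value - trigger) / trigger))

def rising_degree(values):
    """Assumed matching degree against the 'keeps increasing' combination feature:
    the percentage of consecutive steps that go up."""
    pairs = list(zip(values, values[1:]))
    if not pairs:
        return 0.0
    return 100.0 * sum(later > earlier for earlier, later in pairs) / len(pairs)

def determine_anomalies(items):
    """items: iterable of (generation_time, visitor_count) first data items."""
    timeline = sorted(items)  # the time axis, ordered by generation time
    found = []
    for i, (time, value) in enumerate(timeline):
        # Only items that match a trigger feature are examined further.
        if not any(match_degree(value, t) >= FIRST_THRESHOLD for t in TRIGGER_VALUES):
            continue
        # Selection direction "rear": take the next PRESET_NUMBER later items.
        later_values = [v for _, v in timeline[i + 1:i + 1 + PRESET_NUMBER]]
        if rising_degree([value] + later_values) >= SECOND_THRESHOLD:
            found.append((time, "number of visitors about to reach the upper limit"))
    return found
```

Only items that match a trigger feature cause any further items to be inspected, which is the resource saving the text attributes to the trigger feature library.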
The embodiment of the invention provides a device for realizing high availability of services of a cloud management platform, wherein an execution module 4 executes the following operations:
matching the first abnormal item with the second abnormal item in the first control group, and if they correspond, taking the corresponding first control group as a second control group;
extracting the third coping strategies in the second control group as the first coping strategies;
querying a preset coping strategy-priority table, and determining the priority corresponding to each first coping strategy;
and sequentially executing the first coping strategies based on the priority.
The working principle and the beneficial effects of the technical scheme are as follows:
the preset coping strategy-priority table is a lookup table containing the priorities corresponding to different coping strategies; because anomalies differ in severity, coping strategies differ in priority, and coping strategies with high priority need to be executed first;
when the first coping strategy corresponding to the first abnormal item is determined based on the coping strategy library, the first abnormal item only needs to be compared with the second abnormal item in each first control group in the coping strategy library; if the two are the same, the first abnormal item corresponds to the second abnormal item, and the corresponding third coping strategy is extracted as a first coping strategy; during execution, the coping strategies are executed in order of priority, which ensures that the most severe anomalies are handled first, so the arrangement is reasonable.
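The lookup and priority ordering described above can be sketched as follows. The dictionary layout of a control group, the convention that a larger number means a higher priority, and the function names are all assumptions for illustration; the text only states that higher-priority coping strategies are executed first.

```python
def select_first_strategies(first_abnormal_item, strategy_library, priority_table):
    """Return the coping strategies for an abnormal item, highest priority first.

    strategy_library: list of control groups, each assumed to be a dict with
                      an "abnormal_item" and a list of "strategies".
    priority_table:   dict mapping a strategy to a number (larger = more urgent,
                      an assumed convention); unknown strategies default to 0.
    """
    strategies = []
    for group in strategy_library:
        # A control group matches when its abnormal item equals the detected one.
        if group["abnormal_item"] == first_abnormal_item:
            strategies.extend(group["strategies"])
    return sorted(strategies, key=lambda s: priority_table.get(s, 0), reverse=True)

def execute_in_order(strategies, executor):
    """Run each strategy through a caller-supplied executor, in the given order."""
    return [executor(strategy) for strategy in strategies]
```

For instance, with a library holding the strategies "notify operator" and "scale out" for one abnormal item and priorities 5 and 9 respectively, the function returns "scale out" first.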
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.
Claims (8)
1. A method for realizing high service availability of a cloud management platform is characterized by comprising the following steps:
step 1: constructing a coping strategy library;
step 2: monitoring the service state of the first cloud management platform to obtain monitoring data;
step 3: determining a first abnormal item based on the monitoring data;
step 4: determining a first coping strategy corresponding to the first abnormal item based on the coping strategy library, and executing it accordingly;
wherein step 1: constructing a coping strategy library, comprises:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies corresponding to the same second abnormal item and taking the second coping strategies as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target to which the third coping strategy corresponds, and taking the first matching degree sum as a second matching degree sum;
querying a preset effect value-matching degree sum-sorting score table, and determining a sorting score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the sorting score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence and taking them as fourth coping strategies;
combining the fourth coping strategy with the second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group into the blank database;
when the first control groups which need to be stored in the blank database are all stored, the blank database is used as a coping strategy library;
wherein n is a positive integer.
2. The method for realizing high availability of the service of the cloud management platform according to claim 1, wherein the obtaining of the second attribute information of the plurality of second cloud management platforms includes:
acquiring a cooperative relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperative relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and simultaneously determining a first distance between the first position and the second position;
constructing, within the first cooperation circle, a second cooperation circle centered on the first circle center with the first distance as a first radius;
determining a third location of a plurality of third cloud management platforms within the second collaboration circle;
determining a second distance between the first location and the third location;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is smaller than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of one-to-one corresponding negative items and negative values;
inquiring a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold value, second attribute information corresponding to the second cloud management platform is obtained, and if not, the second attribute information is not obtained;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of one-to-one corresponding guarantors, guarantee types, and guarantee values;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a sum of guarantee scores;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring second attribute information corresponding to the second cloud management platform, otherwise, not acquiring.
3. The method for realizing high availability of the service of the cloud management platform according to claim 1, wherein the step 3: determining a first anomaly item based on the monitoring data, comprising:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis, and marking each first data item on the time axis based on its generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset number of first data items located in the selection direction from the second data item, and using them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
4. The method for realizing high availability of the service of the cloud management platform according to claim 1, wherein the step 4: determining a first coping strategy corresponding to the first abnormal item based on the coping strategy library, and executing it accordingly, comprises:
matching the first abnormal item with the second abnormal item in the first control group, and if they correspond, taking the corresponding first control group as a second control group;
extracting the third coping strategies in the second control group as the first coping strategies;
querying a preset coping strategy-priority table, and determining the priority corresponding to each first coping strategy;
and sequentially executing the first coping strategies based on the priority.
5. An apparatus for implementing high availability of service by a cloud management platform, comprising:
a building module, used for constructing a coping strategy library;
a monitoring module, used for monitoring the service state of the first cloud management platform and acquiring monitoring data;
a determining module, used for determining a first abnormal item based on the monitoring data;
an execution module, used for determining a first coping strategy corresponding to the first abnormal item based on the coping strategy library and executing it accordingly;
wherein the building module performs the following operations:
acquiring first attribute information of the first cloud management platform, wherein the first attribute information comprises: a plurality of first attribute items;
obtaining the value degree of the first attribute item;
if the value degree is larger than or equal to a preset value degree threshold value, taking the corresponding first attribute item as a second attribute item, and simultaneously taking the rest first attribute items as third attribute items;
acquiring second attribute information of a plurality of second cloud management platforms, wherein the second attribute information comprises: a plurality of fourth attribute items;
matching the second attribute item with the corresponding fourth attribute item to obtain a first matching degree;
summarizing the first matching degree to obtain a first matching degree sum;
if the first matching degree sum is larger than or equal to a preset matching degree sum threshold, taking the corresponding second cloud management platform as a screening target;
acquiring exception handling information of the screening target, wherein the exception handling information comprises: a plurality of one-to-one corresponding second abnormal items, second coping strategies and first effect values;
screening out the second coping strategies corresponding to the same second abnormal item and taking the second coping strategies as third coping strategies;
determining the first effect value of the third coping strategy as a second effect value;
determining the first matching degree sum of the screening target to which the third coping strategy corresponds, and taking the first matching degree sum as a second matching degree sum;
querying a preset effect value-matching degree sum-sorting score table, and determining a sorting score jointly corresponding to the second effect value and the second matching degree sum;
sorting the third coping strategies based on the sorting score to obtain a strategy sequence;
selecting the first n third coping strategies in the strategy sequence and taking them as fourth coping strategies;
combining the fourth coping strategy with the second abnormal item to obtain a first control group;
acquiring a preset blank database, and storing the first control group into the blank database;
when the first control groups which need to be stored in the blank database are all stored, the blank database is used as a coping strategy library;
wherein n is a positive integer.
6. The apparatus for implementing high availability of services for a cloud management platform according to claim 5, wherein the building module performs the following operations:
acquiring a cooperative relationship of the second cloud management platform relative to the first cloud management platform, wherein the cooperative relationship comprises: active cooperation and passive cooperation;
when the cooperation relationship is active cooperation, acquiring a first cooperation circle corresponding to the second cloud management platform;
determining a first position of a first circle center of the first cooperation circle, determining a second position of the first cloud management platform in the first cooperation circle, and simultaneously determining a first distance between the first position and the second position;
constructing, within the first cooperation circle, a second cooperation circle centered on the first circle center with the first distance as a first radius;
determining a third location of a plurality of third cloud management platforms within the second collaboration circle;
determining a second distance between the first location and the third location;
obtaining a first length of a second radius of the first cooperation circle;
calculating the ratio of the second distance to the first length to obtain a first ratio;
acquiring a third cooperation circle of the third cloud management platform;
if the second cloud management platform is in the third cooperation circle, determining a fourth position of the second cloud management platform in the third cooperation circle;
determining a fifth position of a second circle center of the third cooperative circle, and simultaneously determining a third distance between the fourth position and the fifth position;
obtaining a second length of a third radius of the third cooperation circle;
calculating the ratio of the third distance to the second length to obtain a second ratio;
if the second ratio is smaller than or equal to the first ratio, taking the corresponding third cloud management platform as a fourth cloud management platform;
acquiring negative evaluation information of the fourth cloud management platform on the second cloud management platform, wherein the negative evaluation information comprises: a plurality of one-to-one corresponding negative items and negative values;
inquiring a preset negative item-negative value-negative score table, and determining a negative score corresponding to the negative item and the negative value;
summarizing the negative scores to obtain a negative score sum corresponding to the second cloud management platform;
if the negative score sum is less than or equal to a preset negative score sum threshold value, second attribute information corresponding to the second cloud management platform is obtained, and if not, the second attribute information is not obtained;
if the cooperation relationship is passive cooperation, acquiring guarantee information of the second cloud management platform, wherein the guarantee information comprises: a plurality of one-to-one corresponding guarantors, guarantee types, and guarantee values;
querying a preset guarantor-guarantee type-guarantee value-guarantee score table, and determining a guarantee score corresponding to the guarantor, the guarantee type and the guarantee value;
summarizing the guarantee scores to obtain a sum of guarantee scores;
and if the guarantee score sum is greater than or equal to a preset guarantee score sum threshold, acquiring second attribute information corresponding to the second cloud management platform, otherwise, not acquiring.
7. The apparatus for implementing high availability of services for a cloud management platform according to claim 5, wherein the determining module performs the following operations:
extracting a plurality of newly generated first data items corresponding to the same monitoring type in the monitoring data;
acquiring the generation time of the first data item;
establishing a time axis, and marking each first data item on the time axis based on its generation time;
performing feature extraction on the first data item to obtain a first feature;
acquiring a preset trigger feature library, and matching the first feature with a second feature in the trigger feature library to acquire a second matching degree;
if the second matching degree is larger than or equal to a preset first threshold value, taking the corresponding second feature as a third feature, and simultaneously taking the corresponding first data item as a second data item;
inquiring a preset feature-selection direction table, and determining at least one selection direction corresponding to the third feature;
selecting, on the time axis, a preset number of first data items located in the selection direction from the second data item, and using them as fourth data items;
performing feature extraction on the fourth data item to obtain a fourth feature;
randomly combining the first feature and the fourth feature to obtain a plurality of combined features;
querying a preset feature-abnormal combination feature table, and determining at least one first abnormal combination feature corresponding to the third feature;
matching the combined feature with the first abnormal combined feature to obtain a third matching degree;
if the third matching degree is greater than or equal to a preset second threshold value, taking the corresponding first abnormal combination feature as a second abnormal combination feature;
and querying a preset abnormal combination feature-abnormal item table, determining at least one first abnormal item corresponding to the second abnormal combination feature, and finishing the determination.
8. The apparatus for implementing high availability of services for a cloud management platform according to claim 5, wherein the execution module performs the following operations:
matching the first abnormal item with the second abnormal item in the first control group, and if they correspond, taking the corresponding first control group as a second control group;
extracting the third coping strategies in the second control group as the first coping strategies;
querying a preset coping strategy-priority table, and determining the priority corresponding to each first coping strategy;
and sequentially executing the first coping strategies based on the priority.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111487047.1A CN113904912B (en) | 2021-12-08 | 2021-12-08 | Method and device for realizing high availability of service of cloud management platform |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111487047.1A CN113904912B (en) | 2021-12-08 | 2021-12-08 | Method and device for realizing high availability of service of cloud management platform |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN113904912A (en) | 2022-01-07 |
| CN113904912B (en) | 2022-04-08 |
Family
ID=79025632
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202111487047.1A Active CN113904912B (en) | 2021-12-08 | 2021-12-08 | Method and device for realizing high availability of service of cloud management platform |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN113904912B (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114548590B (en) * | 2022-03-02 | 2023-02-03 | 江苏悦达绿色建筑科技有限公司 | Internet of things-based engineering finishing intelligent management platform and method |
| CN115392505B (en) * | 2022-08-29 | 2023-07-18 | 智迪机器人技术(盐城)有限公司 | Abnormality processing system and method for auto-parts automatic installation robot |
| CN116668106B (en) * | 2023-05-22 | 2024-01-09 | 山东鼎夏智能科技有限公司 | Threat information processing system and method |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106161135A (en) * | 2015-04-23 | 2016-11-23 | 中国移动通信集团福建有限公司 | Business transaction failure analysis methods and device |
| CN113157524A (en) * | 2021-04-22 | 2021-07-23 | 深圳壹账通创配科技有限公司 | Big data based exception problem solving method, system, equipment and storage medium |
| CN113377559A (en) * | 2020-03-10 | 2021-09-10 | 北京同邦卓益科技有限公司 | Big data based exception handling method, device, equipment and storage medium |
| CN113572746A (en) * | 2021-07-12 | 2021-10-29 | 腾讯科技(深圳)有限公司 | Data processing method and device, electronic equipment and storage medium |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP5731223B2 (en) * | 2011-02-14 | 2015-06-10 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | Abnormality detection device, monitoring control system, abnormality detection method, program, and recording medium |
Non-Patent Citations (1)
| Title |
|---|
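| Design and Implementation of a Network Fault Management System Based on a Sniffer and a Fault Policy Library; Ma Yueling et al.; Microcomputer Applications; 2004-02-20 (No. 02); pp. 27-29, 36 * |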
Also Published As
| Publication number | Publication date |
|---|---|
| CN113904912A (en) | 2022-01-07 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| CN113904912B (en) | Method and device for realizing high availability of service of cloud management platform | |
| CN105808694B (en) | Online customer service answering system and method | |
| US9699166B2 (en) | Method and apparatus for processing authentication request message in a social network | |
| US8249903B2 (en) | Method and system of determining and evaluating a business relationship network for forming business relationships | |
| CN105335400B (en) | Enquirement for user is intended to obtain the method and device of answer information | |
| CN111079557B (en) | Face recognition-based automatic distribution method and system for power business hall customer service terminals | |
| CN110830551B (en) | Service request processing method, device and system | |
| CN106534392B (en) | Positioning information acquisition method, positioning method and device | |
| CN110166991A (en) | For the method for Positioning Electronic Devices, unit and storage medium | |
| CN110738577A (en) | Community discovery method, device, computer equipment and storage medium | |
| CN111651741B (en) | User identity recognition method, device, computer equipment and storage medium | |
| US20170104744A1 (en) | Method and apparatus for processing authentication request message in a social network | |
| US7853588B2 (en) | Data processing apparatus for propagative correlation | |
| CN111611322B (en) | User information association method and system | |
| CN105335266B (en) | Method for determining the failure of tested equipment | |
| CN110413607B (en) | Distributed counting method, server and system | |
| CN117290719B (en) | Inspection management method and device based on data analysis and storage medium | |
| US20210241367A1 (en) | Debt management capability assessment system and method of analyzing debt management capability information using the same | |
| CN117035800A (en) | Multi-person interaction session method and device based on telephone bank and electronic equipment | |
| CN111711702A (en) | Distributed cooperative interaction method and system based on communication topology | |
| CN113360627B (en) | Online customer service real-time distribution method and system | |
| CN114665986B (en) | Bluetooth key testing system and method | |
| CN114417136B (en) | User object recommendation method and device | |
| CN113127542B (en) | Data anomaly analysis method and device | |
| CN111815255A (en) | Configuration method and device of user application system based on internet insurance data platform |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |
| | EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 2022-01-07; Assignee: Guangzhou lubangtong Internet of things Technology Co.,Ltd.; Assignor: Guangzhou lubangtong Intelligent Technology Co.,Ltd.; Contract record no.: X2024980017701; Denomination of invention: A method and device for achieving high business availability on a cloud management platform; Granted publication date: 2022-04-08; License type: Common License; Record date: 2024-10-10 |