CN110334869A - A training method for mangrove ecological health prediction based on dynamic group optimization algorithm - Google Patents
A training method for mangrove ecological health prediction based on dynamic group optimization algorithm Download PDFInfo
- Publication number
- CN110334869A CN110334869A CN201910612175.0A CN201910612175A CN110334869A CN 110334869 A CN110334869 A CN 110334869A CN 201910612175 A CN201910612175 A CN 201910612175A CN 110334869 A CN110334869 A CN 110334869A
- Authority
- CN
- China
- Prior art keywords
- algorithm
- group
- population
- mangrove
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/02—Agriculture; Fishing; Forestry; Mining
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Human Resources & Organizations (AREA)
- General Physics & Mathematics (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Entrepreneurship & Innovation (AREA)
- General Health & Medical Sciences (AREA)
- Marketing (AREA)
- Tourism & Hospitality (AREA)
- Development Economics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Operations Research (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Educational Administration (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- Data Mining & Analysis (AREA)
- Game Theory and Decision Science (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Software Systems (AREA)
- Quality & Reliability (AREA)
- Biomedical Technology (AREA)
- Artificial Intelligence (AREA)
- Agronomy & Crop Science (AREA)
- Animal Husbandry (AREA)
- Marine Sciences & Fisheries (AREA)
- Mining & Mineral Resources (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明公开了一种基于动态群优化算法的红树林生态健康预测训练方法,包括步骤A,数据预处理;步骤A1,基于PSO算法的数据生成模块;步骤A2,基于相对熵和余弦相似度的相似性评估模块;步骤B,基于DGO算法进行模型训练;步骤B1,组内合作;步骤B2,每一组的质心与其他组的质心进行通信;步骤B3,每一组的成员与种群内的其他成员进行随机交叉;步骤B4,在每一质心之间的通信和每一成员的随机交叉过程中,搜寻所述突变情况,基于该突变情况查出与此对应的成员,将该成员随机转移到其它组;步骤C,结合BP算法和DGO算法配合步骤B进行前馈神经网络的训练。本发明提高了预测数据的准确性以及训练模型的完整度。
The invention discloses a mangrove ecological health prediction training method based on dynamic group optimization algorithm, comprising step A, data preprocessing; step A1, data generation module based on PSO algorithm; step A2, based on relative entropy and cosine similarity Similarity evaluation module; step B, model training based on DGO algorithm; step B1, intra-group cooperation; step B2, the centroid of each group communicates with the centroids of other groups; step B3, the members of each group Other members perform random crossover; step B4, during the communication between each centroid and the random crossover process of each member, search for the mutation situation, find out the corresponding member based on the mutation situation, and transfer the member randomly Go to other groups; step C, combine BP algorithm and DGO algorithm with step B to train the feedforward neural network. The invention improves the accuracy of the prediction data and the integrity of the training model.
Description
技术领域technical field
本发明涉及红树林生态保护技术领域及数据挖掘技术领域,尤其涉及一种 基于动态群优化算法的红树林生态健康预测训练方法。The invention relates to the technical field of mangrove ecological protection and data mining, in particular to a mangrove ecological health prediction training method based on a dynamic group optimization algorithm.
背景技术Background technique
红树林是生长于热带和亚热带海岸潮间带、兼具陆地和海洋生态系统特 性的木本植物群落,有着“造陆先锋”、“海岸卫士”、“生物净化筛”之称, 在防浪护堤、净化污染等方面发挥着至关重要的作用。红树林湿地生态系统素 有“物种基因库”之称,保护好红树林生态系统的完整性是维持生物多样性的 重要基础之一。Mangrove is a woody plant community that grows in the intertidal zone of tropical and subtropical coasts and has the characteristics of both terrestrial and marine ecosystems. It plays a vital role in berms, purification of pollution, etc. Mangrove wetland ecosystems are known as "species gene pools", and protecting the integrity of mangrove ecosystems is one of the important foundations for maintaining biodiversity.
红树林湿地生态环境的监测、保护及病虫害控制对于支撑我国东南沿海 的近海海洋生态安全和可持续发展具有重要战略意义。然而近年来,由于人类 活动的破坏,红树林资源急剧减少,建立起一个可靠的红树林生态健康监测和 评估系统以此加快生态环境监测网络的建设,进而为环保标准的制定和政府决 策提供重要依据,促进环境保障工作的有序高效进行,从而提升环境质量是必 需的。The monitoring, protection and pest control of mangrove wetland ecological environment are of great strategic significance to support the coastal marine ecological security and sustainable development along the southeast coast of my country. However, in recent years, due to the destruction of human activities, mangrove resources have decreased sharply. A reliable mangrove ecological health monitoring and evaluation system has been established to speed up the construction of ecological environment monitoring network, which in turn provides important information for the formulation of environmental protection standards and government decision-making. It is necessary to promote the orderly and efficient progress of environmental protection work, thereby improving the quality of the environment.
为此,现有技术中出现了主要通过神经网络算法而进行的生态监测。基于 神经网络的结构建设,其包括一种学习训练方法,该训练方法主要分为梯度法 和启发法两类:梯度法是指通过计算目标函数的一阶导数、甚至二阶导数快速 求得局部最小值的方法,其是根据梯度下降的方向去寻找最小值,因此它能快 速收敛到最近的最小值,即目标最优值,梯度法中目前应用最广泛的是反向传 播(BP)算法,它是1986年由Rumelhart和McClelland为首的科学家提出的概 念,是一种按照误差逆向传播算法训练的多层前馈神经网络;启发式算法是一 个基于直观或经验构造的算法,在可接受的花费(指计算时间和空间)下给出待解决组合优化问题每一个实例的一个可行解,该可行解与最优解的偏离程度 一般不能被预计。For this reason, ecological monitoring mainly through neural network algorithms has appeared in the prior art. The structure construction based on neural network includes a learning and training method. The training method is mainly divided into two categories: gradient method and heuristic method. The minimum value method is to find the minimum value according to the direction of gradient descent, so it can quickly converge to the nearest minimum value, that is, the target optimal value. The most widely used gradient method is the back propagation (BP) algorithm. , which is a concept proposed by scientists led by Rumelhart and McClelland in 1986. It is a multi-layer feedforward neural network trained according to the error back-propagation algorithm; heuristic algorithm is an algorithm based on intuition or experience. A feasible solution for each instance of the combinatorial optimization problem to be solved is given under the cost (referring to computing time and space), and the degree of deviation of the feasible solution from the optimal solution cannot generally be predicted.
目前,启发式算法以仿自然体算法为主,主要有遗传算法、模拟退火法、 粒子群优化算法等。与启发式算法相比,梯度法局部搜索能力强,能快速收敛 到最优值。但是,其很难找到全局的最优值,同样地,在该梯度法原理下,初 始点的位置显得至关重要,如图2所示,如果红点所在位置是初始值,那么该 算法会带着它向黑色线条所标注的方向去找最小值,然后就会找到右边的局部 最小值并且停滞不前,可见,该点并不是目标全局最优值。梯度算法在具体训 练中存在以下问题:1)算法对初始值敏感,不同的初始值会得到不同的结果; 2)容易陷入局部最优值;3)学习率是梯度法中一个很重要的超参数,如果学 习率过大,算法难以收敛,太小又可能过早收敛,而调节参数是一件比较困难 的事情;4)如果搜索空间很复杂,容易出现震荡的现象。At present, the heuristic algorithm is mainly based on the natural body algorithm, mainly including genetic algorithm, simulated annealing method, particle swarm optimization algorithm and so on. Compared with the heuristic algorithm, the gradient method has strong local search ability and can quickly converge to the optimal value. However, it is difficult to find the global optimal value. Similarly, under the principle of the gradient method, the position of the initial point is very important. As shown in Figure 2, if the position of the red point is the initial value, then the algorithm will Take it to the direction marked by the black line to find the minimum value, and then you will find the local minimum on the right and stagnate. It can be seen that this point is not the target global optimal value. The gradient algorithm has the following problems in the specific training: 1) The algorithm is sensitive to the initial value, and different initial values will get different results; 2) It is easy to fall into the local optimal value; 3) The learning rate is a very important superimposition in the gradient method. If the learning rate is too large, the algorithm is difficult to converge, and if the learning rate is too small, it may converge prematurely, and it is difficult to adjust the parameters; 4) If the search space is very complex, it is prone to oscillation.
与梯度法相比,启发式算法非单一地按照特定的方向更新,其允许随机突 变,并三类算法:基于单个解的算法,基于种群的算法和其他相关算法。大多 数启发式算法主要是通过模拟动物的行为,从而形成一种寻找全局最优解的搜 索方法。比如早期的遗传算法,它是通过模拟大自然中生物进化的历程来解决 问题的。大自然中一个种群经历过若干代的自然选择后,剩下的种群必定是适 应环境的。把一个问题所有的解看做一个种群,经历过若干次的自然选择以后, 剩下的解便问题的最优解。遗传算法大致过程分为初始化编码、个体评价、选 择、交叉、变异。启发式算法的优点就在于,它克服了梯度法对初始值敏感和 容易陷入局部最优值的缺陷,在全局探索上更有优势。但也由于它的随机性, 它的收敛速度就要慢一些,而且在局部搜索方面都有一定的限制。Compared with the gradient method, the heuristic algorithm does not update in a specific direction, it allows random mutation, and there are three types of algorithms: algorithm based on a single solution, algorithm based on population and other related algorithms. Most of the heuristic algorithms mainly form a search method to find the global optimal solution by simulating the behavior of animals. For example, the early genetic algorithm solves problems by simulating the process of biological evolution in nature. After a population in nature has undergone several generations of natural selection, the remaining populations must be adapted to the environment. All solutions of a problem are regarded as a population, and after several natural selections, the remaining solutions are the optimal solutions of the problem. The general process of genetic algorithm is divided into initialization coding, individual evaluation, selection, crossover and mutation. The advantage of the heuristic algorithm is that it overcomes the defect that the gradient method is sensitive to the initial value and easy to fall into the local optimal value, and has more advantages in global exploration. But also because of its randomness, its convergence speed is slower, and there are certain limitations in local search.
而对于红树林生态系统而言,其监测数据涉及时间、空间及其数值变化, 同时预测模型属于非线性模式,复杂程度和不完整性较高,利用神经网络结构 以及上述训练方法得到的模型与现实差异较大,预测准确率偏低。因此,一种 新的方法或模型用于红树林生态系统健康的预测。For the mangrove ecosystem, the monitoring data involves time, space and its numerical changes. At the same time, the prediction model is a nonlinear model with high complexity and incompleteness. The model obtained by using the neural network structure and the above training method is consistent with the The reality is quite different, and the prediction accuracy is low. Therefore, a new method or model is used for the prediction of mangrove ecosystem health.
发明内容SUMMARY OF THE INVENTION
本发明的目的在于提供了一种基于动态群优化算法的红树林生态健康预 测训练方法,提供了科学的训练模型,提高了预测数据的准确性以及训练模型 的完整度。The purpose of the present invention is to provide a mangrove ecological health prediction training method based on a dynamic group optimization algorithm, provide a scientific training model, and improve the accuracy of the prediction data and the integrity of the training model.
本发明解决其技术问题所采用的技术方案是:一种基于动态群优化算法的 红树林生态健康预测训练方法,所述方法包括以下步骤:The technical scheme adopted by the present invention to solve the technical problem is: a mangrove ecological health prediction training method based on a dynamic group optimization algorithm, and the method comprises the following steps:
步骤X,建立红树林生态数据集;Step X, establishing a mangrove ecological data set;
步骤A,数据预处理,基于粒子群优化和信息熵的算法进行数据生成;Step A, data preprocessing, data generation based on the algorithm of particle swarm optimization and information entropy;
步骤A1,基于PSO算法的数据生成模块;Step A1, based on the data generation module of PSO algorithm;
步骤A2,基于相对熵和余弦相似度的相似性评估模块;Step A2, a similarity evaluation module based on relative entropy and cosine similarity;
步骤B,基于DGO算法进行模型训练,其模型包括种群,种群内包括若 干个由质心和成员所组成的组;In step B, model training is performed based on the DGO algorithm, and its model includes a population, and the population includes several groups consisting of centroids and members;
步骤B1,组内合作,其用于成员更新,以组合适应度最大的成员并得出 最优解,该成员更新包括正常更新和突变;Step B1, intra-group cooperation, which is used for member update to combine members with the greatest fitness and obtain an optimal solution, and this member update includes normal update and mutation;
步骤B2,每一组的质心与其他组的质心进行通信;Step B2, the centroid of each group communicates with the centroids of other groups;
步骤B3,每一组的成员随机与种群内的其他成员进行交叉;Step B3, the members of each group randomly cross with other members in the population;
步骤B4,在步骤B2和B3中,搜寻所述突变情况,基于该突变情况查出 与此对应的成员,将该成员随机转移到其它组;Step B4, in steps B2 and B3, search for the mutation situation, find out the member corresponding to this based on the mutation situation, and transfer the member to other groups at random;
步骤C,结合BP算法和DGO算法配合步骤B进行前馈神经网络的训练。Step C, combine the BP algorithm and the DGO algorithm with the step B to train the feedforward neural network.
优选的,所述步骤C中的训练过程包括以下步骤:Preferably, the training process in the step C includes the following steps:
步骤C1,基于DGO算法,通过组间通信和组内合作进行种群中每组内的 个体成员之间适应度的计算,该计算是基于BP算法,其包括首先参数初始化, 而后对其进行梯度下降训练,最后通过函数损失值的计算得出所述适应度;Step C1, based on the DGO algorithm, calculate the fitness between individual members in each group in the population through inter-group communication and intra-group cooperation. The calculation is based on the BP algorithm, which includes first parameter initialization, and then gradient descent. training, and finally the fitness is obtained by calculating the loss value of the function;
步骤C2,基于步骤B4更新每一组的质心;Step C2, update the centroid of each group based on step B4;
步骤C3,是否满足迭代终止条件,若是,则根据当前数据信息得出相关 参数及适应度;若否,则返回步骤C1并依次重复前述步骤,直到满足迭代终 止条件为止。Step C3, whether the iteration termination condition is satisfied, if yes, obtain relevant parameters and fitness according to the current data information; if not, return to Step C1 and repeat the foregoing steps in turn until the iteration termination condition is satisfied.
优选的,所述步骤A2还包括计算种群内每个个体成员之间的适合度,并 根据该适合度更新种群内的个体,同时得到最大适应度的个体,将其结果添加 到数据集中,以此对数据集进行评估,该评估过程包括:Preferably, the step A2 also includes calculating the fitness between each individual member in the population, and updating the individuals in the population according to the fitness, obtaining the individual with the largest fitness, and adding the result to the data set to This dataset is evaluated, and the evaluation process includes:
S1,将数据集编码为种群S1, encode the dataset into populations
P=(g1,g2,...,gn)P=(g 1 , g 2 , ..., g n )
g=(x1,x2,...,xm)g=(x 1 , x 2 , . . . , x m )
其中,P为种群,g为种群中的组,n为设置的种群组数,m为组内个体的 长度;Among them, P is the population, g is the group in the population, n is the set number of population groups, and m is the length of the individuals in the group;
S2,通过PSO算法进行种群扩充,PSO中包括与其每一个个体对应的粒 子,该粒子存在于d维空间中,在其种群扩充过程中,包括对粒子在该d维空 间中的位置、速度和其经过的历史最优点位置,以及种群所经过的最优点位置, 上述表达式如下:S2, the population is expanded by the PSO algorithm. The PSO includes a particle corresponding to each individual, and the particle exists in the d-dimensional space. During the population expansion process, it includes the position, velocity and The historical optimal position that it has passed through, and the optimal position that the population has passed through, the above expressions are as follows:
粒子i的位置,xi=(xi1,xi2,...,xid),i=1,2,...,m;the position of particle i, x i = (x i1 , x i2 ,..., x id ), i=1, 2,..., m;
粒子i的速度,vi=(vi1,vi2,...,vid),i=1,2,...,m;Velocity of particle i, v i =(v i1 , v i2 , . . . , v id ), i=1, 2, . . . , m;
粒子i经过的最优点位置,pi=(pi1,pi2,...,pid),i=1,2,...,m;The position of the optimal point passed by the particle i , pi =( pi1 , pi2 ,..., pid ), i=1, 2,...,m;
种群所经过的最优点位置,pg=(pg1,pg2,...,pgd);The optimal point position passed by the population, p g = (p g1 , p g2 , . . . , p gd );
其中,每个粒子以当前的位置和速度进行位置更新,所述更新公式如下:The position of each particle is updated with the current position and velocity, and the update formula is as follows:
t时刻到t+1时刻的速度, The velocity from time t to time t+1,
t时刻到t+1时刻的位置, The position from time t to time t+1,
其中,为当前粒子所经过的最优点位置,为当前粒子所经过的最优点 位置,ω为惯性权重,c1c2为学习因子,r1r2为[0,1]之间的随机数;in, is the optimal point position passed by the current particle, is the optimal point position passed by the current particle, ω is the inertia weight, c 1 c 2 is the learning factor, r 1 r 2 is a random number between [0, 1];
S3,评估所扩充的新个体,其包括对相对熵和余弦相似度的计算,计算公 式如下:S3, evaluate the expanded new individual, which includes the calculation of relative entropy and cosine similarity, and the calculation formula is as follows:
f(PO,PG)=fE(PO,PG)+fK(PO,PG)+fC(PO,PG)f(PO, PG ) = fE (PO, PG ) + fK (PO, PG ) + fC ( PO , PG )
其中,PO,PG分别表示原始的红树林生态监测数据和生成数据,f(PO,PG)表 示相似度;Among them, PO and PG represent the original mangrove ecological monitoring data and generated data, respectively, and f( PO , PG ) represents the similarity;
fE(PO,PG)=|H(PG)-H(Po)|f E (P O , P G )=|H(P G )-H(P o )|
其中,H(PG)表示生成数据的信息熵,H(Po)表示原始数据的信息熵;Among them, H(P G ) represents the information entropy of the generated data, and H(P o ) represents the information entropy of the original data;
和分别表示PO,PG在第i位置的概率值; and respectively represent the probability value of PO and PG at the i - th position;
PG(Xi)和PO(Xi)分别表示PO,PG在第i位置的值;P G (X i ) and P O (X i ) represent the values of P O and P G at the i-th position, respectively;
依次重复步骤S2和步骤S3,以配合所述步骤C3,直到满足迭代终止条 件为止。Steps S2 and S3 are sequentially repeated to cooperate with the step C3 until the iteration termination condition is satisfied.
优选的,所述步骤B1包括更新公式:Preferably, the step B1 includes updating the formula:
其中,xi,j,k表示第k代第i组的第j个成员,w是决定移动方向的权重,r为0 到1之间的随机数,Ci为第i组的质心,Gbest为全局最优值,μ是服从μ~(0,s2)的随 机数,s为步长。Among them, x i, j, k represent the j-th member of the i-th group of the k-th generation, w is the weight that determines the moving direction, r is a random number between 0 and 1, C i is the centroid of the i-th group, G best is the global optimal value, μ is a random number obeying μ~(0, s 2 ), and s is the step size.
优选的,所述步骤B2中每一组的质心与其他组的质心进行通信包括质心 的移动,其质心移动采用Lévy随机游走,该Lévy随机游走的公式如下:Preferably, in the step B2, the communication between the centroid of each group and the centroids of other groups includes the movement of the centroid, and the movement of the centroid adopts a Lévy random walk, and the formula of the Lévy random walk is as follows:
其中,α为其步长,表示第k代第i组的质心,表示一种entry-wise乘法, Lévy(λ)是一个服从Lévy分布的随机数。where α is the step size, represents the centroid of the i-th group of the k-th generation, Representing an entry-wise multiplication, Lévy(λ) is a random number that obeys the Lévy distribution.
优选的,所述步骤B3还包括随机选择交叉算子和偏置随机游走算子,并 以如下公式进行选择:Preferably, the step B3 also includes randomly selecting a crossover operator and a biased random walk operator, and selects it with the following formula:
随机选择算子, random selection operator,
偏置随机游走算子,Biased random walk operator,
其中,r是随机数生成器,Cr是交叉概率。where r is the random number generator and Cr is the crossover probability.
优选的,所述步骤C将MSE作为BP算法中的学习误差函数,其计算公 式为: Preferably, in the step C, MSE is used as the learning error function in the BP algorithm, and its calculation formula is:
DGO算法中的适应度计算公式为: The fitness calculation formula in the DGO algorithm is:
其中,表示第i个训练样本的第j个输出值,yi,j表示第i个训练样本的第j个 期望输出值,n是训练样本的个数,m是输出维数。in, represents the jth output value of the ith training sample, y i, j represents the jth expected output value of the ith training sample, n is the number of training samples, and m is the output dimension.
本发明的有益效果在于:通过BP算法与DGO算法的结合,实现了复杂 度较高的红树林生态健康数据的预测模型训练;通过基于粒子群优化和信息熵 的算法,进而基于红树林生态的原始数据生成规模可控的仿真数据,从而为训 练模型提供了可靠的数据支撑;通过数据预处理,提高了数据生成效率,同时, 对传统数据集采集后的不完整性和预测分类的不准确性进行了有效解决,增强 了训练模型的抗过拟合能力;通过BP算法和动态群优化算法训练前馈神经网 络,将两种算法的优势相结合,既在避免全局探索时陷入局部最优点的条件下 使得局部搜索收敛速度较快,进而对复杂的训练模型能够发挥高效的训练能力,从而在红树林生态保护技术领域之外的数据挖掘技术领域也能得到较好的 应用,增强了实用性。The beneficial effects of the invention are as follows: through the combination of the BP algorithm and the DGO algorithm, the prediction model training of the mangrove ecological health data with high complexity is realized; The original data generates simulation data with a controllable scale, thus providing reliable data support for the training model; through data preprocessing, the data generation efficiency is improved, and at the same time, the incompleteness of traditional data sets after collection and the inaccuracy of prediction and classification are eliminated. It effectively solves the problem and enhances the anti-overfitting ability of the training model; the feedforward neural network is trained by the BP algorithm and the dynamic group optimization algorithm, and the advantages of the two algorithms are combined, which not only avoids falling into the local optimum point during global exploration Under the conditions of the local search, the convergence speed is faster, and the complex training model can be effectively trained, so that it can also be well applied in the field of data mining technology outside the field of mangrove ecological protection technology, which enhances the practicality. sex.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将结合附 图及实施例对本发明作进一步说明,下面描述中的附图仅仅是本发明的部分实 施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以 根据这些附图获得其他附图:In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the present invention will be further described below with reference to the accompanying drawings and embodiments. Ordinary technicians can also obtain other drawings based on these drawings without creative labor:
图1为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 训练模型的训练流程图;Fig. 1 is a kind of training flow chart of training model in the mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图2为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 全局最优质搜寻示意图;Fig. 2 is a kind of global best quality search schematic diagram in a kind of mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图3为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 群主变异的转移示意图;Fig. 3 is the transfer schematic diagram of group master variation in a kind of mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图4为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 LF和CA在训练数据集上的最小值和平均值的列表;Fig. 4 is a list of minimum values and mean values of LF and CA on the training data set in a kind of mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图5为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 LF和CA在测试数据集上的最小值和平均值的列表;Fig. 5 is the list of the minimum value and the mean value of LF and CA on the test data set in a kind of mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图6为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 反映适应度如何随迭代次数变化的适应度收敛曲线的示意图;6 is a schematic diagram of a fitness convergence curve reflecting how fitness changes with the number of iterations in a mangrove ecological health prediction training method based on a dynamic group optimization algorithm of the present invention;
图7为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 的数据集特征表;Fig. 7 is a kind of data set characteristic table in the mangrove ecological health prediction training method based on dynamic group optimization algorithm of the present invention;
图8为本发明一种基于动态群优化算法的红树林生态健康预测训练方法中 的数据集标签表。Fig. 8 is a data set label table in a mangrove ecological health prediction training method based on the dynamic group optimization algorithm of the present invention.
具体实施方式Detailed ways
为了使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发 明实施例中的技术方案进行清楚、完整的描述,显然,所描述的实施例是本发 明的部分实施例,而不是全部实施例。基于本发明的实施例,本领域普通技术 人员在没有付出创造性劳动的前提下所获得的所有其他实施例,都属于本发明 的保护范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the following will be described clearly and completely in combination with the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, and Not all examples. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present invention.
需要注意的是,本申请所述的“步骤A1”和“步骤A2”属于“步骤A” 中的两部分构成内容,其没有定向的先后关系,具体流程关系以说明书中的实 施例为准,另外,它们之间的实施顺序不会对数据预处理造成影响。It should be noted that "step A1" and "step A2" described in this application belong to the two parts of "step A", and there is no directional relationship between them. The specific process relationship is subject to the embodiments in the description. In addition, the order in which they are implemented does not affect data preprocessing.
在实施例1中,如图1所示,一种基于动态群优化算法的红树林生态健康 预测训练方法,所述方法包括以下步骤:In embodiment 1, as shown in Figure 1, a kind of mangrove ecological health prediction training method based on dynamic group optimization algorithm, described method comprises the following steps:
步骤X,建立红树林生态数据集;Step X, establishing a mangrove ecological data set;
步骤A,数据预处理,基于粒子群优化和信息熵的算法进行数据生成;Step A, data preprocessing, data generation based on the algorithm of particle swarm optimization and information entropy;
步骤A1,基于PSO算法的数据生成模块;Step A1, based on the data generation module of PSO algorithm;
步骤A2,基于相对熵和余弦相似度的相似性评估模块;Step A2, a similarity evaluation module based on relative entropy and cosine similarity;
步骤B,基于DGO算法进行模型训练,其模型包括种群,种群内包括由 质心和成员所组成的若干个组;In step B, model training is performed based on the DGO algorithm, and its model includes a population, and the population includes several groups consisting of centroids and members;
步骤B1,组内合作,其用于成员更新,以组合适应度最大的成员并得出 最优解,该更新包括正常更新和突变;Step B1, intra-group cooperation, which is used for member update, to combine members with the greatest fitness and obtain an optimal solution, and this update includes normal update and mutation;
步骤B2,每一组的质心与其他组的质心进行通信;Step B2, the centroid of each group communicates with the centroids of other groups;
步骤B3,每一组的成员与种群内的其他成员进行随机交叉;Step B3, the members of each group are randomly crossed with other members in the population;
步骤B4,在每一质心之间的通信和每一成员的随机交叉过程中,搜寻所 述突变情况,基于该突变情况查出与此对应的成员,将该成员随机转移到其它 组;Step B4, in the communication between each centroid and the random crossover process of each member, search for the mutation situation, find out the corresponding member based on the mutation situation, and randomly transfer the member to other groups;
步骤C,结合BP算法和DGO算法配合步骤B进行前馈神经网络的训练。Step C, combine the BP algorithm and the DGO algorithm with the step B to train the feedforward neural network.
首先,在步骤X中,建立红树林数据集,如图该红树林数据集包括不同特 征,其中每一个特征对应一个预测指标:盐度年度变化、PH值、活性磷酸盐 /(μg/L)、无机氮/(μg/L)、汞/(μg/g)、镉/(μg/g)、铅/(μg/g)、砷/(μg/g)、油 类/(μg/g)、硫化物/(μg/g)、有机碳(%)、浮游植物密度/(x105个/m3)、浮游动 物密度/(x103个/m3)、浮游动物生物量/(mg/m3)、底栖动物密度/(个/m2)和底栖 动物生物量/(g/m2),与前述指标依次对应的数据为1.534、9.25、11.833、298.333、 0.12、0.119、1.6667、1.0067、48.333、44.7、0.96、2.39、0.813、27.68、360 和365.35。根据该数据建立特征集X:First, in step X, a mangrove data set is established. As shown in the figure, the mangrove data set includes different features, each of which corresponds to a predictor: annual change in salinity, pH value, active phosphate/(μg/L) , inorganic nitrogen/(μg/L), mercury/(μg/g), cadmium/(μg/g), lead/(μg/g), arsenic/(μg/g), oil/(μg/g) , sulfide/(μg/g), organic carbon (%), phytoplankton density/(x10 5 /m 3 ), zooplankton density/(x10 3 /m 3 ), zooplankton biomass/(mg/ m 3 ), benthic density/(individuals/m 2 ) and benthic biomass/(g/m 2 ), the data corresponding to the aforementioned indicators are 1.534, 9.25, 11.833, 298.333, 0.12, 0.119, 1.6667 , 1.0067, 48.333, 44.7, 0.96, 2.39, 0.813, 27.68, 360, and 365.35. Create a feature set X from this data:
其中,n为样本个数,m为特征数,一个样本包含1个以上的特征。如图 8所示,根据指标数据进行标签划分,数据值以下的指标归为标签“0”,等于 数据值的指标归为标签“1”,数据值以上的指标归为标签:“2”,标签“0” 对应的等级为健康,标签“1”对应的等级为亚健康,标签“2”对应的等级为 不健康。最终按照标签表进行预测结果的输出,输出标签集为Y:Among them, n is the number of samples, m is the number of features, and a sample contains more than one feature. As shown in Figure 8, labels are divided according to the index data. The index below the data value is classified as the label "0", the index equal to the data value is classified as the label "1", and the index above the data value is classified as the label: "2", The level corresponding to the label "0" is healthy, the level corresponding to the label "1" is sub-health, and the level corresponding to the label "2" is unhealthy. Finally, the output of the prediction result is performed according to the label table, and the output label set is Y:
具体的,在所述步骤A中,粒子群优化和信息熵的算法是用于生成数据, 以通过模拟少量的原始红树林生态监测数据生成规模性的与原始数据分布相 近的数据集,并形成包含d维空间的训练模型。在步骤A1和步骤A2所结合 的两种算法中,所有原始数据处于一个数据群体中,其数据群体中包含与每一 单位数据所对应的个体。其中,对群体中的所有个体进行随机初始化,其每个 个体构成一条生成数据。在其算法中,包括迭代机制,在其每次迭代过程中, 首先对每个个体相互的适合度进行计算,包括适应度函数的设置,该适应度函 数是指相似度评估函数;而后根据适应度大小更新种群个体,并以其得出适应 度最高的个体及该适应度;最后,将适应度最高的个体作为结果并添加至数据 集中。Specifically, in the step A, the algorithm of particle swarm optimization and information entropy is used to generate data, so as to generate a large-scale data set similar to the original data distribution by simulating a small amount of original mangrove ecological monitoring data, and form A trained model containing a d-dimensional space. In the two algorithms combined in step A1 and step A2, all the original data are in a data population, and the data population includes the individual corresponding to each unit of data. Among them, all individuals in the group are randomly initialized, and each individual constitutes a piece of generated data. In its algorithm, it includes an iterative mechanism. In each iteration process, the mutual fitness of each individual is first calculated, including the setting of the fitness function, which refers to the similarity evaluation function; Update the population individuals with the size of the fitness, and obtain the individual with the highest fitness and the fitness; finally, the individual with the highest fitness is used as the result and added to the data set.
PSO算法为粒子群优化算法,该PSO算法中包括粒子群集,粒子群集由 规模化的粒子组成,每一粒子对应为种群中的一个个体。其中,每一个粒子均 在d维空间中进行最优点位置的搜索,每一个粒子均匹配适应度函数确定适应 值以判断当前位置与最优点的接近程度。所述粒子具有最优点记忆,在其搜索 过程中,会对所经过的最优点位置进行记录。每个粒子在搜索过程中的速度会 决定其运动的距离和方向,该速度是根据它本身的运动习惯以及相伴粒子的运 动习惯进行动态调整。所述运动习惯为粒子以往到达最优点位置所取的速度, 以及与种群以往所经过的最优点位置的对应关系,以其惯性参数相模拟所形成的训练经验。The PSO algorithm is a particle swarm optimization algorithm. The PSO algorithm includes particle clusters. The particle clusters are composed of large-scale particles, and each particle corresponds to an individual in the population. Among them, each particle searches for the optimal point position in the d-dimensional space, and each particle matches the fitness function to determine the fitness value to judge the proximity of the current position to the optimal point. The particle has an optimal point memory, and during its search, it records the location of the optimal point that it passes through. The speed of each particle in the search process will determine the distance and direction of its motion, and the speed is dynamically adjusted according to its own motion habits and the motion habits of accompanying particles. The motion habit is the speed that the particle has taken to reach the optimal point position in the past, and the corresponding relationship with the optimal point position that the population has passed through in the past, and the training experience formed by simulating its inertial parameters.
DGO算法为动态群优化算法,其主要通过群算法和进化算法相结合而进 行计算,其中,群算法用于全局的粒子探索;进化算法用于局部的粒子最优点 开发。在此计算过程的训练模型中,包括每个组由两部分组成的质心和成员, 若干个组可构成一个种群,其中,每组的质心记录该组所发现的最优解,即与 个体间相对应的最大适合度。The DGO algorithm is a dynamic swarm optimization algorithm, which is mainly calculated by combining the swarm algorithm and the evolutionary algorithm. Among them, the swarm algorithm is used for global particle exploration; the evolutionary algorithm is used for local particle optimal point development. In the training model of this calculation process, including the centroid and members of each group consisting of two parts, several groups can form a population, wherein the centroid of each group records the optimal solution found by the group, that is, the relationship with the individual. The corresponding maximum fitness.
在所述步骤B4中,如图3所示,所述突变情况属于群组变异,该群组变 异是为了节约计算资源和避免陷入局部最优而设置的。其中,如果某一组中的 某一个成员在给定的尝试次数内无法改进自己,就将它随机转移到其他组。与 其他启发式算法算法相比,DGO算法对于训练神经网络更有效,因为它既考 虑了全局探索,也考虑了局部开发。上述四个步骤过程确保了这些良好的性能, 在解决复杂问题时有效地避免了收敛慢、对初始值敏感以及容易陷入局部最优 的问题。In the step B4, as shown in Fig. 3, the mutation situation belongs to the group mutation, and the group mutation is set to save computing resources and avoid falling into a local optimum. Among them, if a member of a certain group cannot improve itself within a given number of attempts, it is randomly transferred to other groups. Compared with other heuristic algorithms, the DGO algorithm is more efficient for training neural networks because it considers both global exploration and local exploitation. The above four-step process ensures these good performances, effectively avoiding the problems of slow convergence, sensitivity to initial values, and easy to fall into local optimum when solving complex problems.
进一步地,所述步骤C中的训练过程包括以下步骤:Further, the training process in the step C includes the following steps:
步骤C1,基于DGO算法,通过组间通信和组内合作进行种群中每组内的 个体成员之间适应度的计算,该计算是基于BP算法,其包括首先参数初始化, 而后对其进行梯度下降训练,最后通过函数损失值的计算得出所述适应度;Step C1, based on the DGO algorithm, calculate the fitness between individual members in each group in the population through inter-group communication and intra-group cooperation. The calculation is based on the BP algorithm, which includes first parameter initialization, and then gradient descent. training, and finally the fitness is obtained by calculating the loss value of the function;
步骤C2,基于步骤B4更新每一组的质心;Step C2, update the centroid of each group based on step B4;
步骤C3,是否满足迭代终止条件,若是,则根据当前数据信息得出相关 参数及适应度;若否,则返回步骤C1并依次重复前述步骤,直到满足迭代终 止条件为止。Step C3, whether the iteration termination condition is satisfied, if yes, obtain relevant parameters and fitness according to the current data information; if not, return to Step C1 and repeat the foregoing steps in turn until the iteration termination condition is satisfied.
其中,在初始化阶段,参数会被初始化。将部分参数预先设定阈值,例如 种群大小和每组所含成员个数;而另一些参数是随机生成的,比如种群的出发 点,即所有个体的初始状态。每一次迭代,通过DGO算法将得到的个体作为 BP算法的参数初始值,再经过BP算法的训练得到适应度,然后DGO算法会 根据所有个体的适应度大小对种群进行更新。如此循环迭代,得到最终的最佳 参数和最佳适应度。所述迭代终止条件为组合适应度最大的成员并得出最优 解,即组内成员更新的完成。Among them, in the initialization phase, the parameters will be initialized. Some parameters are preset with thresholds, such as population size and the number of members in each group; while other parameters are randomly generated, such as the starting point of the population, that is, the initial state of all individuals. In each iteration, the individual obtained by the DGO algorithm is used as the initial value of the parameters of the BP algorithm, and then the fitness is obtained through the training of the BP algorithm, and then the DGO algorithm will update the population according to the fitness of all individuals. Iterates in this way to get the final optimal parameters and optimal fitness. The iterative termination condition is to combine the members with the largest fitness and obtain the optimal solution, that is, the completion of the update of members in the group.
进一步地,所述步骤A2还包括计算种群内每个个体成员之间的适合度, 并根据该适合度更新种群内的个体,同时得到最大适应度的个体,将其结果添 加到数据集中,以此对数据集进行评估,该评估过程包括:Further, the step A2 also includes calculating the fitness between each individual member in the population, and updating the individuals in the population according to the fitness, obtaining the individual with the largest fitness at the same time, and adding the result to the data set. This dataset is evaluated, and the evaluation process includes:
S1,将数据集编码为种群S1, encode the dataset into populations
P=(g1,g2,...,gn)P=(g 1 , g 2 , ..., g n )
g=(x1,x2,...,xm)g=(x 1 , x 2 , . . . , x m )
其中,P为种群,g为种群中的组,n为设置的种群组数,m为组内个体的 长度;Among them, P is the population, g is the group in the population, n is the set number of population groups, and m is the length of the individuals in the group;
S2,通过PSO算法进行种群扩充,PSO中包括与其每一个个体对应的粒 子,该粒子存在于d维空间中,在其种群扩充过程中,包括对粒子在该d维空 间中的位置、速度和其经过的历史最优点位置,以及种群所经过的最优点位置, 上述表达式如下:S2, the population is expanded by the PSO algorithm. The PSO includes a particle corresponding to each individual, and the particle exists in the d-dimensional space. During the population expansion process, it includes the position, velocity and The historical optimal position that it has passed through, and the optimal position that the population has passed through, the above expressions are as follows:
粒子i的位置,xi=(xi1,xi2,...,xid),i=1,2,...,m;the position of particle i, x i = (x i1 , x i2 ,..., x id ), i=1, 2,..., m;
粒子i的速度,vi=(vi1,vi2,...,vid),i=1,2,...,m;Velocity of particle i, v i =(v i1 , v i2 , . . . , v id ), i=1, 2, . . . , m;
粒子i经过的最优点位置,pi=(pi1,pi2,...,pid),i=1,2,...,m;The position of the optimal point passed by the particle i , pi =( pi1 , pi2 ,..., pid ), i=1, 2,...,m;
种群所经过的最优点位置,pg=(pg1,pg2,...,pgd);The optimal point position passed by the population, p g = (p g1 , p g2 , . . . , p gd );
其中,每个粒子以当前的位置和速度进行位置更新,所述更新公式如下:The position of each particle is updated with the current position and velocity, and the update formula is as follows:
t时刻到t+1时刻的速度, The velocity from time t to time t+1,
t时刻到t+1时刻的位置, The position from time t to time t+1,
其中,为当前粒子所经过的最优点位置,为当前粒子所经过的最优点 位置,ω为惯性权重,c1c2为学习因子,r1r2为[0,1]之间的随机数;in, is the optimal point position passed by the current particle, is the optimal point position passed by the current particle, ω is the inertia weight, c 1 c 2 is the learning factor, r 1 r 2 is a random number between [0, 1];
S3,评估所扩充的新个体,其包括对相对熵和余弦相似度的计算,计算公 式如下:S3, evaluate the expanded new individual, which includes the calculation of relative entropy and cosine similarity, and the calculation formula is as follows:
f(PO,PG)=fE(PO,PG)+fK(PO,PG)+fC(PO,PG)f(PO, PG ) = fE (PO, PG ) + fK (PO, PG ) + fC ( PO , PG )
其中,PO,PG分别表示原始的红树林生态监测数据和生成数据,f(PO,PG)表 示相似度;Among them, PO and PG represent the original mangrove ecological monitoring data and generated data, respectively, and f( PO , PG ) represents the similarity;
fE(PO,PG)=|H(PG)-H(Po)|f E (P O , P G )=|H(P G )-H(P o )|
其中,H(PG)表示生成数据的信息熵,H(Po)表示原始数据的信息熵;Among them, H(P G ) represents the information entropy of the generated data, and H(P o ) represents the information entropy of the original data;
和分别表示PO,PG在第i位置的概率值; and respectively represent the probability value of PO and PG at the i - th position;
PG(Xi)和PO(Xi)分别表示PO,PG在第i位置的值;P G (X i ) and P O (X i ) represent the values of P O and P G at the i-th position, respectively;
依次重复步骤S2和步骤S3,以配合所述步骤C3,直到满足迭代终止条 件为止。Steps S2 and S3 are sequentially repeated to cooperate with the step C3 until the iteration termination condition is satisfied.
进一步地,所述步骤B1包括更新公式:Further, the step B1 includes updating the formula:
其中,xi,j,k表示第k代第i组的第j个成员,w是决定移动方向的权重,r为0 到1之间的随机数,Ci为第i组的质心,Gbest为全局最优值,μ是服从μ~(0,s2)的随 机数,s为步长。Among them, x i, j, k represent the j-th member of the i-th group of the k-th generation, w is the weight that determines the moving direction, r is a random number between 0 and 1, C i is the centroid of the i-th group, G best is the global optimal value, μ is a random number obeying μ~(0, s 2 ), and s is the step size.
上述计算过程用于成员更新,包括正常更新和突变。更新过程根据每组最 优解和全局最优解进行计算。突变可以保证种群的多样性,从而避免陷入局部 最优。The above calculation process is used for member update, including normal update and mutation. The update process is calculated based on each set of optimal solutions and the global optimal solution. Mutation can ensure the diversity of the population, thus avoiding falling into a local optimum.
进一步地,所述步骤B2中每一组的质心与其他组的质心进行通信包括质 心的移动,其质心移动采用Lévy随机游走,该Lévy随机游走的公式如下:Further, the communication between the centroid of each group and the centroids of other groups in the step B2 includes the movement of the centroid, and the movement of the centroid adopts a Lévy random walk, and the formula of the Lévy random walk is as follows:
其中,α为其步长,表示第k代第i组的质心,表示一种entry-wise乘法, Lévy(λ)是一个服从Lévy分布的随机数。where α is the step size, represents the centroid of the i-th group of the k-th generation, Representing an entry-wise multiplication, Lévy(λ) is a random number that obeys the Lévy distribution.
Lévy随机游走适用于上述计算过程,该计算过程提供了种群多样性和扩 大搜索范围的能力,Lévy游走的数更新在这个过程中,各组的质心与其他组 的质心进行通信,进而提高了通信效率,以此节省通讯成本。The Lévy random walk is suitable for the above calculation process, which provides population diversity and the ability to expand the search range. The number of Lévy walks is updated. During this process, the centroids of each group communicate with the centroids of other groups, thereby improving the It improves communication efficiency and saves communication costs.
进一步地,所述步骤B3还包括随机选择交叉算子和偏置随机游走算子, 并以如下公式进行选择:Further, the step B3 also includes randomly selecting a crossover operator and a biased random walk operator, and selecting them according to the following formula:
随机选择算子, random selection operator,
偏置随机游走算子,Biased random walk operator,
其中,r是随机数生成器,Cr是交叉概率。where r is the random number generator and Cr is the crossover probability.
在上述过程中,每个成员以随机概率与种群内的其他成员进行交叉。该交 叉动作是一种高效的全局优化方式,旨在快捷地解决陷入局部最优的问题。In the above process, each member crosses with other members of the population with random probability. This crossover action is an efficient global optimization method that aims to quickly solve problems stuck in local optima.
进一步地,所述步骤C将MSE作为BP算法中的学习误差函数,其计算 公式为: Further, described step C uses MSE as the learning error function in the BP algorithm, and its calculation formula is:
DGO算法中的适应度计算公式为: The fitness calculation formula in the DGO algorithm is:
其中,表示第i个训练样本的第j个输出值,yi,j表示第i个训练样本的第j个 期望输出值,n是训练样本的个数,m是输出维数。in, represents the jth output value of the ith training sample, y i, j represents the jth expected output value of the ith training sample, n is the number of training samples, and m is the output dimension.
在上述训练过程中,将BP和DGO算法结合起来作为前馈神经网络的训 练方法。将DGO算法添加到传统的BP神经网络中进行优化训练,由于BP神 经网络存在对初始值敏感和容易陷入局部最优的缺陷,而DGO算法的高度随 机性和多样性能够较好地弥补所述缺陷。前馈神经网络包括各个神经元,其每 一个神经元分层排列,神经元之间具有上下层的连接关系,而本实施例中前馈 神经网络的优化主要是找到一组各层连接的权重和偏置,以此丰富训练参数, 进而提高模型预测的结果的准确率。其中,预测结果的准确性又与学习损失相 关,故此,通过设定学习损失函数,以最小化损失函数进行数据目标的优化。 其中,前馈神经网络中的权重和偏置均转化为一个向量,每个向量表示一个个 体的位置。In the above training process, the BP and DGO algorithms are combined as the training method of the feedforward neural network. The DGO algorithm is added to the traditional BP neural network for optimization training. Since the BP neural network has the defects of being sensitive to the initial value and easy to fall into the local optimum, the high randomness and diversity of the DGO algorithm can better make up for the above. defect. The feedforward neural network includes various neurons, and each neuron is arranged in layers, and the neurons have a connection relationship between the upper and lower layers, and the optimization of the feedforward neural network in this embodiment is mainly to find a set of weights for each layer connection and bias to enrich the training parameters, thereby improving the accuracy of the results predicted by the model. Among them, the accuracy of the prediction results is related to the learning loss. Therefore, by setting the learning loss function, the data target is optimized to minimize the loss function. Among them, the weights and biases in the feedforward neural network are converted into a vector, and each vector represents the position of an individual.
具体地,在本实施例中通过4个不同的训练模型进行比对,其中,DGBPNN 模型为本申请所提供的训练模型。同时包括2种评价技术效果的指标,如下:Specifically, in this embodiment, four different training models are used for comparison, wherein the DGBPNN model is the training model provided by this application. At the same time, it includes two indicators to evaluate the technical effect, as follows:
1)定量的统计性能:1) Quantitative statistical performance:
A.LF:20次独立运行的损失函数(LF)的最小值和平均值,其计算公式 是为所述适应度计算公式. A. LF: the minimum and average value of the loss function (LF) for 20 independent runs, which is calculated as the fitness formula.
B.CA:20次独立运行的健康状况分类准确率的最大值和平均值,其计算 公式为 B.CA: The maximum and average value of the health status classification accuracy for 20 independent runs, calculated as
其中是分类正确的样本数,是用于分类的总样本数。where is the number of correctly classified samples and is the total number of samples used for classification.
如图4及5所示,从这两个表中可以看出,DGBPNN在拟合和泛化能力 方面的表现优于其他三种算法。损失函数的平均值和最小值均是三种算法中最 小的,在分类准确率方面,平均值和最佳值也都是最好的。As shown in Figures 4 and 5, it can be seen from these two tables that DGBPNN outperforms the other three algorithms in terms of fitting and generalization ability. The average and minimum values of the loss function are the smallest among the three algorithms, and in terms of classification accuracy, the average and optimal values are also the best.
2)反映适应度如何随迭代次数变化的适应度收敛曲线:如图6所示,相比之 下,DGBPNN的收敛速度是最快,它的收敛速度与DGEFNN接近。虽然两种 算法的收敛速度相近,但DGBPNN的适应度更小。在曲线位置上,DGBPNN 的曲线接近PSBNN,但收敛速度又比PSBNN快。由此结果表明DGBPNN在 解决红树林生态系统健康的预测训练模型上具有最佳性能。2) The fitness convergence curve that reflects how the fitness changes with the number of iterations: as shown in Figure 6, in contrast, the convergence speed of DGBPNN is the fastest, and its convergence speed is close to that of DGEFNN. Although the convergence speed of the two algorithms is similar, the fitness of DGBPNN is smaller. In the curve position, the curve of DGBPNN is close to PSBNN, but the convergence speed is faster than PSBNN. These results show that DGBPNN has the best performance in solving the prediction training model of mangrove ecosystem health.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910612175.0A CN110334869A (en) | 2019-08-15 | 2019-08-15 | A training method for mangrove ecological health prediction based on dynamic group optimization algorithm |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910612175.0A CN110334869A (en) | 2019-08-15 | 2019-08-15 | A training method for mangrove ecological health prediction based on dynamic group optimization algorithm |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN110334869A true CN110334869A (en) | 2019-10-15 |
Family
ID=68143300
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910612175.0A Pending CN110334869A (en) | 2019-08-15 | 2019-08-15 | A training method for mangrove ecological health prediction based on dynamic group optimization algorithm |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN110334869A (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113571134A (en) * | 2021-07-28 | 2021-10-29 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Method and device for selecting gene data characteristics based on backbone particle swarm optimization |
| CN114638311A (en) * | 2022-03-22 | 2022-06-17 | 韶关学院 | A Parallel Support Vector Machine Optimization Method Based on Relative Entropy and Cosine Similarity |
| CN116680637A (en) * | 2023-08-02 | 2023-09-01 | 北京世纪慈海科技有限公司 | Construction method and device of sensing data analysis model of community-built elderly people |
| CN118823589A (en) * | 2024-09-18 | 2024-10-22 | 北京林业大学 | A method for monitoring the health of artificial forests |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0379436A1 (en) * | 1989-01-20 | 1990-07-25 | CNRS, Centre National de la Recherche Scientifique | Apparatus and process for request arbitration and resolution of conflicts linked with the access to memories with independent banks for computing machines |
| US20050031237A1 (en) * | 2003-06-23 | 2005-02-10 | Masato Gomyo | Hydrodynamic bearing device and a recording disk drive equipped with it |
| CN1860211A (en) * | 2003-08-01 | 2006-11-08 | 巴斯福植物科学有限公司 | Method for the production of multiply-unsaturated fatty acids in transgenic organisms |
| JP2008206421A (en) * | 2007-02-23 | 2008-09-11 | Kansai Electric Power Co Inc:The | Mangrove growth forecasting system, mangrove afforestation right land judging system, mangrove growth forecasting method, and mangrove afforestation right land judging method |
| WO2013139889A1 (en) * | 2012-03-23 | 2013-09-26 | Omya Development Ag | Preparation of pigments |
| US20140222398A1 (en) * | 2011-06-14 | 2014-08-07 | Florida State University Research Foundation, Inc. | Methods and apparatus for double-integration orthogonal space tempering |
| WO2016094338A1 (en) * | 2014-12-09 | 2016-06-16 | Schlumberger Canada Limited | Sustainability screeing tool with gaussian plume model screeing module |
| CN106446765A (en) * | 2016-07-26 | 2017-02-22 | 重庆大学 | Health state evaluation system based on multidimensional physiological big data depth learning |
| CN107817466A (en) * | 2017-06-19 | 2018-03-20 | 重庆大学 | Based on the indoor orientation method for stacking limited Boltzmann machine and random forests algorithm |
| CN109726705A (en) * | 2019-01-24 | 2019-05-07 | 中国科学院地理科学与资源研究所 | Mangrove information extraction method, device and electronic equipment |
-
2019
- 2019-08-15 CN CN201910612175.0A patent/CN110334869A/en active Pending
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0379436A1 (en) * | 1989-01-20 | 1990-07-25 | CNRS, Centre National de la Recherche Scientifique | Apparatus and process for request arbitration and resolution of conflicts linked with the access to memories with independent banks for computing machines |
| US20050031237A1 (en) * | 2003-06-23 | 2005-02-10 | Masato Gomyo | Hydrodynamic bearing device and a recording disk drive equipped with it |
| CN1860211A (en) * | 2003-08-01 | 2006-11-08 | 巴斯福植物科学有限公司 | Method for the production of multiply-unsaturated fatty acids in transgenic organisms |
| JP2008206421A (en) * | 2007-02-23 | 2008-09-11 | Kansai Electric Power Co Inc:The | Mangrove growth forecasting system, mangrove afforestation right land judging system, mangrove growth forecasting method, and mangrove afforestation right land judging method |
| US20140222398A1 (en) * | 2011-06-14 | 2014-08-07 | Florida State University Research Foundation, Inc. | Methods and apparatus for double-integration orthogonal space tempering |
| WO2013139889A1 (en) * | 2012-03-23 | 2013-09-26 | Omya Development Ag | Preparation of pigments |
| WO2016094338A1 (en) * | 2014-12-09 | 2016-06-16 | Schlumberger Canada Limited | Sustainability screeing tool with gaussian plume model screeing module |
| CN106446765A (en) * | 2016-07-26 | 2017-02-22 | 重庆大学 | Health state evaluation system based on multidimensional physiological big data depth learning |
| CN107817466A (en) * | 2017-06-19 | 2018-03-20 | 重庆大学 | Based on the indoor orientation method for stacking limited Boltzmann machine and random forests algorithm |
| CN109726705A (en) * | 2019-01-24 | 2019-05-07 | 中国科学院地理科学与资源研究所 | Mangrove information extraction method, device and electronic equipment |
Non-Patent Citations (4)
| Title |
|---|
| LEI ZHANG,等: "Estiination of Flow Patterns by Applying Artificial Neural Networks", 《IEEE》 * |
| 张汝波,等: "计算智能基础", 哈尔滨工程出版社 * |
| 杨绮雯,等: "基于改进PSR和灰色预测的湛江红树林生态系统健康评价", 《南方农业》 * |
| 邹庆红,等: "Study of Dynamic Group Evolution for Health Prediction of Mangrove Ecosystem", 《2019 8TH INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI)》 * |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113571134A (en) * | 2021-07-28 | 2021-10-29 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Method and device for selecting gene data characteristics based on backbone particle swarm optimization |
| CN114638311A (en) * | 2022-03-22 | 2022-06-17 | 韶关学院 | A Parallel Support Vector Machine Optimization Method Based on Relative Entropy and Cosine Similarity |
| CN116680637A (en) * | 2023-08-02 | 2023-09-01 | 北京世纪慈海科技有限公司 | Construction method and device of sensing data analysis model of community-built elderly people |
| CN116680637B (en) * | 2023-08-02 | 2023-11-03 | 北京世纪慈海科技有限公司 | Methods and devices for constructing sensor data analysis models for elderly people living in communities |
| CN118823589A (en) * | 2024-09-18 | 2024-10-22 | 北京林业大学 | A method for monitoring the health of artificial forests |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110597240B (en) | A fault diagnosis method for hydro-generator units based on deep learning | |
| Zhu et al. | Coke price prediction approach based on dense GRU and opposition-based learning salp swarm algorithm | |
| CN110334869A (en) | A training method for mangrove ecological health prediction based on dynamic group optimization algorithm | |
| CN116542382A (en) | Sewage treatment dissolved oxygen concentration prediction method based on mixed optimization algorithm | |
| CN102496077B (en) | Harmful disaster prediction system and method | |
| CN111371607A (en) | Network traffic prediction method for optimizing LSTM based on decision-making graying algorithm | |
| CN106503802A (en) | A kind of method of utilization genetic algorithm optimization BP neural network system | |
| Gill et al. | Training back propagation neural networks with genetic algorithm for weather forecasting | |
| CN109214503B (en) | Power transmission and transformation project cost prediction method based on KPCA-LA-RBM | |
| Emami | Seasons optimization algorithm | |
| CN117454124A (en) | Ship motion prediction method and system based on deep learning | |
| Beiranvand et al. | A systematic review of optimization of dams reservoir operation using the meta-heuristic algorithms | |
| CN114154401A (en) | Soil erosion modulus calculation method and system based on machine learning and observation data | |
| CN118536391A (en) | Intelligent tool wear state monitoring method and system based on improved dung beetle algorithm | |
| CN111144666A (en) | An ocean thermocline prediction method based on deep spatiotemporal residual network | |
| CN116468181A (en) | Improved whale-based optimization method | |
| CN117114915A (en) | Aquaculture PH value prediction method based on improved particle swarm optimization | |
| Zhang et al. | Support vector machine weather prediction technology based on the improved quantum optimization algorithm | |
| CN115169215A (en) | Multi-objective optimization method and system considering nitrate pollution and seawater intrusion process | |
| Ma et al. | Optimized deep extreme learning machine for traffic prediction and autonomous vehicle lane change decision-making | |
| CN113762591B (en) | Short-term electric quantity prediction method and system based on GRU and multi-core SVM countermeasure learning | |
| Sokół et al. | Application of the Hierarchic Memetic Strategy HMS in Neuroevolution | |
| Guo et al. | Improved CS algorithm and its application in parking space prediction | |
| CN113487009A (en) | RESN-SOL neural network-based classification method | |
| Bi et al. | Task pre-assignment method for uav swarm based on generative adversarial network |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191015 |