CN111832828B - Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite - Google Patents
Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite Download PDFInfo
- Publication number
- CN111832828B CN111832828B CN202010693436.9A CN202010693436A CN111832828B CN 111832828 B CN111832828 B CN 111832828B CN 202010693436 A CN202010693436 A CN 202010693436A CN 111832828 B CN111832828 B CN 111832828B
- Authority
- CN
- China
- Prior art keywords
- data
- satellite
- precipitation
- model
- gbdt
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/12—Computing arrangements based on biological models using genetic models
- G06N3/126—Evolutionary algorithms, e.g. genetic algorithms or genetic programming
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Economics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Biology (AREA)
- Operations Research (AREA)
- Biomedical Technology (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Development Economics (AREA)
- Physiology (AREA)
- Genetics & Genomics (AREA)
- Artificial Intelligence (AREA)
- General Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
技术领域Technical Field
本发明属于气象预测的技术领域,尤其是涉及一种基于风云四号气象卫星的智能降水预测方法。The present invention belongs to the technical field of meteorological forecasting, and in particular relates to an intelligent precipitation forecasting method based on the Fengyun-4 meteorological satellite.
背景技术Background Art
风云四号卫星成像仪观测时,要兼顾云导风制作、飞轮卸载以及扫描区域频率最大化,因此观测的时间范围有5分钟和15分钟两种,观测的空间范围有全圆盘和中国区域两种,且每天都要发生多次变化,还有因为飞轮卸载空缺的时次,因此,为了最大限度的使用风云四号卫星的数据,必须对风云四号卫星的观测的结果数据进行时间和空间的连续性整合。When the FY-4 satellite imager observes, it must take into account cloud guide wind production, flywheel unloading and maximizing the scanning area frequency. Therefore, the observation time range is 5 minutes and 15 minutes, and the observation space range is the entire disk and the Chinese region. It changes many times every day, and there are also vacancies due to flywheel unloading. Therefore, in order to maximize the use of FY-4 satellite data, the observation results of the FY-4 satellite must be continuously integrated in time and space.
而现有的基于风云四号气象卫星的降水预测方法,其数学模型不能够将风云四号卫星的观测的结果数据进行时间和空间的连续性整合,导致利用风云四号卫星的观测的结果数据进行训练得到的数学模型鲁棒性差,并且预测的结果准确性差,可信度低。并且现有的关于降水量预测数学模型通常采用单一的算法,不能够将多种有效的算法进行融合和优化,导致数学模型本身处理数据的能力不足,并且对于具体涉及降水量的预测,适应性差,预测结果不准确,稳定性不足。However, the mathematical model of the existing precipitation prediction method based on the Fengyun-4 meteorological satellite cannot integrate the observation result data of the Fengyun-4 satellite in time and space, resulting in poor robustness of the mathematical model trained with the observation result data of the Fengyun-4 satellite, poor accuracy of the prediction results, and low credibility. In addition, the existing mathematical model for precipitation prediction usually adopts a single algorithm and cannot integrate and optimize multiple effective algorithms, resulting in insufficient data processing ability of the mathematical model itself, and poor adaptability, inaccurate prediction results, and insufficient stability for specific precipitation predictions.
发明内容Summary of the invention
本发明的目的在于解决现有技术中存在的上述技术问题,并提供一种基于风云四号气象卫星的智能降水预测方法,其利用风云四号卫星的观测数据完成中国区域内的降水量的预测。The purpose of the present invention is to solve the above-mentioned technical problems existing in the prior art and to provide an intelligent precipitation prediction method based on the Fengyun-4 meteorological satellite, which uses the observation data of the Fengyun-4 satellite to complete the prediction of precipitation in the Chinese region.
本发明的具体技术方案如下:The specific technical solutions of the present invention are as follows:
步骤一,卫星观测数据时间整合:Step 1: Time integration of satellite observation data:
将连续3个5分钟时间段内,卫星的各个通道对中国区域常规观测数据的均值分别整合为一个15分钟数据,将15分钟设为一个时次,即新生成的15分钟中国区域观测数据的各通道值为:The average values of the satellite's conventional observation data for China in each channel in three consecutive 5-minute time periods are integrated into a 15-minute data, and 15 minutes is set as a time period. That is, the values of each channel of the newly generated 15-minute observation data for China are:
其中i为风云四号的通道号,取值范围为1到14,j为5分钟数据的序号,n为3,即要平均的5分钟数据的数量。Where i is the channel number of FY-4, ranging from 1 to 14, j is the sequence number of the 5-minute data, and n is 3, which is the number of 5-minute data to be averaged.
对于17:15因卫星下点午夜进行飞轮卸载停止观测缺失的数据,用前后两个时次17:00、17:30的数据均值代替,由于17:00的数据是全圆盘图观测,17:30的是中国区域观测,范围不一样,需要整合到中国区域观测大小再进行均值计算;For the missing data at 17:15 due to the stop of observation for flywheel unloading at midnight, the data at the satellite descent point is replaced by the average of the data at 17:00 and 17:30. Since the data at 17:00 is the full disk map observation and the data at 17:30 is the China regional observation, the ranges are different and need to be integrated into the China regional observation size before the average calculation;
步骤二,卫星扫描数据空间整合:Step 2: Spatial integration of satellite scanning data:
将时间整合好后的全圆盘常规观测数据进行裁切,得到和中国区域常规观测相同的区域,从而使所有的数据都有相同的范围和时间频次,卫星数据的时间和空间整合完毕,整合后的卫星数据一小时4次,范围是中国区域常规观测范围;The full-disk conventional observation data after time integration is cut to obtain the same area as the conventional observation in the Chinese region, so that all data have the same range and time frequency. The time and space integration of satellite data is completed. The integrated satellite data is 4 times an hour, and the range is the conventional observation range of the Chinese region;
步骤三,风云四号卫星降水预测通道选择Step 3: Fengyun-4 satellite precipitation prediction channel selection
风云四号卫星一共有14个通道,做卫星预测降水,会根据物理机理选择一个红外通道加一个水汽通道进行计算,选择更多的通道时,无法分析通道之间的关系以及对降水预测的作用,但是每个通道都包含对降水有贡献的信息,选择的通道少就会造成信息丢失,从而无法提高降水的精度,为了充分地挖掘通道内的信息,在利用风云四号卫星数据时,采用列表中7-14共8个卫星通道;Fengyun-4 satellite has a total of 14 channels. When satellites predict precipitation, they will select an infrared channel and a water vapor channel for calculation according to the physical mechanism. When more channels are selected, it is impossible to analyze the relationship between the channels and their role in precipitation prediction. However, each channel contains information that contributes to precipitation. If fewer channels are selected, information will be lost, and thus the accuracy of precipitation cannot be improved. In order to fully mine the information in the channels, when using Fengyun-4 satellite data, 8 satellite channels from 7 to 14 in the list are used;
步骤四,遗传算法的设置及参数配置Step 4: Genetic algorithm settings and parameter configuration
利用遗传算法与GBDT(Gradient Boosting Decision Tree)算法相结合的数学模型,对降水量进行预测;利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的初始值;The mathematical model combining genetic algorithm and GBDT (Gradient Boosting Decision Tree) algorithm is used to predict precipitation. The initial values of the parameters of the GBDT algorithm model are optimized using the genetic algorithm to obtain the optimal initial values of the GBDT algorithm model.
其中,遗传算法的具体设计为:Among them, the specific design of the genetic algorithm is:
1、种群初始化与染色体编码1. Population initialization and chromosome encoding
个体编码方法为实数编码,每个个体均为一个实数串,个体包含了GBDT算法模型的全部参数;初始种群的数量为:20-100;The individual encoding method is real number encoding. Each individual is a real number string. The individual contains all the parameters of the GBDT algorithm model. The number of initial populations is: 20-100;
2、确定目标函数和适应度函数2. Determine the objective function and fitness function
根据个体得到的GBDT算法模型的全部参数,用训练数据训练GBDT算法模型后预测系统输出,把预测得到期望输出与实际输出之间的误差绝对值作为个体适应度值F,其计算公式为:According to all the parameters of the GBDT algorithm model obtained by the individual, the system output is predicted after the GBDT algorithm model is trained with the training data. The absolute value of the error between the expected output and the actual output is taken as the individual fitness value F, and its calculation formula is:
式中,n为网络输出节点数;yi为GBDT算法模型第i颗回归树的叶子节点的期望输出,oi为第i颗回归树的叶子节点的实际输出;k为系数;Where n is the number of network output nodes; yi is the expected output of the leaf node of the i-th regression tree of the GBDT algorithm model; oi is the actual output of the leaf node of the i-th regression tree; k is the coefficient;
3、选择操作3. Select an operation
选择操作设置为比例选择法,即基于适应度比例的选择策略,每个个体i的选择概率pi为: The selection operation is set as the proportional selection method, that is, the selection strategy based on the fitness ratio, and the selection probability pi of each individual i is:
式中,Fi为个体i的适应度值;N为种群个体数目;In the formula, Fi is the fitness value of individual i; N is the number of individuals in the population;
4、交叉操作4. Crossover Operation
由于个体采用实数编码,所以交叉操作方法采用实数交叉法,第k个染色体ak和第l个染色体al在j个基因的交叉操作方法如下:Since individuals are coded with real numbers, the crossover operation method uses the real number crossover method. The crossover operation method of the kth chromosome ak and the lth chromosome a l in the jth gene is as follows:
akj=akj(1-b)+aljba kj = a kj (1-b) + a lj b
alj=alj(1-b)+akjba lj = a lj (1-b) + a kj b
式中,系数b是[0,1]之间的随机数;交叉概率设置为0.4-0.99;Where, coefficient b is a random number between [0,1]; the crossover probability is set to 0.4-0.99;
5、变异操作5. Mutation Operation
选取第i个染色体个体的第j个基因aij进行变异操作,方法如下:Select the jth gene aij of the i-th chromosome individual for mutation operation, the method is as follows:
式中,amax为基因aij的上界;amin为基因aij的下界;g为当前迭代次数,遗传算法的终止进化代数为100-500;Gmax是最大的进化次数;r为[0,1]间的随机数;变异概率为0.0001-0.1;In the formula, a max is the upper bound of gene a ij ; a min is the lower bound of gene a ij ; g is the current iteration number, the termination evolution number of the genetic algorithm is 100-500; G max is the maximum evolution number; r is a random number between [0,1]; the mutation probability is 0.0001-0.1;
步骤五,GBDT算法的设置及计算过程Step 5: Setting and calculation process of GBDT algorithm
在步骤四中,利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数的初始值;在步骤五中,再将整合后的卫星观测点扫描的历史数据,其可以是经纬度信息、当天的年序列号以及该点的海拔高度代入GBDT算法模型,得到上述卫星观测点对应的预测的降水量;将该预测的降水量与卫星观测点扫描的历史数据中的实际降水量进行比较得到残差,进一步完成对GBDT算法模型进行训练;通过训练得到GBDT算法模型的最优参数选择,再将卫星实时扫描的观测点实时数据如经纬度信息、当天的年序列号以及该点的海拔高度代入训练后的GBDT算法模型,进而得到该卫星观测点的对应的降水量预测值;In step 4, the initial values of the parameters of the GBDT algorithm model are optimized by using a genetic algorithm to obtain the optimal initial values of the parameters of the GBDT algorithm model; in step 5, the historical data scanned by the integrated satellite observation point, which may be the latitude and longitude information, the annual serial number of the day and the altitude of the point, are substituted into the GBDT algorithm model to obtain the predicted precipitation corresponding to the above-mentioned satellite observation point; the predicted precipitation is compared with the actual precipitation in the historical data scanned by the satellite observation point to obtain the residual, and the GBDT algorithm model is further trained; the optimal parameter selection of the GBDT algorithm model is obtained through training, and then the real-time data of the observation point scanned by the satellite in real time, such as the longitude and latitude information, the annual serial number of the day and the altitude of the point, are substituted into the trained GBDT algorithm model to obtain the corresponding precipitation prediction value of the satellite observation point;
(1)GBDT算法设置为:(1) The GBDT algorithm is set as:
①损失函数的选择① Choice of loss function
对于GBDT算法预测降水量,就是要找到合适的GBDT算法的参数,衡量是不是GBDT算法的核实的参数,通过损失函数进行确定,损失函数:For the GBDT algorithm to predict precipitation, it is necessary to find the appropriate parameters of the GBDT algorithm and measure whether they are verified parameters of the GBDT algorithm. The loss function is used to determine the parameters. The loss function is:
L(y,F)=|y-F|,其中y是实际值,F(x)是预测值,为胡伯损失获得各回归树的最合适的参数就等价于使损失函数L最小化;L(y,F)=|yF|, Where y is the actual value and F(x) is the predicted value. Huber's loss Obtaining the most appropriate parameters for each regression tree is equivalent to minimizing the loss function L;
②目标函数②Objective function
目标函数就是获得各回归树的最合适的参数,即第t颗回归树的分配比例ρt以及第t颗回归树的参数:其中ρ为各回归树分配比例,即该树的学习率或者步长,θ是回归树的参数;The objective function is to obtain the most appropriate parameters for each regression tree, that is, the allocation ratio ρt of the tth regression tree and the parameters of the tth regression tree: Where ρ is the allocation ratio of each regression tree, that is, the learning rate or step size of the tree, and θ is the parameter of the regression tree;
其中参数迭代 The parameter iteration
其中选择步长 The step size is selected
其中Ex,y为每颗回归树迭代计算的错误率,x表述输入的数据,f(xi)表示第i次迭代的预测值,h表示求二阶导数;h(x,θ)表示回归树,h(xi,θt)则表示第t棵回归树的第i次迭代;Where Ex ,y is the error rate of each regression tree iteration, x represents the input data, f( xi ) represents the predicted value of the i-th iteration, and h represents the second-order derivative. h(x,θ) represents the regression tree, and h( xi , θt ) represents the i-th iteration of the t-th regression tree.
③残差的计算:③ Calculation of residuals:
GBDT算法在对回归树进行训练时,需要以残差作为目标值,训练下一个回归树,直到建立的回归树的数量达到要求并且残差rit达到预期范围时停止训练,rit也就表示的是第i次迭代的弱学习器建立的方向, When the GBDT algorithm trains the regression tree, it needs to use the residual as the target value to train the next regression tree until the number of established regression trees reaches the requirement and the residual rit reaches the expected range. The training is stopped. rit also represents the direction of the weak learner established in the i-th iteration.
(2)GBDT算法的迭代流程为:(2) The iterative process of the GBDT algorithm is:
GBDT每棵决策树训练的是前面决策树分类结果中的错误。每一次的计算都是为了减少上一次的残差,为了消除残差GBDT进行多次迭代,每次迭代都在残差减少的梯度方向上建立一个新的模型,具体迭代流程为:Each decision tree of GBDT is trained based on the errors in the classification results of the previous decision tree. Each calculation is to reduce the residual of the previous one. In order to eliminate the residual, GBDT performs multiple iterations. Each iteration builds a new model in the gradient direction of residual reduction. The specific iteration process is as follows:
①初始化第一个方程并且GBDT算法模型的参数的初始值θ0是利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数初始值;① Initialize the first equation And the initial value θ 0 of the parameter of the GBDT algorithm model is optimized by using a genetic algorithm to obtain the optimal initial value of the parameter of the GBDT algorithm model;
②每次迭代都要计算残差rit也就表示的是第i次迭代的弱学习器建立的方向,以训练下一个回归树;②The residual is calculated at each iteration R it also represents the direction of the weak learner established in the i-th iteration to train the next regression tree;
③以残差为目标构建回归方程{(xi,rit)}i=1,...,n,其中xi为导入的数据,其可以为该卫星观测点的经纬度信息、当天的年序列号以及该点的海拔高度;③ Construct a regression equation {( xi , rit )} i=1,...,n with the residual as the target, wherexi is the imported data, which can be the latitude and longitude information of the satellite observation point, the year serial number of the day, and the altitude of the point;
④找到当前最合适的参数选择: ④Find the most suitable parameter selection at present:
⑤经过M次迭代后得到最后的预测结果:yi为上述卫星观测点对应得到的预测的降水量。⑤After M iterations, the final prediction result is obtained: yi is the predicted precipitation corresponding to the above satellite observation point.
训练样本设置:传统的降水模型全国用一个模板,既无法处理不同地区的不同海拔造成的降水差异,也无法处理不同经纬度位置造成的降水差异,还不能处理不同季节造成的降水差异。本发明将降水实况产品和卫星分辨率匹配到5公里以后,为了使预测更加稳定,创新性的采用以观测点为中心,周围3×3的9个点的数据并加上经纬度信息、当天的年序列号以及该点的海拔高度作为一个样本的输入。该特征的输入创新性的同时考虑了时间,位置和海拔,较以往的全国降水用统一模型有了很大的改进。对于一个输入训练样本,共包含292个特征,Training sample settings: The traditional precipitation model uses one template for the whole country, which can neither handle the precipitation differences caused by different altitudes in different regions, nor the precipitation differences caused by different longitudes and latitudes, nor the precipitation differences caused by different seasons. The present invention matches the actual precipitation product and the satellite resolution to after 5 kilometers. In order to make the prediction more stable, the data of 9 points of 3×3 around the observation point is innovatively used, and the longitude and latitude information, the year serial number of the day and the altitude of the point are added as the input of a sample. The input of this feature innovatively takes into account time, location and altitude at the same time, which is a great improvement over the previous unified model for precipitation across the country. For an input training sample, a total of 292 features are included,
训练过程:为了加快训练时间,同时减少预测时间,按区域训练多个模型。中国区域观测,范围为北纬3°-55°,东经60°-137°。对每个4°×4°的范围内的数据训练一个模型,这样在训练和预测的时候都可以并行计算,提高了效率,并且大大缩短的预测时间。纬度方向跨度为55°-3°=52°,共需要13个区域模型,经度方向跨度为137°-60°=77°,共需要20个模型,其中第20个模型只需要1°经度×4°纬度的数据。也就是总共需要训练20×13=260个区域的模型。对于每个模型,输入2019年的数据进行训练,卫星分辨率是0.04°,那么每一个标准区域内,有100×100个点,再乘以365天,共365万个训练数据进行训练Training process: In order to speed up the training time and reduce the prediction time, multiple models are trained by region. The China regional observation range is 3°-55° north latitude and 60°-137° east longitude. A model is trained for each data within the range of 4°×4°, so that parallel calculations can be performed during training and prediction, which improves efficiency and greatly shortens the prediction time. The span in the latitude direction is 55°-3°=52°, and a total of 13 regional models are required. The span in the longitude direction is 137°-60°=77°, and a total of 20 models are required. The 20th model only requires 1° longitude × 4° latitude data. In other words, a total of 20×13=260 regional models need to be trained. For each model, input the data from 2019 for training. The satellite resolution is 0.04°, so there are 100×100 points in each standard area, multiplied by 365 days, for a total of 3.65 million training data for training.
预测设置:对于训练集以外的卫星数据,当卫星获取到一个小时内的4个时次数据后,进行4°×4°的拆分。每一个区域分别分给对应的模型进行预测,最后再将预测结果拼接起来,形成中国区域的降水分布Forecast settings: For satellite data outside the training set, when the satellite obtains 4-hour data within an hour, it is split into 4°×4°. Each area is assigned to the corresponding model for prediction, and finally the prediction results are spliced together to form the precipitation distribution in China.
本发明的基于风云四号卫星的降水量的预测方法取得了以下技术效果:The precipitation prediction method based on the Fengyun-4 satellite of the present invention has achieved the following technical effects:
1、将风云四号卫星的观测的结果数据进行时间和空间的连续性整合,最大限度的使用风云四号卫星的数据,保证了样本数据的充足和真实,将整合后的数据对降水量预测模型进行训练,提高所述降水预测方法的结果的准确性和可信度。1. The observation result data of the Fengyun-4 satellite are continuously integrated in time and space, and the data of the Fengyun-4 satellite are used to the maximum extent to ensure the sufficiency and authenticity of the sample data. The integrated data are used to train the precipitation prediction model to improve the accuracy and credibility of the results of the precipitation prediction method.
针对风云四号卫星的观测的结果数据特点,涉及了降水量预测的数学模型,将遗传算法与GBDT算法模型进行深度融合和优化,在整合后的卫星观测数据对GBDT算法模型进行训练之前,利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数初始值,在该参数初始值的基础上进一步完成GBDT算法的模型的迭代计算,相比于现有技术中GBDT算法模型参数初始值的随机生成,提高了数学模型的数据处理能力,并提高了降水量预测结果的准确度和预测计算的稳定性。In view of the data characteristics of the observation results of the Fengyun-4 satellite, a mathematical model for precipitation prediction is involved. The genetic algorithm and the GBDT algorithm model are deeply integrated and optimized. Before the integrated satellite observation data are used to train the GBDT algorithm model, the initial values of the parameters of the GBDT algorithm model are optimized using the genetic algorithm to obtain the optimal initial values of the parameters of the GBDT algorithm model. Based on the initial values of the parameters, the iterative calculation of the GBDT algorithm model is further completed. Compared with the random generation of the initial values of the GBDT algorithm model parameters in the prior art, the data processing capability of the mathematical model is improved, and the accuracy of the precipitation prediction results and the stability of the prediction calculation are improved.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1:GBDT算法流程图。Figure 1: GBDT algorithm flow chart.
具体实施方式DETAILED DESCRIPTION
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solution and advantages of the embodiments of the present invention clearer, the technical solution in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.
风云四号卫星成像仪观测时,要兼顾云导风制作、飞轮卸载以及扫描区域频率最大化,因此观测的时间有5分钟和15分钟两种,观测的空间范围有全圆盘和中国区域两种,且每天都要发生多次变化,还有因为飞轮卸载空缺的时次,为了最大限度的使用风云四号卫星的数据,必须对风云四号卫星的观测结果进行时间和空间的连续性整合。When the FY-4 satellite imager is observing, it must take into account cloud guide wind production, flywheel unloading and maximizing the scanning area frequency. Therefore, the observation time is 5 minutes and 15 minutes, and the spatial range of observation is the entire disk and the Chinese area. It changes many times every day. In addition, due to the vacancies caused by flywheel unloading, in order to maximize the use of FY-4 satellite data, the observation results of the FY-4 satellite must be integrated continuously in time and space.
1.卫星扫描数据时间整合1. Satellite scanning data time integration
风云四号卫星成像仪观测模式如下:成像仪每日获取40幅全圆盘云图,165幅中国区域云图,中国区域观测,范围为北纬3°-55°,东经60°-137°。每小时一次全圆盘观测,时间为整点-整点过15分钟。The observation mode of the FY-4 satellite imager is as follows: the imager obtains 40 full disk cloud images and 165 China regional cloud images every day. The China regional observation range is 3°-55°N and 60°-137°E. Full disk observation is performed once every hour, from the hour to 15 minutes after the hour.
每3小时进行一次连续3幅全圆盘观测,时间为00:00(世界时,下同)/03:00/06:00/09:00/12:00/15:00/18:00/21:00时次全圆盘云图和该时次前后各一幅全圆盘云图,例如:00:00的连续3幅全圆盘观测时间为23:45-23:59:59、00:00-00:14:59、00:15-00:29:59。Three consecutive full disk observations are conducted every three hours, with the full disk cloud images at 00:00 (UTC, the same below)/03:00/06:00/09:00/12:00/15:00/18:00/21:00 and one full disk cloud image before and after the time. For example, the three consecutive full disk observations at 00:00 are at 23:45-23:59:59, 00:00-00:14:59, and 00:15-00:29:59.
无全圆盘观测时进行5分钟中国区域观测。每15分钟观测空隙内进行定位定标观测。星下点午夜进行飞轮卸载停止观测。时间为17:15-17:30。When there is no full disk observation, the observation of China area will be carried out for 5 minutes. Positioning and calibration observation will be carried out in every 15-minute observation interval. The flywheel will be unloaded and observation will be stopped at midnight at the subsatellite point. The time is 17:15-17:30.
由于卫星数据观测时间不规整,不利于后期算法的输入,因此将卫星数据进行整合,将连续3个5分钟中国区域常规观测的所有通道的均值整合为一个15分钟数据。即新生成的15分钟中国区域观测数据的各通道值为:Since the observation time of satellite data is irregular, it is not conducive to the input of the later algorithm. Therefore, the satellite data is integrated, and the average of all channels of three consecutive 5-minute routine observations in the Chinese region are integrated into a 15-minute data. That is, the channel values of the newly generated 15-minute observation data in the Chinese region are:
其中i为风云四号的通道号,取值范围为1到14,j为5分钟数据的序号,n为3,即要平均的5分钟数据的数量。Where i is the channel number of FY-4, ranging from 1 to 14, j is the sequence number of the 5-minute data, and n is 3, which is the number of 5-minute data to be averaged.
对于17:15因为星下点午夜进行飞轮卸载停止观测缺失的数据,用前后两个时次的数据均值代替。由于17:00的数据是全圆盘图观测,17:30的是中国区域观测,范围不一样,需要整合到中国区域观测大小再进行均值计算。For the missing data at 17:15, which was stopped for flywheel unloading at midnight, the average of the two hours before and after was used to replace it. Since the data at 17:00 was for full disk observation and the data at 17:30 was for China observation, the ranges were different and needed to be integrated into the China observation size before the average calculation.
2.卫星扫描数据空间整合2. Spatial integration of satellite scanning data
将时间整合好后的全圆盘常规观测数据进行裁切,得到和中国区域常规观测相同的区域,从而使所有的数据都有相同的范围和时间频次,卫星数据的时间和空间整合完毕,整合后的卫星数据一小时4次,范围是中国区域常规观测范围。The full-disk routine observation data after time integration is cut to obtain the same area as the routine observation in the Chinese region, so that all data have the same range and time frequency. The time and space integration of satellite data is completed. The integrated satellite data is collected 4 times an hour, and the range is the routine observation range of the Chinese region.
3.风云四号卫星降水预测通道选择3. Fengyun-4 satellite precipitation prediction channel selection
风云四号卫星一共有14个通道,如表1所示。一般做卫星预测降水,会根据物理机理选择一个红外通道加一个水汽通道进行计算,因为选择更多的通道时,无法分析通道之间的关系以及对降水预测的作用。但是每个通道都包含对降水有贡献的信息,选择的通道少就会造成信息丢失,从而无法提高降水的精度。本发明利用人工智能算法进行降水预测,可以更好地挖掘通道内的信息,因此在利用风云四号卫星数据时,采用列表中7-14共8个卫星通道。Fengyun-4 satellite has a total of 14 channels, as shown in Table 1. Generally, when satellites predict precipitation, an infrared channel plus a water vapor channel will be selected for calculation according to the physical mechanism, because when more channels are selected, the relationship between the channels and the effect on precipitation prediction cannot be analyzed. However, each channel contains information that contributes to precipitation. If fewer channels are selected, information loss will occur, thereby failing to improve the accuracy of precipitation. The present invention uses artificial intelligence algorithms to predict precipitation, which can better mine the information in the channels. Therefore, when using Fengyun-4 satellite data, 8 satellite channels 7-14 in the list are used.
表1风云四号卫星通道列表Table 1 List of FY-4 satellite channels
4.算法选择及参数配置4. Algorithm selection and parameter configuration
GBDT梯度提升决策树是一种迭代的决策树算法,是基于前向分布算法和加法模型,由多棵决策树组成,通过降低学习过程中所产生的残差来实现分类和回归的一种集成算法,最后把所有树的结论累加起来做决策。树的类型有两种,分别是分类树和回归树,分类树一般用来处理分类问题,回归树一般用来处理预测问题。GBDT是一个泛化能力较强的算法,由于GBDT算法的核心是通过每一棵树拟合前面的树所产生的残差和通过一系列公式计算将所有树的结果累计起来作为最终的预测输出,而分类树不太容易实现上述过程,所以本发明采用GBDT算法中的树为回归树。流程图如附图1所示。GBDT gradient boosting decision tree is an iterative decision tree algorithm, which is based on the forward distribution algorithm and the additive model, and is composed of multiple decision trees. It is an integrated algorithm that realizes classification and regression by reducing the residuals generated in the learning process, and finally adds up the conclusions of all trees to make decisions. There are two types of trees, namely classification trees and regression trees. Classification trees are generally used to deal with classification problems, and regression trees are generally used to deal with prediction problems. GBDT is an algorithm with strong generalization ability. Since the core of the GBDT algorithm is to fit the residuals generated by the previous tree through each tree and accumulate the results of all trees through a series of formula calculations as the final prediction output, and the classification tree is not easy to implement the above process, so the present invention uses the tree in the GBDT algorithm as a regression tree. The flow chart is shown in Figure 1.
利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数的初始值;然后再将整合后的卫星观测点扫描的历史数据如经纬度信息、当天的年序列号以及该点的海拔高度代入GBDT算法模型,得到上述卫星观测点对应的预测的降水量;将该预测的降水量与卫星观测点扫描的历史数据中的实际降水量进行比较得到残差,进一步完成对GBDT算法模型进行训练;通过训练得到GBDT算法模型的最优参数选择,再将卫星实时扫描的观测点实时数据如经纬度信息、当天的年序列号以及该点的海拔高度代入训练后的GBDT算法模型,进而得到该卫星观测点的对应的降水量预测值。The genetic algorithm is used to optimize the initial values of the parameters of the GBDT algorithm model to obtain the optimal initial values of the parameters of the GBDT algorithm model; then the historical data scanned by the integrated satellite observation point, such as the longitude and latitude information, the annual serial number of the day and the altitude of the point, are substituted into the GBDT algorithm model to obtain the predicted precipitation corresponding to the above-mentioned satellite observation point; the predicted precipitation is compared with the actual precipitation in the historical data scanned by the satellite observation point to obtain the residual, and the GBDT algorithm model is further trained; the optimal parameter selection of the GBDT algorithm model is obtained through training, and then the real-time data of the observation point scanned by the satellite in real time, such as the longitude and latitude information, the annual serial number of the day and the altitude of the point, are substituted into the trained GBDT algorithm model to obtain the corresponding precipitation prediction value of the satellite observation point.
其中,遗传算法的设置及参数配置:Among them, the settings and parameter configuration of the genetic algorithm:
利用遗传算法与GBDT算法相结合的数学模型,对降水量进行预测;利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的初始值;The mathematical model combining genetic algorithm and GBDT algorithm is used to predict precipitation; the initial values of the parameters of the GBDT algorithm model are optimized by genetic algorithm to obtain the optimal initial values of the GBDT algorithm model;
其中,遗传算法的具体设计为:Among them, the specific design of the genetic algorithm is:
1、种群初始化与染色体编码1. Population initialization and chromosome encoding
个体编码方法为实数编码,每个个体均为一个实数串,个体包含了GBDT算法模型的全部参数;初始种群的数量为:20-100;The individual encoding method is real number encoding. Each individual is a real number string. The individual contains all the parameters of the GBDT algorithm model. The number of initial populations is: 20-100;
2、确定目标函数和适应度函数2. Determine the objective function and fitness function
根据个体得到的GBDT算法模型的全部参数,用训练数据训练GBDT算法模型后预测系统输出,把预测得到期望输出与实际输出之间的误差绝对值作为个体适应度值F,其计算公式为:According to all the parameters of the GBDT algorithm model obtained by the individual, the system output is predicted after the GBDT algorithm model is trained with the training data. The absolute value of the error between the expected output and the actual output is taken as the individual fitness value F, and its calculation formula is:
式中,n为网络输出节点数;yi为GBDT算法模型第i颗回归树的叶子节点的期望输出,oi为第i颗回归树的叶子节点的实际输出;k为系数;Where n is the number of network output nodes; yi is the expected output of the leaf node of the i-th regression tree of the GBDT algorithm model; oi is the actual output of the leaf node of the i-th regression tree; k is the coefficient;
3、选择操作3. Select an operation
选择操作设置为比例选择法,即基于适应度比例的选择策略,每个个体i的选择概率pi为: The selection operation is set as the proportional selection method, that is, the selection strategy based on the fitness ratio, and the selection probability pi of each individual i is:
式中,Fi为个体i的适应度值;N为种群个体数目;In the formula, Fi is the fitness value of individual i; N is the number of individuals in the population;
4、交叉操作4. Crossover Operation
由于个体采用实数编码,所以交叉操作方法采用实数交叉法,第k个染色体ak和第l个染色体al在j个基因的交叉操作方法如下:Since individuals are coded with real numbers, the crossover operation method uses the real number crossover method. The crossover operation method of the kth chromosome ak and the lth chromosome a l in the jth gene is as follows:
akj=akj(1-b)+aljba kj = a kj (1-b) + a lj b
alj=alj(1-b)+akjba lj = a lj (1-b) + a kj b
式中,系数b是[0,1]之间的随机数;交叉概率设置为0.4-0.99;Where, coefficient b is a random number between [0,1]; the crossover probability is set to 0.4-0.99;
5、变异操作5. Mutation Operation
选取第i个染色体个体的第j个基因aij进行变异操作,方法如下:Select the jth gene aij of the i-th chromosome individual for mutation operation, the method is as follows:
式中,amax为基因aij的上界;amin为基因aij的下界;g为当前迭代次数,遗传算法的终止进化代数为100-500;Gmax是最大的进化次数;r为[0,1]间的随机数;变异概率为0.0001-0.1。In the formula, a max is the upper bound of gene a ij ; a min is the lower bound of gene a ij ; g is the current iteration number, the termination evolutionary number of the genetic algorithm is 100-500; G max is the maximum evolutionary number; r is a random number between [0,1]; and the mutation probability is 0.0001-0.1.
其中,GBDT算法的设置及计算过程:Among them, the setting and calculation process of GBDT algorithm:
在步骤四中,利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数的初始值;在步骤五中,再将整合后的卫星观测点扫描的历史数据,其可以是经纬度信息、当天的年序列号以及该点的海拔高度代入GBDT算法模型,得到上述卫星观测点对应的预测的降水量;将该预测的降水量与卫星观测点扫描的历史数据中的实际降水量进行比较得到残差,进一步完成对GBDT算法模型进行训练;通过训练得到GBDT算法模型的最优参数选择,再将卫星实时扫描的观测点实时数据如经纬度信息、当天的年序列号以及该点的海拔高度代入训练后的GBDT算法模型,进而得到该卫星观测点的对应的降水量预测值;In step 4, the initial values of the parameters of the GBDT algorithm model are optimized by using a genetic algorithm to obtain the optimal initial values of the parameters of the GBDT algorithm model; in step 5, the historical data scanned by the integrated satellite observation point, which may be the latitude and longitude information, the annual serial number of the day and the altitude of the point, are substituted into the GBDT algorithm model to obtain the predicted precipitation corresponding to the above-mentioned satellite observation point; the predicted precipitation is compared with the actual precipitation in the historical data scanned by the satellite observation point to obtain the residual, and the GBDT algorithm model is further trained; the optimal parameter selection of the GBDT algorithm model is obtained through training, and then the real-time data of the observation point scanned by the satellite in real time, such as the longitude and latitude information, the annual serial number of the day and the altitude of the point, are substituted into the trained GBDT algorithm model to obtain the corresponding precipitation prediction value of the satellite observation point;
(1)GBDT算法设置为:(1) The GBDT algorithm is set as:
①损失函数的选择① Choice of loss function
对于GBDT算法预测降水量,就是要找到合适的GBDT算法的参数,衡量是不是GBDT算法的核实的参数,通过损失函数进行确定,损失函数:For the GBDT algorithm to predict precipitation, it is necessary to find the appropriate parameters of the GBDT algorithm and measure whether they are verified parameters of the GBDT algorithm. The loss function is used to determine the parameters. The loss function is:
L(y,F)=|y-F|,其中y是实际值,F(x)是预测值,为胡伯损失获得各回归树的最合适的参数就等价于使损失函数L最小化;L(y,F)=|yF|, Where y is the actual value and F(x) is the predicted value. Huber's loss Obtaining the most appropriate parameters for each regression tree is equivalent to minimizing the loss function L;
②目标函数②Objective function
目标函数就是获得各回归树的最合适的参数,即第t颗回归树的分配比例ρt以及第t颗回归树的参数:其中ρ为各回归树分配比例,即该树的学习率或者步长,θ是回归树的参数;The objective function is to obtain the most appropriate parameters for each regression tree, that is, the allocation ratio ρt of the tth regression tree and the parameters of the tth regression tree: Where ρ is the allocation ratio of each regression tree, that is, the learning rate or step size of the tree, and θ is the parameter of the regression tree;
其中参数迭代 The parameter iteration
其中选择步长 The step size is selected
其中Ex,y为每颗回归树迭代计算的错误率,x表述输入的数据,f(xi)表示第i次迭代的预测值,h表示求二阶导数;h(x,θ)表示回归树,h(xi,θt)则表示第t棵回归树的第i次迭代;Where Ex ,y is the error rate of each regression tree iteration, x represents the input data, f( xi ) represents the predicted value of the i-th iteration, and h represents the second-order derivative. h(x,θ) represents the regression tree, and h( xi , θt ) represents the i-th iteration of the t-th regression tree.
③残差的计算:③ Calculation of residuals:
GBDT算法在对回归树进行训练时,需要以残差作为目标值,训练下一个回归树,直到建立的回归树的数量达到要求并且残差rit达到预期范围时停止训练,rit也就表示的是第i次迭代的弱学习器建立的方向, When the GBDT algorithm trains the regression tree, it needs to use the residual as the target value to train the next regression tree until the number of established regression trees reaches the requirement and the residual rit reaches the expected range. The training is stopped. rit also represents the direction of the weak learner established in the i-th iteration.
(2)GBDT算法的迭代流程为:(2) The iterative process of the GBDT algorithm is:
GBDT每棵决策树训练的是前面决策树分类结果中的错误。每一次的计算都是为了减少上一次的残差,为了消除残差GBDT进行多次迭代,每次迭代都在残差减少的梯度方向上建立一个新的模型,具体迭代流程为:Each decision tree of GBDT is trained based on the errors in the classification results of the previous decision tree. Each calculation is to reduce the residual of the previous one. In order to eliminate the residual, GBDT performs multiple iterations. Each iteration builds a new model in the gradient direction of residual reduction. The specific iteration process is as follows:
①初始化第一个方程并且GBDT算法模型的参数的初始值θ0是利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数初始值;① Initialize the first equation And the initial value θ 0 of the parameter of the GBDT algorithm model is optimized by using a genetic algorithm to obtain the optimal initial value of the parameter of the GBDT algorithm model;
②每次迭代都要计算残差rit也就表示的是第i次迭代的弱学习器建立的方向,以训练下一个回归树;②The residual is calculated at each iteration R it also represents the direction of the weak learner established in the i-th iteration to train the next regression tree;
③以残差为目标构建回归方程{(xi,rit)}i=1,...,n,其中xi为导入的数据,其可以为该卫星观测点的经纬度信息、当天的年序列号以及该点的海拔高度;③ Construct a regression equation {( xi , rit )} i=1,...,n with the residual as the target, wherexi is the imported data, which can be the latitude and longitude information of the satellite observation point, the year serial number of the day, and the altitude of the point;
④找到当前最合适的参数选择: ④Find the most suitable parameter selection at present:
⑤经过M次迭代后得到最后的预测结果:yi为上述卫星观测点对应得到的预测的降水量。⑤After M iterations, the final prediction result is obtained: yi is the predicted precipitation corresponding to the above satellite observation point.
经过训练集的训练学习和测试集的5交叉验证,最终得到GBDT算法模型的最优参数设置如表2所示。在该参数下,2019年降水平均误差为0.0476毫米,远远好于默认参数误差0.327毫米。After training and learning of the training set and 5 cross-validation of the test set, the optimal parameter settings of the GBDT algorithm model are finally obtained as shown in Table 2. Under this parameter, the average precipitation error in 2019 is 0.0476 mm, which is much better than the default parameter error of 0.327 mm.
表2参数表Table 2 Parameters
5.训练样本设置5. Training sample settings
传统的降水模型全国用一个模板,既无法处理不同地区的不同海拔造成的的降水差异,也无法处理不同经纬度位置造成的降水差异,还不能处理不同季节造成的降水差异。本发明将降水实况产品和卫星分辨率匹配到5公里以后,为了使预测更加稳定,创新性的采用以观测点为中心,周围3×3的9个点的数据并加上经纬度信息、当天的年序列号以及该点的海拔高度作为一个样本的输入。该特征的输入创新性的同时考虑了时间,位置和海拔,较以往的全国降水用统一模型有了很大的改进。对于一个输入训练样本,共包含292个特征,如下表所示:The traditional precipitation model uses one template for the whole country, which can neither handle the precipitation differences caused by different altitudes in different regions, nor the precipitation differences caused by different longitudes and latitudes, nor the precipitation differences caused by different seasons. The present invention matches the real-time precipitation product and the satellite resolution to 5 kilometers. In order to make the prediction more stable, the data of 9 points around the observation point in a 3×3 shape is innovatively used, with the longitude and latitude information, the year serial number of the day, and the altitude of the point as the input of a sample. The input of this feature innovatively takes into account time, location and altitude at the same time, which is a great improvement over the previous unified model for precipitation across the country. For an input training sample, a total of 292 features are included, as shown in the following table:
表3输入样本内容列表Table 3 Input sample content list
对应的标签则为这个点整小时的降水量。The corresponding label is the precipitation at this point in the hour.
6.并行计算设置6. Parallel computing settings
为了加快训练时间,同时减少预测时间,按区域训练多个模型。中国区域观测,范围为北纬3°-55°,东经60°-137°。对每个4°×4°的范围内的数据训练一个模型,这样在训练和预测的时候都可以并行计算,提高了效率,并且大大缩短的预测时间。纬度方向跨度为55°-3°=52°,共需要13个区域模型,经度方向跨度为137°-60°=77°,共需要20个模型,其中第20个模型只需要1°经度×4°纬度的数据。也就是总共需要训练20×13=260个区域的模型。对于每个模型,输入2019年的数据进行训练,卫星分辨率是0.04°,那么每一个标准区域内,有100×100个点,再乘以365天,共365万个训练数据进行训练。In order to speed up the training time and reduce the prediction time, multiple models are trained by region. The Chinese regional observation range is 3°-55° north latitude and 60°-137° east longitude. A model is trained for each data within the range of 4°×4°, so that parallel calculations can be performed during training and prediction, which improves efficiency and greatly shortens the prediction time. The span in the latitude direction is 55°-3°=52°, and a total of 13 regional models are required. The span in the longitude direction is 137°-60°=77°, and a total of 20 models are required, of which the 20th model only requires 1° longitude × 4° latitude data. That is, a total of 20×13=260 regional models need to be trained. For each model, the data of 2019 is input for training, and the satellite resolution is 0.04°. Then in each standard area, there are 100×100 points, multiplied by 365 days, a total of 3.65 million training data for training.
7.预测设置7. Forecast Settings
对于训练集以外的卫星数据,当卫星获取到一个小时内的4个时次数据后,进行4°×4°的拆分。每一个区域分别分给对应的模型进行预测,最后再将预测结果拼接起来,形成中国区域的降水分布。For satellite data outside the training set, when the satellite obtains 4 time-of-day data within an hour, it is split into 4°×4°. Each area is assigned to the corresponding model for prediction, and finally the prediction results are spliced together to form the precipitation distribution in China.
基于风云四号卫星的降水量的预测方法取得了以下技术效果:The precipitation prediction method based on the Fengyun-4 satellite has achieved the following technical results:
1.将风云四号卫星的观测的结果数据进行时间和空间的连续性整合,最大限度的使用风云四号卫星的数据,保证了样本数据的充足和真实,将整合后的数据对降水量预测模型进行训练,提高所述降水预测方法的结果的准确性和可信度。1. The observation result data of the Fengyun-4 satellite are continuously integrated in time and space, and the data of the Fengyun-4 satellite are used to the maximum extent to ensure the sufficiency and authenticity of the sample data. The integrated data are used to train the precipitation prediction model to improve the accuracy and credibility of the results of the precipitation prediction method.
2.针对风云四号卫星的观测的结果数据特点,设计了降水量预测的数学模型,将遗传算法与GBDT算法模型进行深度融合和优化,在整合后的卫星观测数据对GBDT算法模型进行训练之前,利用遗传算法将GBDT算法模型的参数的初始值进行优化,得到GBDT算法模型的最优的参数初始值,在该参数初始值的基础上进一步完成GBDT算法的模型的迭代计算,相比于现有技术中GBDT算法模型参数初始值的随机生成,提高了数学模型的数据处理能力,并提高了降水量预测结果的准确度和预测计算的稳定性。2. According to the data characteristics of the observation results of the Fengyun-4 satellite, a mathematical model for precipitation prediction was designed, and the genetic algorithm and the GBDT algorithm model were deeply integrated and optimized. Before the integrated satellite observation data were used to train the GBDT algorithm model, the initial values of the parameters of the GBDT algorithm model were optimized using the genetic algorithm to obtain the optimal initial values of the parameters of the GBDT algorithm model. Based on the initial values of the parameters, the iterative calculation of the GBDT algorithm model was further completed. Compared with the random generation of the initial values of the GBDT algorithm model parameters in the prior art, the data processing capability of the mathematical model was improved, and the accuracy of the precipitation prediction results and the stability of the prediction calculation were improved.
对所公开的实施例的上述说明,使本领域专业技术人员能够实现或使用本发明。对这些实施例的多种修改对本领域的专业技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本发明的精神或范围的情况下,在其它实施例中实现。因此,本发明将不会被限制于本文所示的这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。The above description of the disclosed embodiments enables those skilled in the art to implement or use the present invention. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the present invention. Therefore, the present invention will not be limited to the embodiments shown herein, but rather to the widest scope consistent with the principles and novel features disclosed herein.
Claims (4)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010693436.9A CN111832828B (en) | 2020-07-17 | 2020-07-17 | Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010693436.9A CN111832828B (en) | 2020-07-17 | 2020-07-17 | Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111832828A CN111832828A (en) | 2020-10-27 |
| CN111832828B true CN111832828B (en) | 2023-12-19 |
Family
ID=72923590
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010693436.9A Active CN111832828B (en) | 2020-07-17 | 2020-07-17 | Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111832828B (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112380781B (en) * | 2020-11-30 | 2022-10-18 | 中国人民解放军国防科技大学 | A Complementation Method of Satellite Observations Based on Reanalysis Data and Imbalanced Learning |
| CN112926664B (en) * | 2021-03-01 | 2023-11-24 | 南京信息工程大学 | Feature selection based on evolutionary algorithm and short-term heavy precipitation forecasting method in CART forest |
| CN112966656A (en) * | 2021-03-29 | 2021-06-15 | 国家卫星海洋应用中心 | Data processing method and device |
Citations (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105974495A (en) * | 2016-04-29 | 2016-09-28 | 中国科学院遥感与数字地球研究所 | Method for pre-judging future average cloud amount of target area by using classification fitting method |
| JP2017003416A (en) * | 2015-06-10 | 2017-01-05 | 古野電気株式会社 | Rainfall prediction system |
| CN108475393A (en) * | 2016-01-27 | 2018-08-31 | 华为技术有限公司 | The system and method that decision tree is predicted are promoted by composite character and gradient |
| CN108508505A (en) * | 2018-02-05 | 2018-09-07 | 南京云思创智信息科技有限公司 | Heavy showers and thunderstorm forecasting procedure based on multiple dimensioned convolutional neural networks and system |
| CN108761574A (en) * | 2018-05-07 | 2018-11-06 | 中国电建集团北京勘测设计研究院有限公司 | Rainfall evaluation method based on Multi-source Information Fusion |
| CN109143408A (en) * | 2018-08-09 | 2019-01-04 | 河海大学 | Combine short-term precipitation forecasting procedure in dynamic area based on MLP |
| CN109946235A (en) * | 2019-02-26 | 2019-06-28 | 南京信息工程大学 | Multi-layer cloud retrieval method based on the multi-channel scanning imaging radiometer of Fengyun 4A meteorological satellite |
| CN110346844A (en) * | 2019-07-15 | 2019-10-18 | 南京恩瑞特实业有限公司 | Quantitative Precipitation estimating and measuring method of the NRIET based on cloud classification and machine learning |
| CN110516818A (en) * | 2019-05-13 | 2019-11-29 | 南京江行联加智能科技有限公司 | A kind of high dimensional data prediction technique based on integrated study technology |
| CN110765644A (en) * | 2019-11-06 | 2020-02-07 | 兰州大学 | Data assimilation method for Fengyun No. four satellite lightning imager |
| CN110824586A (en) * | 2019-10-23 | 2020-02-21 | 上海理工大学 | Precipitation prediction method based on improved decision tree algorithm |
| US10613252B1 (en) * | 2015-10-02 | 2020-04-07 | Board Of Trustees Of The University Of Alabama, For And On Behalf Of The University Of Alabama In Huntsville | Weather forecasting systems and methods |
| CN111105068A (en) * | 2019-11-01 | 2020-05-05 | 复旦大学 | Numerical value mode correction method based on sequence regression learning |
| CN111210082A (en) * | 2020-01-13 | 2020-05-29 | 东南大学 | A Precipitation Prediction Method Based on Optimized BP Neural Network Algorithm |
| CN111368887A (en) * | 2020-02-25 | 2020-07-03 | 平安科技(深圳)有限公司 | Training method of thunderstorm weather prediction model and thunderstorm weather prediction method |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU2002216339B2 (en) * | 2000-12-15 | 2006-03-02 | Nooly Technologies Ltd. | Location-based weather nowcast system and method |
| FR2947938B1 (en) * | 2009-07-10 | 2014-11-21 | Thales Sa | METHOD OF PREDICTING EVOLUTION OF A WEATHER PHENOMENON FROM DATA FROM A WEATHER RADAR |
| US9207098B2 (en) * | 2014-02-21 | 2015-12-08 | Iteris, Inc. | Short-term travel-time prediction modeling augmented with radar-based precipitation predictions and scaling of same |
| US11248930B2 (en) * | 2018-03-02 | 2022-02-15 | International Business Machines Corporation | Microclimate wind forecasting |
-
2020
- 2020-07-17 CN CN202010693436.9A patent/CN111832828B/en active Active
Patent Citations (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2017003416A (en) * | 2015-06-10 | 2017-01-05 | 古野電気株式会社 | Rainfall prediction system |
| US10613252B1 (en) * | 2015-10-02 | 2020-04-07 | Board Of Trustees Of The University Of Alabama, For And On Behalf Of The University Of Alabama In Huntsville | Weather forecasting systems and methods |
| CN108475393A (en) * | 2016-01-27 | 2018-08-31 | 华为技术有限公司 | The system and method that decision tree is predicted are promoted by composite character and gradient |
| CN105974495A (en) * | 2016-04-29 | 2016-09-28 | 中国科学院遥感与数字地球研究所 | Method for pre-judging future average cloud amount of target area by using classification fitting method |
| CN108508505A (en) * | 2018-02-05 | 2018-09-07 | 南京云思创智信息科技有限公司 | Heavy showers and thunderstorm forecasting procedure based on multiple dimensioned convolutional neural networks and system |
| CN108761574A (en) * | 2018-05-07 | 2018-11-06 | 中国电建集团北京勘测设计研究院有限公司 | Rainfall evaluation method based on Multi-source Information Fusion |
| CN109143408A (en) * | 2018-08-09 | 2019-01-04 | 河海大学 | Combine short-term precipitation forecasting procedure in dynamic area based on MLP |
| CN109946235A (en) * | 2019-02-26 | 2019-06-28 | 南京信息工程大学 | Multi-layer cloud retrieval method based on the multi-channel scanning imaging radiometer of Fengyun 4A meteorological satellite |
| CN110516818A (en) * | 2019-05-13 | 2019-11-29 | 南京江行联加智能科技有限公司 | A kind of high dimensional data prediction technique based on integrated study technology |
| CN110346844A (en) * | 2019-07-15 | 2019-10-18 | 南京恩瑞特实业有限公司 | Quantitative Precipitation estimating and measuring method of the NRIET based on cloud classification and machine learning |
| CN110824586A (en) * | 2019-10-23 | 2020-02-21 | 上海理工大学 | Precipitation prediction method based on improved decision tree algorithm |
| CN111105068A (en) * | 2019-11-01 | 2020-05-05 | 复旦大学 | Numerical value mode correction method based on sequence regression learning |
| CN110765644A (en) * | 2019-11-06 | 2020-02-07 | 兰州大学 | Data assimilation method for Fengyun No. four satellite lightning imager |
| CN111210082A (en) * | 2020-01-13 | 2020-05-29 | 东南大学 | A Precipitation Prediction Method Based on Optimized BP Neural Network Algorithm |
| CN111368887A (en) * | 2020-02-25 | 2020-07-03 | 平安科技(深圳)有限公司 | Training method of thunderstorm weather prediction model and thunderstorm weather prediction method |
Non-Patent Citations (5)
| Title |
|---|
| Retrieval of cloud top properties from advanced geostationary satellite imager measurements based on machine learning algorithms;Min Min;《Remote Sensing of Environment》;全文 * |
| Simulation of weather systems over Indian regionSimulation of weather systems over Indian region using mesoscale models using mesoscale models;S. S. Vaidya;Meteorology and Atmospheric Physics;全文 * |
| 卫星云图资料在降水量客观分析中的应用试验;刘刈;何险峰;刘德;;高原山地气象研究(01);全文 * |
| 基于改进CART算法的降雨量预测模型;李正方;杜景林;周芸;;现代电子技术(02);全文 * |
| 风云三号降水测量雷达技术性能分析;尹红刚;商建;吴琼;郭杨;窦芳丽;谷松岩;;气象科技(05);全文 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN111832828A (en) | 2020-10-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109214592B (en) | Multi-model-fused deep learning air quality prediction method | |
| CN111832828B (en) | Intelligent precipitation prediction method based on Fengyun-4 meteorological satellite | |
| CN110135630B (en) | Short-term load demand forecasting method based on random forest regression and multi-step optimization | |
| CN108539738B (en) | A Short-Term Load Forecasting Method Based on Gradient Boosting Decision Tree | |
| CN103426026B (en) | A kind of method of hybrid neural networks prediction and identification scenic spot meteorological element | |
| CN110991690B (en) | A multi-temporal wind speed prediction method based on deep convolutional neural network | |
| CN111080032A (en) | Load prediction method based on Transformer structure | |
| CN106600060B (en) | Solar irradiation amount prediction method based on similar day sunny coefficient correction | |
| CN106897957B (en) | Automatic weather station real-time data quality control method based on PCA and PSO-E L M | |
| CN109165693A (en) | It is a kind of to sentence knowledge method automatically suitable for dew, frost and the weather phenomenon of icing | |
| CN110781458A (en) | Method for predicting surface solar irradiance based on mixed regression model | |
| CN111582555B (en) | Photovoltaic power prediction method based on foundation cloud picture image characteristics | |
| CN115796040A (en) | A Small Sample Based Greenhouse Temperature Prediction Method | |
| CN115267945A (en) | Thunder and lightning early warning method and system based on graph neural network | |
| CN114548575A (en) | Self-adaptive building day-ahead load prediction method based on transfer learning | |
| CN106845080A (en) | Scene Tourist meteorological disaster intelligent Forecasting based on difference amendment | |
| CN110046756A (en) | Short-time weather forecasting method based on Wavelet Denoising Method and Catboost | |
| CN117422174A (en) | Traffic hub passenger flow prediction method based on large event perception module LEAM | |
| CN116502050A (en) | Dynamic interpolation method and system for missing evapotranspiration observations at global flux sites | |
| CN113344290A (en) | Correcting method for sub-season rainfall weather forecast based on U-Net network | |
| CN119151755B (en) | Method, device and equipment for determining the health status of vegetation | |
| CN114997475B (en) | Kmeans-based fusion model photovoltaic power generation short-term prediction method | |
| CN116245008A (en) | Dynamic estimation method of drainage basin hydrologic model parameters based on digital twin | |
| CN116565829A (en) | Short-term photovoltaic power generation capacity prediction method based on SOM-BP neural network | |
| CN119051003A (en) | Correlation-based photovoltaic output prediction method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |