CN112308298B

CN112308298B - A multi-scenario performance index prediction method and system for semiconductor production lines

Info

Publication number: CN112308298B
Application number: CN202011108406.3A
Authority: CN
Inventors: 乔非; 高陈媛; 刘鹃
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2020-10-16
Filing date: 2020-10-16
Publication date: 2022-06-14
Anticipated expiration: 2040-10-16
Also published as: CN112308298A

Abstract

The invention relates to a multi-scenario performance index prediction method and a multi-scenario performance index prediction system for a semiconductor production line, wherein the method comprises the following steps: the production scene quantitative division module is driven by data, quantitatively maps the product value, the average product processing period and the utilization rate of each device of the production line, and divides the production line into three scenes of light load, normal load and heavy load; the main prediction network construction module is used for combining a deep neural network algorithm and semiconductor production line performance prediction by taking the divided normal load data as sample data to construct a prediction network under a normal load scene; and the multi-scene prediction model building module is used for applying the idea of transfer learning to production line prediction and building networks under light load and heavy load scenes according to the built main prediction network so as to form the multi-scene prediction model. Compared with the prior art, the method quantitatively divides the production line scene, can more accurately predict the performance indexes of a plurality of production lines under different load scenes, and can be used on line.

Description

A multi-scenario performance index prediction method and system for semiconductor production lines

技术领域technical field

本发明涉及智能制造技术领域，尤其是涉及一种面向半导体生产线的多场景性能指标预测方法及系统。The invention relates to the technical field of intelligent manufacturing, in particular to a multi-scenario performance index prediction method and system for a semiconductor production line.

背景技术Background technique

制造业是国民经济的产业主体，是经济高速增长的引擎，是综合国力的重要体现。为了保证竞争力，制造企业需快速地进行动态调度管理，合理分配资源。实时的性能指标预测结果能够为生产决策和评估提供依据，因此迅速了解生产线在一定调度模式下所达到的性能指标成了必然要求。Manufacturing is the main body of the national economy, the engine of rapid economic growth, and an important manifestation of comprehensive national strength. In order to ensure competitiveness, manufacturing enterprises need to quickly carry out dynamic scheduling management and rationally allocate resources. Real-time performance index prediction results can provide a basis for production decision-making and evaluation, so it is an inevitable requirement to quickly understand the performance index achieved by the production line in a certain scheduling mode.

目前，预测模型建模方法主要有数学建模、仿真建模、机器学习建模三种方法。首先，半导体制造是一个具有高度非线性、多变量耦合性、随机不确定性的复杂过程，数学模型的建立极其困难；其次，传统的仿真建模方法需要投入大量的资金且预测耗时较长，适应动态环境的能力较差。为了克服数学模型和传统仿真预测模型的局限性，专家学者们引入机器学习技术，研究数据驱动的生产线预测方法。At present, there are three main methods of predictive model modeling: mathematical modeling, simulation modeling, and machine learning modeling. First, semiconductor manufacturing is a complex process with high nonlinearity, multi-variable coupling, and random uncertainty, and the establishment of mathematical models is extremely difficult; second, traditional simulation modeling methods require a lot of investment and take a long time to predict , the ability to adapt to the dynamic environment is poor. In order to overcome the limitations of mathematical models and traditional simulation prediction models, experts and scholars have introduced machine learning technology to study data-driven production line prediction methods.

中国专利“可用于半导体生产线动态调度的性能预测方法”(专利公开号：CN103310285 A)发明了一种基于极限学习机的半导体生产线调度系统，该方法采集半导体生产线历史数据建立样本集与测试样本集；采用极限学习机方法构建预测模型；运用测试样本测试预测模型的网络性能，将预测结果归一化后输出。中国专利“一种基于LR的生产线备件损坏率预测系统”(专利公开号：CN 108898254 A)发明了一种生产线备件损坏率预测系统，该方法通过传感器收集设备的运行记录，每小时电流，电压平均值，运行时长等数据；人工获取备件的安装到替换的时间；通过LR训练，获得模型；通过模型预测对应类别设备的当前损耗率。The Chinese patent "Performance Prediction Method for Dynamic Scheduling of Semiconductor Production Lines" (Patent Publication No.: CN103310285 A) invents a semiconductor production line scheduling system based on extreme learning machines, which collects historical data of semiconductor production lines to establish sample sets and test sample sets ; Use the extreme learning machine method to build a prediction model; use test samples to test the network performance of the prediction model, and output the prediction results after normalization. The Chinese patent "An LR-based Production Line Spare Part Damage Rate Prediction System" (Patent Publication No.: CN 108898254 A) invented a production line spare part damage rate prediction system. The method collects the operation records of the equipment through sensors, hourly current, voltage Average value, running time and other data; manually obtain the time from the installation of spare parts to replacement; obtain the model through LR training; predict the current loss rate of the corresponding category of equipment through the model.

不难看出，现有方法发明大都只针对单一生产场景，而半导体生产线存在诸多不确定的动态生产因素，例如投料变化、生产线负载变化等，目前现有的识别和预测方法均难以适应动态生产环境，且大部分生产线性能预测专注于生产率等单一性能指标，同时预测设备利用率和生产率的多性能指标预测较少。为此，需要一种能够适应多种生产线状态且及时的性能指标预测方法，包括场景划分、主预测网络的构建和各场景网络构建组成新模型三个组成部分，以提高生产线性能预测的准确性以及适应性，给调度决策提供更有力的支撑。目前尚未有上述生产线性能预测方法相关的文献和专利。It is not difficult to see that most of the existing method inventions are only aimed at a single production scenario, and there are many uncertain dynamic production factors in the semiconductor production line, such as changes in material feeding, changes in production line load, etc. The current identification and prediction methods are difficult to adapt to the dynamic production environment. , and most production line performance forecasts focus on a single performance indicator such as productivity, while there are fewer multi-performance indicators forecasting equipment utilization and productivity. To this end, a timely performance index prediction method that can adapt to a variety of production line states is required, including scene division, the construction of the main prediction network, and the construction of each scene network to form three components of a new model to improve the accuracy of production line performance prediction. and adaptability to provide stronger support for scheduling decisions. At present, there are no literatures and patents related to the above-mentioned production line performance prediction method.

发明内容SUMMARY OF THE INVENTION

本发明的目的就是为了克服上述现有技术存在的缺陷而提供一种面向半导体生产线的多场景性能指标预测方法及系统。The purpose of the present invention is to provide a multi-scenario performance index prediction method and system for semiconductor production lines in order to overcome the above-mentioned defects in the prior art.

本发明的目的可以通过以下技术方案来实现：The object of the present invention can be realized through the following technical solutions:

一种面向半导体生产线的多场景性能指标预测方法，该方法包括以下步骤：A multi-scenario performance index prediction method oriented to a semiconductor production line, the method includes the following steps:

步骤1：针对生产线历史数据进行量化映射，并根据轻载、正常负载和重载场景各自所分别对应的场景定量来进行数据划分；Step 1: Quantitatively map the historical data of the production line, and divide the data according to the scenarios corresponding to the light load, normal load and heavy load scenarios respectively;

步骤2：以划分出来的正常负载场景下的数据作为样本数据，将深度神经网络算法和半导体生产线性能预测相结合，构建正常负载场景下的预测网络；Step 2: Using the divided data under the normal load scenario as sample data, combine the deep neural network algorithm and the performance prediction of the semiconductor production line to construct a prediction network under the normal load scenario;

步骤3：基于正常负载场景下的预测网络，进一步构建轻载和重载场景下的预测网络，将轻载、正常负载和重载各自场景下的预测网络组成多场景预测模型；Step 3: Based on the prediction network under the normal load scenario, further construct the prediction network under the light load and heavy load scenarios, and form the prediction network under the light load, normal load and heavy load scenarios into a multi-scenario prediction model;

步骤4：针对生产线在线数据根据阈值划分为不同场景结果后，于所述多场景预测模型中选择对应网络进行预测，得到性能指标预测结果。Step 4: After the online data of the production line is divided into different scenario results according to the threshold, the corresponding network is selected in the multi-scenario prediction model for prediction, and the performance index prediction result is obtained.

进一步地，所述的生产线历史数据包括输入量和输出量，其中，所述输入量包括：采样天数、产品平均加工周期、平均日移动步数、总在制品数、各缓冲区队长、缓冲区总队长和总移动步数；所述输出量包括：产品平均加工周期和各设备利用率。Further, the historical data of the production line includes input and output, wherein the input includes: sampling days, average product processing cycle, average daily moving steps, total work-in-progress, each buffer leader, buffer zone The total team leader and the total number of moving steps; the output includes: the average processing cycle of the product and the utilization rate of each equipment.

进一步地，所述的步骤2中构建正常负载场景下的预测网络的过程包括以下分步骤：Further, the process of constructing the prediction network under the normal load scenario in the described step 2 includes the following sub-steps:

步骤201：将样本数据中的调度规则符号进行编码，并将其作为输入进行归一化处理，将数量限制在[0,1]区间内，归一化公式为[x-min(x_i)]/[max(x_i)-min(x_i)]，其中，x指输入变量，i为样本变量；Step 201: Encode the scheduling rule symbol in the sample data, and use it as an input for normalization processing, limit the number within the [0,1] interval, and the normalization formula is [x-min(x _i ) ]/[max(x _i )-min(x _i )], where x refers to the input variable and i is the sample variable;

步骤202：采用深度神经网络构建正常负载场景下的预测网络；将深度学习算法与生产线性能预测相结合，采用网格搜索法得到合适的隐藏层层数、各隐藏层神经元个数以及各层激励函数；Step 202: Use a deep neural network to build a prediction network under a normal load scenario; combine the deep learning algorithm with the production line performance prediction, and use a grid search method to obtain the appropriate number of hidden layers, the number of neurons in each hidden layer, and the number of layers in each hidden layer. excitation function;

步骤203：采用测试样本测试预测网络的网络性能，将测试样本所对应得到的预测结果反归一化处理后所对应的输出值与测试样本的输出值对比，判断是否满足精度要求；Step 203: use the test sample to test the network performance of the prediction network, compare the output value corresponding to the prediction result obtained by the test sample after inverse normalization processing with the output value of the test sample, and determine whether the accuracy requirement is met;

步骤204：如果测试结果的预测精度能够满足精度要求，则正常负载场景下的预测网络建立成功，如果不满足，则返回至步骤202，重新选取隐藏层层数、各隐藏层神经元个数以及各层激励函数并再次训练模型。Step 204: If the prediction accuracy of the test result can meet the accuracy requirements, the prediction network under the normal load scenario is successfully established; if not, return to step 202, and reselect the number of hidden layers, the number of neurons in each hidden layer, and Each layer activates the function and trains the model again.

进一步地，所述的步骤201中的调度规则符号采用独热编码方式进行编码，以使得其能够被网络接收。Further, the scheduling rule symbol in the step 201 is encoded by one-hot encoding, so that it can be received by the network.

进一步地，所述的步骤202中的深度学习算法的输入为：Further, the input of the deep learning algorithm in the described step 202 is:

对于给定的k个不同样本的训练集：For a given training set of k different samples:

X^k＝{S^k,D^k,Ru^k,P^k|S^k∈R^∝,D^k∈R^β,Ru^k∈R^λ,P^k∈R^m}X ^k ＝{S ^k ,D ^k ,Ru ^k ,P ^k |S ^k ∈R ^∝ ,D ^k ∈R ^β ,Ru ^k ∈R ^λ ,P ^k ∈R ^m }

其中，

为智能车间系统状态；

为智能车间投料信息；

为智能车间调度规则；

为在当前智能车间系统状态S^k，当前投料信息D^k，当前调度规则Ru^k情况下1天后的性能指标；in,

is the status of the intelligent workshop system;

Feeding information for intelligent workshop;

Scheduling rules for smart workshops;

is the performance index 1 day after the current intelligent workshop system state S ^k , the current feeding information D ^k , and the current scheduling rule Ru ^k ;

所述的步骤202中的深度学习算法的输出采用o_i表示，其中i＝1，…，m1，m1指输出值o的维数，即有m1个输出。The output of the deep learning algorithm in step 202 is represented by o _i , where i=1, . . ., m1, m1 refers to the dimension of the output value o, that is, there are m1 outputs.

进一步地，所述的步骤204中的精度要求采用均方根误差作为基准，其描述公式为：Further, the precision requirement in the described step 204 uses the root mean square error as the benchmark, and the description formula is:

式中，m为输出变量数，P为输入性能指标，Y为预测出的性能指标，

为输出变量样本数。where m is the number of output variables, P is the input performance index, Y is the predicted performance index,

is the number of samples of the output variable.

进一步地，所述的步骤3中的构建轻载和重载场景下的预测网络的过程包括以下分步骤：Further, the process of constructing the prediction network under the light-load and heavy-load scenarios in the described step 3 includes the following sub-steps:

步骤301：将正常负载场景下的预测网络的输入层神经元个数赋值给轻载和重载场景下的预测网络；Step 301: Assign the number of input layer neurons of the prediction network under the normal load scenario to the prediction network under the light load and heavy load scenarios;

步骤302：将正常负载场景下的预测网络的除最后一层隐藏层的各层隐藏层神经元个数赋值给轻载和重载场景下的预测网络作为对应前n-1层的神经元个数；Step 302: Assign the number of hidden layer neurons of the prediction network in the normal load scenario except the last hidden layer to the prediction network in the light load and heavy load scenarios as the neurons corresponding to the first n-1 layers. number;

步骤303：将正常负载场景下的预测网络的除最后一层隐藏层的各层隐藏层权值阈值信息赋值给轻载和重载场景下的预测网络作为初始化数据；Step 303: Assign the weight threshold information of each hidden layer except the last hidden layer of the prediction network under the normal load scenario to the prediction network under the light load and heavy load scenarios as initialization data;

步骤304：将正常负载场景下的预测网络的除最后一层隐藏层的各层隐藏层激励函数赋值给轻载和重载场景下的预测网络；Step 304: Assign the activation functions of each hidden layer except the last hidden layer of the prediction network under the normal load scenario to the prediction network under the light load and heavy load scenarios;

步骤305：通过网络搜索法调整最后一层隐藏层神经元个数，将步骤1中划分出来的轻载以及重载场景下的数据输入给轻载和重载场景下的预测网络中进行训练，训练完毕后得到轻载和重载场景下的预测网络。Step 305: Adjust the number of neurons in the hidden layer of the last layer through the network search method, and input the data in the light-load and heavy-load scenarios divided in step 1 into the prediction network under the light-load and heavy-load scenarios for training, After training, the prediction network in light-load and heavy-load scenarios is obtained.

进一步地，所述多场景预测模型构建模块包含：Further, the multi-scene prediction model building module includes:

轻载场景预测网络构建，以轻载场景数据为样本，在主预测网络基础上，改变最后一层隐藏层神经元个数，搭建轻载场景预测网络；The light-load scene prediction network is constructed. Taking the light-load scene data as a sample, on the basis of the main prediction network, the number of neurons in the last hidden layer is changed to build a light-load scene prediction network;

重载场景预测网络构建，以重载场景数据为样本，在主预测网络基础上，改变最后一层隐藏层神经元个数，搭建重载场景预测网络；The overloaded scene prediction network is constructed. Taking the overloaded scene data as a sample, on the basis of the main prediction network, the number of neurons in the last hidden layer is changed to build the overloaded scene prediction network;

网络组合，将三个预测网络进行组合。组成多场景预测模型。Network combination, combining the three prediction networks. Form a multi-scenario prediction model.

进一步地，所述的步骤1包括以下分步骤：Further, described step 1 comprises the following sub-steps:

步骤101：在相同采样天数下，采集生产线历史数据进行量化映射，绘制折线图；Step 101: Under the same sampling days, collect historical data of the production line for quantitative mapping, and draw a line graph;

步骤102：根据各折线图中的曲线变化规律以及轻载、正常负载和重载场景各自所分别对应的场景定量来进行数据划分。Step 102 : Perform data division according to the curve change law in each line graph and the scenario quantifications corresponding to the light load, normal load and heavy load scenarios respectively.

进一步地，所述步骤102中的轻载场景下，所对应的曲线中，起点为设备利用率开始进入平稳阶段时总在制品值所对应的点，终点为产品平均加工周期曲线斜率最大时总在制品值所对应的点；Further, in the light-load scenario in the step 102, in the corresponding curve, the starting point is the point corresponding to the total work-in-process value when the equipment utilization rate begins to enter a stable stage, and the end point is the total product when the average processing cycle curve slope is the maximum. The point corresponding to the WIP value;

正常负载场景下，所对应的曲线中，起点为产品平均加工周期曲线斜率最大时在制品值所对应的点，终点为产品平均加工周期曲线斜率最小时在制品值所对应的点；In the normal load scenario, in the corresponding curve, the starting point is the point corresponding to the WIP value when the slope of the product average processing cycle curve is the largest, and the end point is the point corresponding to the WIP value when the product average processing cycle curve slope is the smallest;

重载场景下，所对应的曲线中，起点为产品平均加工周期曲线斜率最小时总在制品值所对应的点。In the heavy load scenario, in the corresponding curve, the starting point is the point corresponding to the total work-in-process value when the slope of the average processing cycle curve of the product is the smallest.

本发明还提供一种采用所述的面向半导体生产线的多场景性能指标预测方法的系统，该系统包括：The present invention also provides a system that adopts the multi-scenario performance index prediction method for semiconductor production lines, the system comprising:

主预测网络构建模块，用于以划分出来的正常负载场景下的数据作为样本数据，将深度神经网络算法和半导体生产线性能预测相结合，构建正常负载场景下的预测网络；The main prediction network building module is used to use the divided data under normal load scenarios as sample data, and combine the deep neural network algorithm with the performance prediction of semiconductor production lines to construct a prediction network under normal load scenarios;

多场景预测模型构建模块，用于基于正常负载场景下的预测网络，进一步构建轻载和重载场景下的预测网络，将轻载、正常负载和重载各自场景下的预测网络组成多场景预测模型；The multi-scenario prediction model building module is used to further construct the prediction network under the light load and heavy load scenarios based on the prediction network under the normal load scenario, and combine the prediction networks under the light load, normal load and heavy load scenarios into multi-scenario prediction Model;

生产场景定量划分模块，用于针对生产线历史数据进行量化映射，并根据轻载、正常负载和重载场景各自所分别对应的场景定量来进行数据划分。The production scene quantitative division module is used to quantitatively map the historical data of the production line, and divide the data according to the quantitative scenes corresponding to the light load, normal load and heavy load scenarios respectively.

与现有技术相比，本发明具有以下优点：Compared with the prior art, the present invention has the following advantages:

1、本发明通过对生产场景进行定量的划分，将生产环境划分为轻载、正常负载、重载三个场景，使预测方法适应动态的生产环境；1. The present invention divides the production environment into three scenarios of light load, normal load and heavy load by quantitatively dividing the production scenarios, so that the prediction method can be adapted to the dynamic production environment;

2、本发明将深度学习算法与半导体生产线预测相结合，分别以划分出来的轻载、正常负载、重载数据为样本数据，建立了神经网络预测模型，能够对系统中的实时变化及时作出反应，减少了重调度的需要。本发明把系统看成黑箱，提供的建模方法把利用生产线可获得的历史数据及离在线数据，挖掘其中有用知识，实现实时在线优化控制；2. The present invention combines the deep learning algorithm with the prediction of the semiconductor production line, and uses the divided light load, normal load and heavy load data as sample data to establish a neural network prediction model, which can respond to real-time changes in the system in time. , reducing the need for rescheduling. The present invention regards the system as a black box, and provides a modeling method that utilizes the historical data and off-line data obtainable from the production line, mines useful knowledge therein, and realizes real-time on-line optimization control;

3、本发明将迁移学习思想运用到多场景网络的训练过程中，根据已经构建出的主预测网络，构建轻载、重载场景下的网络，从而组成多场景预测模型，减少了网络训练的时间，提高了生产线性能预测的准确性；3. The present invention applies the transfer learning idea to the training process of the multi-scenario network, and builds the network under the light-load and heavy-load scenarios according to the main prediction network that has been constructed, so as to form a multi-scenario prediction model and reduce the network training time. time, improving the accuracy of production line performance prediction;

4、本发明能够预测未来一天后的多个生产线性能指标，为生产决策提供更多的数据支持。4. The present invention can predict the performance indicators of multiple production lines one day in the future, and provide more data support for production decision-making.

附图说明Description of drawings

图1为本发明的结构示意图；Fig. 1 is the structural representation of the present invention;

图2为本发明实施例中MiniFAB模型结构图；Fig. 2 is the MiniFAB model structure diagram in the embodiment of the present invention;

图3为本发明实施例中第26天采样Ma至Me设备调度规则分别EDD、EDD、SRPT、SRPT、EDD的产品平均加工周期示意图；FIG. 3 is a schematic diagram of the average processing cycle of products of EDD, EDD, SRPT, SRPT, and EDD, respectively, in the 26th day sampling Ma to Me equipment scheduling rules in the embodiment of the present invention;

图4为本发明实施例中第26天采样Ma至Me设备调度规则分别EDD、EDD、SRPT、SRPT、EDD、EDD的在制品数示意图；4 is a schematic diagram of the number of work-in-progress of the equipment scheduling rules for sampling Ma to Me on the 26th day, respectively, EDD, EDD, SRPT, SRPT, EDD, and EDD in the embodiment of the present invention;

图5为本发明实施例中第26天采样Ma至Me设备调度规则分别EDD、EDD、SRPT、SRPT、EDD、EDD的各设备利用率示意图；FIG. 5 is a schematic diagram of the utilization rate of each equipment of EDD, EDD, SRPT, SRPT, EDD, and EDD for sampling Ma to Me equipment scheduling rules on the 26th day in the embodiment of the present invention;

图6为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在生产率上的均方根误差对比示意图；6 is a schematic diagram showing the comparison of the root mean square error in productivity of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model in an embodiment of the present invention;

图7为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在Ma设备利用率上的均方根误差对比示意图；7 is a schematic diagram illustrating the comparison of the root mean square error of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model on Ma equipment utilization in an embodiment of the present invention;

图8为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在Mb设备利用率上的均方根误差对比示意图；8 is a schematic diagram showing the comparison of the root mean square error of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model on Mb device utilization in an embodiment of the present invention;

图9为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在Mc设备利用率上的均方根误差对比示意图；9 is a schematic diagram illustrating the comparison of the root mean square error of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model on Mc equipment utilization in an embodiment of the present invention;

图10为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在Md设备利用率上的均方根误差对比示意图；10 is a schematic diagram showing the comparison of the root mean square error of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model on Md equipment utilization in an embodiment of the present invention;

图11为本发明实施例中各场景模型、不分场景模型以及多场景多性能指标预测模型在Me设备利用率上的均方根误差对比示意图。FIG. 11 is a schematic diagram showing the comparison of the root mean square error of the Me device utilization rate of each scenario model, a scenario-independent model, and a multi-scenario multi-performance index prediction model in an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明的一部分实施例，而不是全部实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例，都应属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

具体实施例specific embodiment

如图1所示，本发明提供一种面向半导体生产线的多场景多性能指标预测方法，包括生产场景定量划分模块、主预测网络构建模块、多场景预测模型构建模块。生产场景定量划分模块由数据驱动，将生产线的在制品值、产品平均加工周期、各设备利用率进行量化映射，将生产线划分为轻载、正常负载、重载三个场景；主预测网络构建模块，以划分出来的正常负载数据为样本数据，将深度神经网络算法与半导体生产线性能预测相结合，构建正常负载场景下的预测网络；多场景预测模型构建模块：将迁移学习的思想运用到生产线预测中，根据已经构建出的主预测网络，构建轻载、重载场景下的网络，从而组成多场景预测模型。As shown in FIG. 1 , the present invention provides a multi-scenario multi-performance index prediction method for semiconductor production lines, including a production scenario quantitative division module, a main prediction network building module, and a multi-scenario prediction model building module. The quantitative division module of production scenarios is driven by data, and quantitatively maps the value of the production line, the average processing cycle of products, and the utilization rate of each equipment, and divides the production line into three scenarios: light load, normal load, and heavy load; the main prediction network building module , taking the divided normal load data as sample data, combining the deep neural network algorithm with the performance prediction of semiconductor production lines to build a prediction network under normal load scenarios; multi-scenario prediction model building module: applying the idea of transfer learning to production line prediction In , according to the main prediction network that has been constructed, the network under light-load and heavy-load scenarios is constructed to form a multi-scenario prediction model.

本实施例以MiniFab模型为例进行说明。MiniFab是Intel公司制造系统首席科学家Karl Kempf博士提出的一个半导体生产流程的实验仿真模型，由5台设备及6道工序组成，模型结构如图2所示。This embodiment takes the MiniFab model as an example for description. MiniFab is an experimental simulation model of a semiconductor production process proposed by Dr. Karl Kempf, chief scientist of Intel's manufacturing system. It consists of 5 devices and 6 processes. The model structure is shown in Figure 2.

应用MiniFab模型为实施对象，上述面向半导体生产线的多场景多性能指标预测方法的工作过程如下：Using the MiniFab model as the implementation object, the working process of the above-mentioned multi-scenario and multi-performance index prediction method for semiconductor production lines is as follows:

步骤1，定义样本信息X^k＝{S^k,D^k,Ru^k,P^k|S^k∈R^∝,D^k∈R^β,Ru^k∈R^λ,P^k∈R^m}，其中

描述智能车间系统状态；

描述智能车间投料信息；

描述智能车间调度规则；

描述在当前状态S^k，当前投料方式D^k，当前调度规则Ru^k下1天后的性能指标。选取车间生产率PROD、设备Ma利用率U_a、设备Mb利用率U_b、设备Mc利用率U_c、设备Md利用率U_d、设备Me利用率U_e这六个性能指标作为预测目标，样本信息中的生产线系统状态S^k、投料信息D^k、调度规则Ru^k定义见表1、表2和表3。选取10％作为测试样本集；Step 1, define sample information X ^k ={S ^k ,D ^k ,Ru ^k ,P ^k |S ^k ∈R ^∝ ,D ^k ∈R ^β ,Ru ^k ∈R ^λ ,P ^k ∈R ^m }, where

Describe the status of the smart workshop system;

Describe the material feeding information of the intelligent workshop;

Describe intelligent workshop scheduling rules;

Describe the performance index after 1 day under the current state S ^k , the current feeding mode D ^k , and the current scheduling rule Ru ^k . Six performance indicators of workshop productivity PROD, equipment Ma utilization rate U _a , equipment Mb utilization rate U _b , equipment Mc utilization rate U _c , equipment Md utilization rate U _d , equipment Me utilization rate U _e are selected as prediction targets, and the sample information The definitions of the production line system state ^Sk , the feeding information D ^k , and the scheduling rule Ru ^k are shown in Table 1, Table 2 and Table 3. Select 10% as the test sample set;

表1 MiniFab样本信息中系统状态S^k定义Table 1 Definition of system state ^Sk in MiniFab sample information

字段名field name 描述describe 数据类型type of data DayDay 采样天数Sampling days 整型Integer MCTMCT 产品平均加工周期Average product cycle time 浮点型floating point MDayMOVMDayMOV 平均日移动步数Average daily steps 整型Integer WIPWIP 在制品数Work in process 整型Integer Mab_QueueMab_Queue MaMb缓冲区队长MaMb buffer captain 整型Integer Mcd_QueueMcd_Queue McMd缓冲区队长Captain McMd Buffer 整型Integer Me_QueueMe_Queue Me缓冲区队长Captain Me Buffer 整型Integer Total_QueueTotal_Queue 缓冲区总队长buffer chief 整型Integer Total_MOVTotal_MOV 总移动步数total moving steps 整型Integer

表2 MiniFab样本信息中投料信息D^k定义Table 2 Definition of feed information D ^k in MiniFab sample information

字段名field name 描述describe 数据类型type of data x_Releasex_Release 产品x的日平均投料Average daily feed for product x 整型Integer y_Releasey_Release 产品y的日平均投料Average daily feed for product y 整型Integer z_Releasez_Release 产品z的日平均投料Average daily feed for product z 整型Integer

表3 MiniFab样本信息中调度规则Ru^k定义Table 3 Scheduling rule Ru ^k definition in MiniFab sample information

字段名field name 描述describe 数据类型type of data Ma_RuleMa_Rule 设备Ma的调度规则Scheduling rules for equipment Ma 字符型character type Mb_RuleMb_Rule 设备Mb的调度规则Scheduling rules for device Mb 字符型character type Mc_RuleMc_Rule 设备Mc的调度规则Scheduling rules for device Mc 字符型character type Md_RuleMd_Rule 设备Md的调度规则Scheduling rules for equipment Md 字符型character type Me_RuleMe_Rule 设备Me的调度规则Scheduling rules for equipment Me 字符型character type

本实施例中，采样天数为1至30，产品a，产品b，产品c的日平均投料分别从1增至19，Ma、Mb调度规则选择EDD或SRPT，Mc、Md调度规则选择EDD或SRPT或CR，Me调度规则选择EDD或SRPT，获取1天后的各设备利用率、生产率性能指标。In this embodiment, the number of sampling days is 1 to 30, the daily average feed of product a, product b, and product c is increased from 1 to 19, respectively, EDD or SRPT is selected for the scheduling rules of Ma and Mb, and EDD or SRPT is selected for the scheduling rules of Mc and Md. Or select EDD or SRPT for CR, Me scheduling rules, and obtain the utilization rate and productivity performance indicators of each equipment after 1 day.

步骤2，将产品平均加工周期MCT、在制品数WIP、设备Ma利用率U_a、设备Mb利用率U_b、设备Mc利用率U_c、设备Md利用率U_d、设备Me利用率U_e通过采样天数以及产品日平均投料总数进行映射，观察图中各曲线变化规律的对应关系，定量划分生产线轻载、正常负载、重载场景。Step 2: Pass through the average product processing cycle MCT, the number of WIP, the utilization rate U a of the equipment Ma, the utilization rate U _b of the equipment Mb, the utilization rate U _c of the equipment M _{d, the utilization rate U d} _of the equipment Me, and the utilization rate U _e of the equipment Me. Map the number of sampling days and the average daily feeding total of the product, observe the corresponding relationship between the curve changes in the figure, and quantitatively divide the light load, normal load and heavy load scenarios of the production line.

场景1：轻载场景。在该场景下，设备Ma利用率U_a、设备Mb利用率U_b、设备Mc利用率U_c、设备Md利用率U_d、设备Me利用率U_e趋于稳定，该场景开始的起点为MCT设备利用率开始进入平稳阶段的点，该场景终点为MCT曲线斜率最大点，根据该场景的起点和终点对应WIP曲线，定量划分轻载场景。Scenario 1: Light load scenario. In this scenario, the device Ma utilization rate U _a , the device Mb utilization rate U _b , the device Mc utilization rate U _c , the device Md utilization rate U _d , and the device Me utilization rate U _e tend to be stable, and the starting point of this scenario is MCT The point at which the equipment utilization begins to enter a stable stage, the end point of the scene is the point with the maximum slope of the MCT curve, and the light load scene is quantitatively divided according to the start point and end point of the scene corresponding to the WIP curve.

场景2：正常场景。在该场景下，各设备利用率较为平稳，MCT有所波动，该场景开始的起点为MCT曲线斜率最大点，该场景终点为MCT曲线斜率最小点，根据该场景的起点和终点对应WIP曲线，定量划分正常场景。Scenario 2: Normal scenario. In this scenario, the utilization rate of each device is relatively stable, and the MCT fluctuates. The starting point of the scenario is the point with the maximum slope of the MCT curve, and the end point of the scenario is the point with the minimum slope of the MCT curve. Quantitatively divide normal scenes.

场景3：重载场景。在该场景下，各设备利用率较为平稳，MCT曲线一直上升，没有下降，WIP曲线上升，该场景开始的起点为MCT曲线斜率最小点，根据该场景的起点和终点对应WIP曲线，定量划分重载场景。Scenario 3: Reload scenario. In this scenario, the utilization rate of each device is relatively stable, the MCT curve keeps rising and does not drop, and the WIP curve rises. The starting point of this scenario is the minimum slope of the MCT curve. According to the WIP curve corresponding to the starting and ending points of the scenario, quantitatively divide the load the scene.

根据三个场景对应的起始横坐标，定量得到三个场景划分的WIP值，为之后在线数据根据WIP具体数值划分场景奠定基础。According to the starting abscissas corresponding to the three scenarios, the WIP values of the three scenarios are quantitatively obtained, which lays the foundation for the subsequent online data to divide the scenarios according to the specific WIP values.

1a)绘制映射图：在相同采样天数下，选取若干种调度规则下的产品平均加工周期MCT、在制品数WIP、设备Ma利用率U_a、设备Mb利用率U_b、设备Mc利用率U_c、设备Md利用率U_d、设备Me利用率U_e进行折线图的绘制。此处以采样天数为26、Ma、Mb、Mc、Md、Me设备调度规则分别EDD、EDD、SRPT、SRPT、EDD、EDD为例，绘制出图3、图4和图5；1a) Draw a map: Under the same sampling days, select the average product processing cycle MCT, the number of WIP, the utilization rate U a of equipment Ma, the utilization rate U _b of equipment Mb, and the utilization rate U _c of equipment Mc under several scheduling rules _. , the utilization rate U _d of the equipment Md and the utilization rate U _e of the equipment Me to draw a line graph. Here, taking the sampling days of 26, Ma, Mb, Mc, Md, and Me equipment scheduling rules as EDD, EDD, SRPT, SRPT, EDD, and EDD as an example, Figure 3, Figure 4, and Figure 5 are drawn;

1b)场景定量划分：当样本id为6时，图5的设备利用率趋于稳定，此时图4的WIP值大致为60；当样本id为8时，图3的MCT波动阶段斜率最大，此时图4的WIP在200上下波动；当样本id为16时，MCT只升不降，WIP在750上下波动。场景划分定义指标为WIP，因此，当WIP处于[60,200)时判定为轻载场景，当WIP处于[200,750)时判定为正常场景，当WIP处于[750，∞)时判定为重载场景。1b) Quantitative division of scenarios: when the sample id is 6, the equipment utilization in Figure 5 tends to be stable, and the WIP value in Figure 4 is roughly 60; when the sample id is 8, the slope of the MCT fluctuation phase in Figure 3 is the largest, At this time, the WIP in Figure 4 fluctuates around 200; when the sample id is 16, the MCT only rises and does not fall, and the WIP fluctuates around 750. The definition index of scene division is WIP. Therefore, when WIP is in [60, 200), it is determined as a light-loaded scene, when WIP is in [200, 750), it is determined as a normal scene, and when WIP is in [750, ∞), it is determined as an overloaded scene.

步骤3，对调度规则符号进行编码，使其被网络所接收，并对原始数据通过min-max进行归一化处理，去其量纲，将原始数据变为[0,1]之间的小数。编码表如表4所示。Step 3: Encode the scheduling rule symbol so that it can be received by the network, and normalize the original data through min-max, remove its dimension, and change the original data to a decimal between [0,1] . The coding table is shown in Table 4.

表4调度规则编码表Table 4 Scheduling rule coding table

调度规则scheduling rules 编码coding EDDEDD [0,1][0,1] SRPTSRPT [1,0][1,0] CRCR [1,1][1,1]

步骤4，将划分为正常负载场景的数据，采用DNN算法，以投料信息D、MaMb调度规则、McMd调度规则、Me调度规则、系统状态S中的在制品数、MaMb缓冲区队长、McMd缓冲区队长、Me缓冲区队长、缓冲区总队长、总移动步数为输入，表征性能指标的P为输出，获取投料信息、调度规则、系统状态与性能指标的匹配关系，即主预测模型。Step 4: Use the DNN algorithm to divide the data into normal load scenarios, and use the DNN algorithm to calculate the feed information D, the MaMb scheduling rule, the McMd scheduling rule, the Me scheduling rule, the number of work-in-progress in the system state S, the MaMb buffer length, and the McMd buffer. The captain, Me buffer captain, total buffer captain, and total moving steps are input, and P, which represents the performance index, is the output, and the matching relationship between feeding information, scheduling rules, system status and performance indicators is obtained, that is, the main prediction model.

采用所述深度神经网络算法学习获得主预测模型的具体过程为：The specific process of using the deep neural network algorithm to learn and obtain the main prediction model is as follows:

2a)将网络的权值矩阵W^η、V和阈值矩阵θ^q、γ初始化，值取[-1,1]之间的随机数。设置训练次数计数器h。2a) Initialize the weight matrices W ^η , V and threshold matrices θ ^q , γ of the network, and the values are random numbers between [-1, 1]. Set the training times counter h.

2b)选取第k对数据样本，计算各隐藏层的输出结果：2b) Select the kth pair of data samples, and calculate the output results of each hidden layer:

2c)确定输出层的输出结果：2c) Determine the output of the output layer:

2d)计算输出层与隐藏层各个神经元的校正误差

2d) Calculate the correction error of each neuron in the output layer and the hidden layer

2e)计算第η层隐藏层的各层误差

2e) Calculate the error of each layer of the nth hidden layer

2f)根据选取的梯度下降优化器为Adadelta Optimizer2f) According to the selected gradient descent optimizer is Adadelta Optimizer

2g)设置loss函数：2g) Set the loss function:

is the number of samples of the output variable.

2h)设置学习率为0.05，设定网络学习次数50000次；2h) Set the learning rate to 0.05 and set the network learning times to 50,000 times;

2i)根据网络中定义的损失函数计算输出总误差，若损失函数值小于设定的数值，则跳出训练程序，若不满足，则调整各层权值和阈值，继续训练。2i) Calculate the total output error according to the loss function defined in the network. If the loss function value is less than the set value, jump out of the training program. If not, adjust the weights and thresholds of each layer and continue training.

2j)检查训练次数计数器h是否达到网络设定值，若未达到，则训练次数计数器h+1；若达到，则跳出程序，重新设置隐藏层层数、神经元个数以及激励函数。2j) Check whether the training times counter h reaches the network setting value, if not, then the training times counter h+1; if it does, jump out of the program and reset the number of hidden layers, the number of neurons and the excitation function.

最终通过网格搜索法确定网络A的网络层数为6层，网络结构为10*10*10*9*8*6，隐藏层1-4层激励函数为tanh，输出层激励函数为sigmoid。Finally, the grid search method is used to determine that the network layer of network A is 6 layers, the network structure is 10*10*10*9*8*6, the activation function of the hidden layers 1-4 is tanh, and the excitation function of the output layer is sigmoid.

步骤5，根据主预测网络的网络信息，通过迁移学习的方法，构建轻载场景网络以及重载场景网络，步骤如下：Step 5: According to the network information of the main prediction network, a light-load scenario network and a heavy-load scenario network are constructed by the method of transfer learning. The steps are as follows:

步骤K1：将主预测网络的输入层神经元个数赋值给新的轻载/重载网络；Step K1: Assign the number of neurons in the input layer of the main prediction network to the new light load/heavy load network;

步骤K2：将主预测网络除最后一层隐藏层的各层隐藏层神经元个数赋值给新的轻载/重载网络作为前n-1层的神经元个数；Step K2: Assign the number of neurons in each hidden layer of the main prediction network except the last hidden layer to the new light-load/heavy-load network as the number of neurons in the first n-1 layers;

步骤K3：将主预测网络除最后一层隐藏层的各层隐藏层权值阈值信息赋值给新的轻载/重载网络作为初始化数据；Step K3: Assign the weight threshold information of each hidden layer of the main prediction network except the last hidden layer to the new light load/heavy load network as initialization data;

步骤K4：将主预测网络除最后一层隐藏层的各层隐藏层激励函数赋值给新的轻载/重载网络；Step K4: Assign the activation function of each hidden layer of the main prediction network except the last hidden layer to the new light load/heavy load network;

步骤K5：通过网格搜索法调整最后一层隐藏层神经元个数，将轻载/重载场景数据输入进行训练。Step K5: Adjust the number of neurons in the last hidden layer by grid search method, and input the light load/heavy load scene data for training.

本实施例中，主预测网络A共有4层隐藏层，将网络A的前3层隐藏层迁移到网络B和网络C的前3层隐藏层，通过调节最后一层隐藏层的神经元个数，训练获得网络B和网络C，最终确定网络B的网络层数为6层，网络结构为10*10*10*9*9*6，隐藏层1-4层激励函数为tanh，输出层激励函数为sigmoid；网络C的网络层数为6层，网络结构为10*10*10*9*7*6，隐藏层1-4层激励函数为tanh，输出层激励函数为sigmoid。三个网络合而为一，构成多场景多指标预测模型，模型结构如表5所示。In this embodiment, the main prediction network A has a total of 4 hidden layers, the first 3 hidden layers of the network A are migrated to the first 3 hidden layers of the network B and the network C, and the number of neurons in the last hidden layer is adjusted by adjusting the number of neurons in the last hidden layer. , train to obtain network B and network C, and finally determine that the number of network layers of network B is 6 layers, the network structure is 10*10*10*9*9*6, the hidden layer 1-4 layer excitation function is tanh, and the output layer excitation The function is sigmoid; the network layer of network C is 6 layers, the network structure is 10*10*10*9*7*6, the activation function of the hidden layers 1-4 is tanh, and the excitation function of the output layer is sigmoid. The three networks are combined into one to form a multi-scenario multi-index prediction model. The model structure is shown in Table 5.

表5多场景多指标预测模型结构Table 5 Multi-scenario multi-index prediction model structure

当在线数据进入该模型之前，先根据阈值进行场景识别，若识别为正常场景，则选择模型中的主预测网络进行预测；若识别为轻载场景，则选择模型中的轻载场景预测网络进行预测；若识别为重载场景，则选择模型中的重载场景预测网络进行预测。Before the online data enters the model, the scene is identified according to the threshold value. If it is identified as a normal scene, the main prediction network in the model is selected for prediction; if it is identified as a light-load scene, the light-load scene prediction network in the model is selected for prediction. Prediction; if it is identified as an overloaded scene, select the overloaded scene prediction network in the model for prediction.

分别将不分场景模型、单场景模型和多场景多性能指标预测模型应用于MiniFAB模型中，比较不分场景网络、单场景模型和多场景多性能指标预测模型的预测结果。The scenario-independent model, the single-scenario model and the multi-scenario multi-performance index prediction model are respectively applied to the MiniFAB model, and the prediction results of the scenario-independent network, the single-scenario model and the multi-scenario multi-performance indicator prediction model are compared.

不分场景模型和多场景多指标预测模型输出与测试样本平均值对比表如表6所示。Table 6 shows the comparison between the output of the scenario-independent model and the multi-scenario multi-index prediction model and the average value of the test sample.

表6不分场景模型和多场景多指标预测模型输出与测试样本平均值对比表Table 6. Comparison between the output of the non-scenario model and the multi-scenario multi-index prediction model and the average value of the test sample

在不分场景模型，即将数据混合训练得到一个整体的网络模型，最高相对误差达3.4％，最低误差为1.0％，在多场景多指标预测模型下，最高相对误差达1.5％，最低相对误差为0.15％，从输出平均值相对误差角度看，多场景多指标预测模型优于不分场景模型。In the scenario-independent model, that is, the data is mixed to train to obtain an overall network model, the highest relative error is 3.4%, and the lowest is 1.0%. In the multi-scenario and multi-index prediction model, the highest relative error is 1.5%, and the lowest relative error is 0.15%. From the perspective of the relative error of the output average, the multi-scenario multi-index prediction model is better than the no-scenario model.

单场景模型、不分场景模型和多场景多指标预测模型输出均方根误差平均值如表7所示。The average output root mean square error of the single-scenario model, the no-scenario model and the multi-scenario multi-index prediction model is shown in Table 7.

表7各模型均方根误差平均值表Table 7 The average value of the root mean square error of each model

从整体上看，多场景多指标预测模型精度更高，质量更好。On the whole, the multi-scenario multi-index prediction model has higher accuracy and better quality.

单场景模型、不分场景模型和多场景多指标预测模型生产率均方根误差、各设备利用率均方根误差如图6～图11所示。Figures 6 to 11 show the root mean square error of productivity and the root mean square error of each equipment utilization rate for the single-scenario model, the no-scenario model, and the multi-scenario multi-index prediction model.

由图6～图11可知，轻载单场景模型对重载场景的预测误差最大，对轻载场景的预测误差最小；重载单场景模型对重载场景的预测误差最小，对轻载场景的预测误差最大；正常单场景对正常场景的预测误差最小，对重载场景的预测误差最大；不分场景模型对重载场景误差最大，对正常场景的预测误差最小；而本发明提出的多场景多指标预测模型对三种场景的预测误差几乎持平，且均处于四种模型最优的位置。It can be seen from Figure 6 to Figure 11 that the light-load single-scenario model has the largest prediction error for the heavy-load scenario, and the smallest prediction error for the light-load scenario; the heavy-load single-scenario model has the smallest prediction error for the heavy-load scenario, and the prediction error for the light-load scenario is the smallest. The prediction error is the largest; the prediction error of the normal single scene is the smallest for the normal scene, and the prediction error for the overloaded scene is the largest; the model regardless of the scene has the largest error for the overloaded scene, and the prediction error for the normal scene is the smallest; The prediction errors of the multi-index prediction models for the three scenarios are almost the same, and they are all in the optimal position of the four models.

以上所述，仅为本发明的具体实施方式，但本发明的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本发明揭露的技术范围内，可轻易想到各种等效的修改或替换，这些修改或替换都应涵盖在本发明的保护范围之内。因此，本发明的保护范围应以权利要求的保护范围为准。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed by the present invention. Modifications or substitutions should be included within the protection scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims

1. A multi-scenario performance index prediction method for a semiconductor production line is characterized by comprising the following steps:

step 1: carrying out quantitative mapping on historical data of a production line, and carrying out data division according to scene quantification respectively corresponding to light load scenes, normal load scenes and heavy load scenes;

step 2: taking the divided data under the normal load scene as sample data, and combining a deep neural network algorithm with the performance prediction of a semiconductor production line to construct a prediction network under the normal load scene;

and step 3: based on the prediction network under the normal load scene, the prediction networks under the light load scene and the heavy load scene are further constructed, and the prediction networks under the light load scene, the normal load scene and the heavy load scene form a multi-scene prediction model;

and 4, step 4: after dividing the online data of the production line into different scene results according to a threshold value, selecting a corresponding network from the multi-scene prediction model for prediction to obtain a performance index prediction result;

the process of constructing the prediction network under the normal load scene in the step 2 comprises the following sub-steps:

step 201: coding a scheduling rule symbol in the sample data, and performing normalization processing by taking the scheduling rule symbol as input;

step 202: adopting a deep neural network to construct a prediction network under a normal load scene; combining a deep learning algorithm with the performance prediction of a production line, and obtaining the number of proper hidden layers, the number of neurons of each hidden layer and excitation functions of each layer by adopting a grid search method;

step 203: testing the network performance of the prediction network by adopting a test sample, comparing an output value corresponding to a prediction result obtained by the test sample after reverse normalization processing with an output value of the test sample, and judging whether the accuracy requirement is met;

step 204: if the prediction precision of the test result can meet the precision requirement, the prediction network under the normal load scene is successfully established, if the prediction precision of the test result cannot meet the precision requirement, the step 202 is returned, the number of hidden layers, the number of neurons of each hidden layer and excitation functions of each layer are selected again, and the model is trained again;

the deep learning algorithm in step 202 has the following inputs:

for a given training set of k different samples:

X^k＝{S^k，D^k，Ru^k，P^k|S^k∈R^∝，D^k∈R^β，Ru^k∈R^λ，P^k∈R^m}

wherein,

the state is the state of an intelligent workshop system;

feeding information for an intelligent workshop;

scheduling rules for the intelligent workshop;

for the current state S of the intelligent workshop system^kCurrent feeding information D^kCurrent scheduling rule Ru^kPerformance index after 1 day under the circumstances;

the output of the deep learning algorithm in step 202 is o_iWhere i is 1, …, m1, m1 indicates the dimension of the output value o, i.e. there are m1 outputs;

the accuracy requirement in step 204 is based on the root mean square error, and the description formula is as follows:

where m is the number of output variables, P is the input performance index, Y is the predicted performance index,

is the output variable sample number;

the process of constructing the prediction network under the light load scene and the heavy load scene in the step 3 comprises the following sub-steps:

step 301: assigning the number of neurons of an input layer of the prediction network under a normal load scene to the prediction network under light load and heavy load scenes;

step 302: assigning the number of neurons of each hidden layer except the last hidden layer of the prediction network under the normal load scene to the prediction network under the light load scene and the heavy load scene as the number of neurons of the corresponding front n-1 layer;

step 303: assigning the weight threshold information of each hidden layer except the last hidden layer of the prediction network under the normal load scene to the prediction network under the light load scene and the heavy load scene as initialization data;

step 304: assigning excitation functions of all hidden layers except the last hidden layer of the prediction network under a normal load scene to the prediction network under light load and heavy load scenes;

step 305: adjusting the number of neurons in the last hidden layer by a network search method, inputting the data under the light load and heavy load scenes divided in the step 1 into a prediction network under the light load and heavy load scenes for training, and obtaining the prediction network under the light load and heavy load scenes after the training is finished;

the step 1 comprises the following sub-steps:

step 101: collecting historical data of a production line for quantitative mapping under the same sampling days, and drawing a line graph;

step 102: dividing data according to curve change rules in each line graph and scene quantification corresponding to light load scenes, normal load scenes and heavy load scenes respectively, wherein in the corresponding curves, a starting point is a point corresponding to a total product value when the equipment utilization rate starts to enter a stable stage, and an end point is a point corresponding to the total product value when the slope of the curve of the average processing period of the product is maximum;

under a normal load scene, in the corresponding curve, the starting point is the point corresponding to the product value when the slope of the average processing period curve of the product is maximum, and the end point is the point corresponding to the product value when the slope of the average processing period curve of the product is minimum;

under a heavy load scene, the starting point in the corresponding curve is the point corresponding to the total product value when the slope of the average processing cycle curve of the product is minimum.

2. The method as claimed in claim 1, wherein the production line history data includes input quantity and output quantity, wherein the input quantity includes: sampling days, average processing period of products, average daily moving steps, total number of products in process, queue length of each buffer area, total queue length of the buffer areas and total moving steps; the output quantity comprises: average processing period of products and utilization rate of each device.

3. The method as claimed in claim 1, wherein the scheduling rule symbol in step 201 is encoded by using a one-hot encoding method so that it can be received by a network.

4. A system using the semiconductor production line oriented multi-scenario performance index prediction method of claim 1, characterized in that the system comprises:

the main prediction network construction module is used for combining a deep neural network algorithm and semiconductor production line performance prediction by taking the divided data under the normal load scene as sample data to construct a prediction network under the normal load scene;

the multi-scene prediction model construction module is used for further constructing prediction networks under light load and heavy load scenes based on the prediction networks under normal load scenes, and forming the prediction networks under the light load scenes, the normal load scenes and the heavy load scenes into a multi-scene prediction model;

and the production scene quantitative division module is used for carrying out quantitative mapping on the historical data of the production line and carrying out data division according to the scene quantification respectively corresponding to the light load scene, the normal load scene and the heavy load scene.