CN108364095B - Diagnosis method of molten steel quality in steelmaking production process based on data mining - Google Patents
Diagnosis method of molten steel quality in steelmaking production process based on data mining Download PDFInfo
- Publication number
- CN108364095B CN108364095B CN201810116387.5A CN201810116387A CN108364095B CN 108364095 B CN108364095 B CN 108364095B CN 201810116387 A CN201810116387 A CN 201810116387A CN 108364095 B CN108364095 B CN 108364095B
- Authority
- CN
- China
- Prior art keywords
- molten steel
- heats
- data
- temperature
- composition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06395—Quality analysis or management
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/04—Manufacturing
 
- 
        - Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/30—Computing systems specially adapted for manufacturing
 
Landscapes
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- General Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Educational Administration (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Manufacturing & Machinery (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Treatment Of Steel In Its Molten State (AREA)
- Investigating And Analyzing Materials By Characteristic Methods (AREA)
- General Factory Administration (AREA)
Abstract
Description
技术领域technical field
本发明涉及钢水质量诊断技术领域,特别是指一种基于数据挖掘的炼钢生产过程钢水质量诊断方法。The invention relates to the technical field of molten steel quality diagnosis, in particular to a method for diagnosing molten steel quality in a steelmaking production process based on data mining.
背景技术Background technique
目前我国钢铁生产的突出问题表现在常规钢材生产过剩而高品质钢材质量与国际先进水平存在一定差距往往需要靠进口来实现。当一些关键钢铁产品出现问题时需要进行质量定位及诊断。因此迫切需要利用现代数据挖掘与冶金机理相结合的方法实现全流程工艺质量诊断。国内外不少专家对其做出了研究。由北京科技大学高效轧制国家工程研究中心研发的冶金全流程工艺质量在线监控和离线分析系统针对汽车板基板质量管控提出对全流程工艺质量参数进行全面采集、监控和在线评级,并在出现问题时进行全流程快速追溯、分析、优化和改进的需求。北京科技大学彭开香等人提出了一种带钢热连轧质量的故障诊断方法及装置,解决了现有技术中产品质量往往是由较为熟练的操作工人凭借自己的经验控制,使得一旦发生故障,仅靠延迟滞后的反馈控制策略很难保证产品质量的问题。At present, the outstanding problems of my country's iron and steel production are manifested in the overproduction of conventional steel and a certain gap between the quality of high-quality steel and the international advanced level, which often needs to be realized by importing. When some key steel products have problems, quality positioning and diagnosis are required. Therefore, it is urgent to use the method combining modern data mining and metallurgical mechanism to realize the whole process process quality diagnosis. Many domestic and foreign experts have made research on it. The online monitoring and offline analysis system for metallurgical whole process quality developed by the National Engineering Research Center of High Efficiency Rolling of University of Science and Technology Beijing proposes to comprehensively collect, monitor and online rate the process quality parameters of the whole process for the quality control of automobile plate and substrate. It is necessary to quickly trace, analyze, optimize and improve the whole process. Peng Kaixiang of Beijing University of Science and Technology and others proposed a fault diagnosis method and device for the quality of hot continuous strip rolling, which solves the problem that the product quality in the prior art is often controlled by more skilled operators with their own experience, so that once a fault occurs, It is difficult to guarantee the quality of products only by the feedback control strategy of delay and lag.
然而目前的质量诊断技术很难实现钢铁产品在生产过程中的诊断和预警,仅仅是在某浇次生产结束后对其进行抽样检查等到下一浇次再进行质量调整,这样很难防止质量问题的批次出现,对提高生产效率带来很大的局限。同时目前冶金行业的质量诊断大多都是针对铸坯的质量诊断,很少有针对铁水预处理、转炉炼钢和炉外精炼三个生产过程工序的质量诊断。同时从诊断对象的角度分析在钢铁行业内大部分的质量诊断都是在假设过程变量相互独立的前提下提出的,而本发明采取的是一种针对依赖变量的综合的质量诊断技术,认为温度与成分之间是存在一定联系的,是一种多因素综合考虑的方法。However, the current quality diagnosis technology is difficult to realize the diagnosis and early warning of iron and steel products in the production process. It is only after the production of a certain pouring time that it is sampled and checked until the next pouring time, and then the quality adjustment is carried out, which is difficult to prevent quality problems. The emergence of batches has brought great limitations to improving production efficiency. At the same time, most of the quality diagnosis in the metallurgical industry is for the quality diagnosis of the cast slab, and there are few quality diagnosis for the three production processes of molten iron pretreatment, converter steelmaking and out-of-furnace refining. At the same time, most of the quality diagnosis in the iron and steel industry is put forward on the premise that the process variables are independent of each other from the perspective of the diagnosis object, while the present invention adopts a comprehensive quality diagnosis technology for dependent variables. There is a certain relationship with the ingredients, and it is a method of comprehensive consideration of multiple factors.
发明内容SUMMARY OF THE INVENTION
本发明要解决的技术问题是提供一种基于数据挖掘的炼钢生产过程钢水质量诊断方法。The technical problem to be solved by the present invention is to provide a method for diagnosing molten steel quality in the steelmaking production process based on data mining.
该方法包括步骤如下:The method includes the following steps:
(1)数据收集与筛选:(1) Data collection and screening:
利用炼钢厂二级系统,收集炼钢厂生产工艺数据,利用炉次号,串联铁水预处理、转炉炼钢、LF炉精炼、RH炉精炼四个工序的生产数据,并生成联合表,筛选出其中的有效数据;Use the secondary system of the steel plant to collect the production process data of the steel plant, use the heat number, the production data of the four processes of series hot metal pretreatment, converter steelmaking, LF furnace refining, and RH furnace refining, and generate a joint table to filter out valid data in it;
(2)数据的标准化:(2) Standardization of data:
对步骤(1)中筛选出的有效数据进行标准化处理;Standardize the valid data screened out in step (1);
(3)基于聚类的各工序钢水或铁水成分及温度控制目标确定:(3) Determine the composition and temperature control target of molten steel or molten iron in each process based on clustering:
从步骤(2)标准化处理后的数据中分别筛选出不同工序处理的炉次集合,并进一步分别筛选出各个工序出站成分及温度满足操作规程的所有炉次集合;分析钢水或铁水温度和成分参数对不同工序处理的影响规律,选取对钢水或铁水质量有重要影响的钢水或铁水温度和成分参数作为聚类因素,对筛选出的各个工序出站成分及温度满足操作规程的所有炉次集合内的炉次数据利用K-Means聚类算法来聚类处理,确定各工序钢水或铁水成分及温度控制的目标;From the standardized data in step (2), screen out the sets of heats processed by different processes, and further screen out all sets of heats whose outbound components and temperatures of each process meet the operating rules; analyze the temperature and composition of molten steel or molten iron The influence law of parameters on the treatment of different processes, the temperature and composition parameters of molten steel or molten iron that have an important impact on the quality of molten steel or molten iron are selected as clustering factors, and the selected outbound components and temperatures of each process are set for all heats that meet the operating regulations. The heat data inside is clustered by K-Means clustering algorithm to determine the target of molten steel or molten iron composition and temperature control in each process;
(4)钢水或铁水类别划分及工序操作工艺合理模式提取:(4) Classification of molten steel or molten iron and extraction of a reasonable mode of process operation technology:
步骤(3)中筛选出的各个工序出站成分及温度满足操作规程的所有炉次集合为命中炉次,其余炉次为非命中炉次,分析比较命中炉次与非命中炉次的不同工序处理工艺,总结命中炉次的处理工艺特征和非命中炉次的处理工艺特征,命中炉次的处理工艺即为该类钢水的合理的相应工序工艺模式,对不同工序的工艺模式分析、收集,并录入数据库管理;All heats screened out in step (3) where the outbound components and temperatures of each process meet the operating rules are set as hit heats, and the rest of the heats are non-hit heats. Analyze and compare the different processes between the hit heats and the non-hit heats. Processing technology, summarize the processing technology characteristics of the hit heat and the processing technology characteristics of the non-hit heat, the processing technology of the hit heat is the reasonable corresponding process process mode of this type of molten steel, analyze and collect the process modes of different processes, And enter the database management;
(5)工序钢水质量预测:(5) Prediction of molten steel quality in process:
通过二级系统收集到某个炉次的进站钢水或铁水温度和成分,如果超出相应工序进站控制目标,但是在可控范围(即所有命中炉次往前追溯到进站时刻的成分温度信息的集合即为该工序的可控范围)之内,则根据钢水或铁水的温度和成分取值,将其划入步骤(3)中的处理后的相应类别中,然后根据该类别的命中率预测该炉次钢水或铁水经过相应工序处理后的钢水温度和成分达到终点控制目标的概率,数值等于命中率;The temperature and composition of incoming molten steel or molten iron for a certain heat are collected through the secondary system. If it exceeds the control target of entering the station for the corresponding process, but is within the controllable range (that is, the composition temperature of all hit heats is traced back to the time of entering the station) The set of information is within the controllable range of the process), then according to the temperature and composition of molten steel or molten iron, it is classified into the corresponding category after processing in step (3), and then according to the hit of the category Predict the probability that the temperature and composition of the molten steel in this heat or molten iron after the corresponding process will reach the end point control target, the value is equal to the hit rate;
(6)工艺操作方案提供:(6) The process operation plan provides:
根据步骤(5)中对钢水质量的预测,为相应工序的相应炉次提供工艺操作模式方案,保证最终达到控制目标要求;According to the prediction of molten steel quality in step (5), a process operation mode scheme is provided for the corresponding heat of the corresponding process to ensure that the control target requirements are finally achieved;
(7)预警:(7) Warning:
若某炉次的进站钢水温度和成分超出可控范围,则系统报警。If the temperature and composition of incoming molten steel in a heat exceeds the controllable range, the system will alarm.
其中,步骤(1)中有效数据为在整个炼钢过程中某炉次的数据从KR预处理到精炼结束的温度、成分、各种工艺操作及参数信息都完整且在合理范围内的数据。Among them, the valid data in step (1) is the data of a certain heat in the whole steelmaking process, from KR preprocessing to the end of refining, the temperature, composition, various process operations and parameter information are complete and within a reasonable range.
步骤(2)中标准化处理具体为利用Min-Max标准化方法将所有聚类因素化为标量且都映射在区间[0,1]中。The standardization process in step (2) is to use the Min-Max standardization method to convert all clustering factors into scalars and map them in the interval [0,1].
本发明的上述技术方案的有益效果如下:The beneficial effects of the above-mentioned technical solutions of the present invention are as follows:
本发明方法填补了现有研究大多只是针对铸坯质量研究的空缺,同时还可以在炼钢生产过程中对钢水质量进行及时的调控,从而保证了生产过程的稳定性和窄窗口控制,并且能有效的提高产品质量。The method of the invention fills the gap that most of the existing researches only focus on the quality of the cast slab, and at the same time, the quality of molten steel can be regulated in time during the steelmaking production process, thereby ensuring the stability of the production process and narrow window control, and can Effectively improve product quality.
附图说明Description of drawings
图1为本发明的基于数据挖掘的炼钢生产过程钢水质量诊断方法工艺流程图;Fig. 1 is the technical flow chart of the molten steel quality diagnosis method of steelmaking production process based on data mining of the present invention;
图2为本发明实施例提供的炼钢生产过程中钢水温度和成分的系统诊断方法在实际应用时的流程图;Fig. 2 is the flow chart of the system diagnosing method of molten steel temperature and composition in the steelmaking production process provided by the embodiment of the present invention during practical application;
图3为本发明实施例提供的对生产数据进行聚类划分时使用的K-Means聚类算法的流程图;3 is a flowchart of the K-Means clustering algorithm used when the production data is clustered and divided according to an embodiment of the present invention;
图4为本发明实施例提供的决策树算法流程图。FIG. 4 is a flowchart of a decision tree algorithm provided by an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。In order to make the technical problems, technical solutions and advantages to be solved by the present invention more clear, the following will be described in detail with reference to the accompanying drawings and specific embodiments.
本发明提供一种基于数据挖掘的炼钢生产过程钢水质量诊断方法,如图1所示,该方法包括步骤如下:The present invention provides a method for diagnosing molten steel quality in a steel-making production process based on data mining. As shown in FIG. 1 , the method includes the following steps:
(1)数据收集与筛选:(1) Data collection and screening:
利用炼钢厂二级系统,收集炼钢厂生产工艺数据,利用炉次号,串联铁水预处理、转炉炼钢、LF炉精炼、RH炉精炼四个工序的生产数据,并生成联合表,筛选出其中的有效数据;Use the secondary system of the steel plant to collect the production process data of the steel plant, use the heat number, the production data of the four processes of series hot metal pretreatment, converter steelmaking, LF furnace refining, and RH furnace refining, and generate a joint table to filter out valid data in it;
(2)数据的标准化:(2) Standardization of data:
对步骤(1)中筛选出的有效数据进行标准化处理;Standardize the valid data screened out in step (1);
(3)基于聚类的各工序钢水或铁水成分及温度控制目标确定:(3) Determine the composition and temperature control target of molten steel or molten iron in each process based on clustering:
从步骤(2)标准化处理后的数据中分别筛选出不同工序处理的炉次集合,并进一步分别筛选出各个工序出站成分及温度满足操作规程的所有炉次集合;分析钢水或铁水温度和成分参数对不同工序处理的影响规律,选取对钢水或铁水质量有重要影响的钢水或铁水温度和成分参数作为聚类因素,对筛选出的各个工序出站成分及温度满足操作规程的所有炉次集合内的炉次数据利用K-Means聚类算法来聚类处理,确定各工序钢水或铁水成分及温度控制的目标;From the standardized data in step (2), screen out the sets of heats processed by different processes, and further screen out all sets of heats whose outbound components and temperatures of each process meet the operating regulations; analyze the temperature and composition of molten steel or molten iron The influence law of parameters on the treatment of different processes, the temperature and composition parameters of molten steel or molten iron that have an important impact on the quality of molten steel or molten iron are selected as clustering factors, and the selected outbound components and temperatures of each process are set for all heats that meet the operating regulations. The heat data inside is clustered by K-Means clustering algorithm to determine the target of molten steel or molten iron composition and temperature control in each process;
(4)钢水或铁水类别划分及工序操作工艺合理模式提取:(4) Classification of molten steel or molten iron and extraction of a reasonable mode of process operation technology:
步骤(3)中筛选出的各个工序出站成分及温度满足操作规程的所有炉次集合为命中炉次,其余炉次为非命中炉次,分析比较命中炉次与非命中炉次的不同工序处理工艺,总结命中炉次的处理工艺特征和非命中炉次的处理工艺特征,命中炉次的处理工艺即为该类钢水的合理的相应工序工艺模式,对不同工序的工艺模式分析、收集,并录入数据库管理;All heats screened out in step (3) where the outbound components and temperatures of each process meet the operating rules are set as hit heats, and the rest of the heats are non-hit heats. Analyze and compare the different processes between the hit heats and the non-hit heats. Processing technology, summarize the processing technology characteristics of the hit heat and the processing technology characteristics of the non-hit heat, the processing technology of the hit heat is the reasonable corresponding process process mode of this type of molten steel, analyze and collect the process modes of different processes, And enter the database management;
(5)工序钢水质量预测:(5) Prediction of molten steel quality in process:
通过二级系统收集到某个炉次的进站钢水或铁水温度和成分,如果超出相应工序进站控制目标,但是在可控范围(所有命中炉次往前追溯到进站时刻的成分温度信息的集合)之内,则根据钢水或铁水的温度和成分取值,将其划入步骤(3)中的处理后的相应类别中,然后根据该类别的命中率预测该炉次钢水或铁水经过相应工序处理后的钢水温度和成分达到终点控制目标的概率,数值等于命中率;The temperature and composition of incoming molten steel or molten iron for a certain heat are collected through the secondary system. If it exceeds the control target of entering the station for the corresponding process, but is within the controllable range (all hit heats are traced back to the composition temperature information at the time of entering the station) within the set), then according to the temperature and composition of molten steel or molten iron, classify it into the corresponding category after processing in step (3), and then predict the molten steel or molten iron in this heat according to the hit rate of the category. The probability that the temperature and composition of molten steel processed by the corresponding process reach the end point control target, the value is equal to the hit rate;
(6)工艺操作方案提供:(6) The process operation plan provides:
根据步骤(5)中对钢水质量的预测,为相应工序的相应炉次提供工艺操作模式方案,保证最终达到控制目标要求;According to the prediction of molten steel quality in step (5), a process operation mode scheme is provided for the corresponding heat of the corresponding process to ensure that the control target requirements are finally achieved;
(7)预警:(7) Warning:
若某炉次的进站钢水温度和成分超出可控范围,则系统报警。If the temperature and composition of incoming molten steel in a heat exceeds the controllable range, the system will alarm.
如图2所示,具体实施过程如下:As shown in Figure 2, the specific implementation process is as follows:
(1)数据收集与筛选(1) Data collection and screening
采集某一钢种钢水在炼钢厂内生产过程中的历史样本数据,所述数据包括KR铁水预处理数据、转炉炼钢数据、LF精炼工艺数据、RH精炼工艺数据、板坯连铸生产管控记录、钢包工艺查询记录,利用数据库以炉次号为依据将这些生产数据关联在一起,形成炼钢厂全流程联合数据表并筛选出其中的有效数据。主要研究钢水在各个跨进出站时对钢水质量有重要影响的成分及温度数据。针对不同钢种,从历史样本数据中筛选出该钢种的所有处理的炉次记为集合S,在所有炉次集合S中,出站成分及温度满足操作规程的所有炉次为命中炉次,记为集合P。Collect historical sample data of a certain type of molten steel in the production process of the steelmaking plant, the data includes KR molten iron pretreatment data, converter steelmaking data, LF refining process data, RH refining process data, slab continuous casting production control Records, ladle process query records, use the database to associate these production data based on the heat number, form a joint data table for the whole process of the steelmaking plant, and filter out the valid data. It mainly studies the composition and temperature data of molten steel that have an important influence on the quality of molten steel when it enters and exits each station. For different steel grades, all the processed heats of this steel grade are screened from the historical sample data and recorded as set S, and in the set S of all heats, all the heats whose outbound composition and temperature meet the operating regulations are hit heats , denoted as the set P.
(2)数据的标准化(2) Standardization of data
由于这些温度和成分数据在分布区间和单位上各有不同直接对其进行聚类时不合理的,所以要先对其进行标准化处理,这里采用的处理方式是Min-Max标准化处理方法,该方法是对原始数据进行线性变换。设minA和maxA分别为属性A的最大值和最小值,将A的一个原始值x通过Min-Max标准化映射在区间[0,1]中的值x`,其公式为:Since these temperature and composition data have different distribution intervals and units, it is unreasonable to directly cluster them, so they must be standardized first. The processing method used here is the Min-Max normalization processing method. This method is a linear transformation of the original data. Let minA and maxA be the maximum and minimum values of attribute A, respectively, and an original value x of A is mapped to the value x` in the interval [0,1] through Min-Max normalization. The formula is:
(3)基于聚类的各工序钢水(铁水)成分及温度控制目标确定(3) Determination of composition and temperature control target of molten steel (hot metal) in each process based on clustering
先将集合P往前追溯到进站时刻的信息再用K-Means聚类算法其成分及温度信息的历史样本数据进行聚类分析,如图3所示,将某一炉钢水的的成分及温度信息等多个变量以矩阵的方式进行储存,用矩阵的行来表示每一个炉次的钢水,列来表示该炉次钢水的信息,则n炉钢水p个信息的集合可以用一个n×p维的矩阵来表示,第i炉次钢水的第j个信息在矩阵中表示为xij,数据矩阵如下:First trace the information of the set P back to the time of entering the station, and then use the K-Means clustering algorithm to perform cluster analysis on the historical sample data of its composition and temperature information. As shown in Figure 3, the composition of a certain furnace of molten steel and Multiple variables such as temperature information are stored in the form of a matrix. The rows of the matrix are used to represent the molten steel of each heat, and the columns are used to represent the information of the molten steel of this heat. Then the set of p pieces of information about the molten steel of n furnaces can be represented by an n× The p-dimensional matrix is represented, and the jth information of the molten steel in the ith heat is represented as x ij in the matrix, and the data matrix is as follows:
然后用相异度矩阵来储存这些同一钢种不同炉次之间的差异性,n个炉次钢水的相异度矩阵表示为n×n维的矩阵,用d(A,B)来表示A与B的相异性,则含有n个炉次的集合X={x1,x2,…,xn}的相异度矩阵表示如下:Then the dissimilarity matrix is used to store the differences between different heats of the same steel grade. The dissimilarity matrix of n heats of molten steel is represented as an n×n-dimensional matrix, and d(A, B) is used to represent A The dissimilarity with B, the dissimilarity matrix of the set X={x 1 ,x 2 ,...,x n } containing n heats is expressed as follows:
在这里,d(xi,xj)为某种相似性度量函数,当xi相似或相近时d(xi,xj)的值接近0,而当d(xi,xj)值较大时,代表了炉次xi和xj有很大差异。目前最常用的相似性度量函数为欧式距离,定义为:Here, d(x i ,x j ) is a similarity measure function, when x i is similar or similar, the value of d(x i ,x j ) is close to 0, and when the value of d(x i ,x j ) is close to 0 When it is larger, it means that the heats x i and x j are very different. The most commonly used similarity measure function is Euclidean distance, which is defined as:
xi和xj代表任意两炉钢水,p为钢水的变量数,n为炉次数。x i and x j represent any two furnaces of molten steel, p is the variable number of molten steel, and n is the number of furnaces.
根据历史样本数据确定数据集X={x1,x2,…,xn}以及聚类数目;Determine the dataset X={x 1 ,x 2 ,...,x n } and the number of clusters according to the historical sample data;
①初始化:随机指定k个聚类中心(m1,m2,…,mk);①Initialization: randomly assign k cluster centers (m 1 , m 2 ,...,m k );
②分配xi:对每一个样本xi,找到离它最近的聚类中心,并将其分配到该类;②Assign xi : For each sample xi , find the cluster center closest to it and assign it to this class;
③重新计算各簇中心: ③ Recalculate the center of each cluster:
④计算偏差: ④Calculation deviation:
⑤判断收敛:如果J值收敛,则算法终止;否则,返回第二步。⑤ Judgment convergence: if the J value converges, the algorithm terminates; otherwise, return to the second step.
⑥获得模态:通过反复的运算直至收敛即可得到k个模态⑥ Obtaining modes: k modes can be obtained by repeated operations until convergence
这k个模态就对应了k个聚类中心,以这k个中心作为所有数据集合S的聚类中心再进行一次聚类,这样就把所有生产数据分为了k类,针对每一类数据挖掘其工序结束后的合格率,认为命中率高类的集合即为该工序进站时刻的操作规程,该类钢水的聚类中心的钢水温度值m和各成分值ni为控制最优值,根据冶炼钢种的要求,决定终点目标范围ΔT`,再结合工序间钢水温降值ΔT,则上一工序终点的目标温度为m+ΔT±ΔT`。根据冶炼钢种的要求,决定终点目标范围Δn,而工序间成分并不会发生太大变化,则上一工序终点的目标成分为ni+Δn。These k modes correspond to k cluster centers, and the k centers are used as the cluster centers of all data sets S to perform clustering again, so that all production data are divided into k categories, and for each type of data The qualified rate after the end of the process is excavated, and it is considered that the set with high hit rate is the operation procedure when the process enters the station, and the molten steel temperature value m and each component value n i of the cluster center of this type of molten steel are the optimal control values. , according to the requirements of smelting steel grades, determine the target range of the end point ΔT`, and then combine the molten steel temperature drop value ΔT between processes, the target temperature at the end of the previous process is m+ΔT±ΔT`. According to the requirements of smelting steel grades, the target range Δn of the end point is determined, and the composition does not change much between processes, so the target composition at the end point of the previous process is n i + Δn.
(4)钢水(铁水)类别划分及工序操作工艺合理模式提取(4) Classification of molten steel (hot metal) and extraction of reasonable models of process operation technology
上一步骤得到了k类钢水,在每一类钢水中,都有部分炉次属于命中炉次,所有命中炉次组成的集合即为该钢水的在该工序的可控范围,计算每个类中命中炉次占该类所有炉次的比例,记为该工序进站钢水该类别的命中率j。In the previous step, k types of molten steel were obtained. In each type of molten steel, some of the heats belong to the hit heats. The set of all the hit heats is the controllable range of the molten steel in this process. The proportion of the middle hit heats in all the heats of this type is recorded as the hit rate j of this type of molten steel entering the station in this process.
分析比较命中炉次与非命中炉次在该工序处理工艺的不同,总结提炼出命中炉次的处理工艺特征。这个过程采用的是决策树方法来完成的:Analyze and compare the difference between the hit heat and the non-hit heat in this process, and summarize and extract the processing technology characteristics of the hit heat. This process is done using the decision tree method:
如图4所示,利用决策树方法对连续属性进行处理,假定连续属性a在样本集D上出现n个不同的取值,合格与不合格样本所占的比例为pk(k=1,2,…,|y|)As shown in Figure 4, the continuous attribute is processed by the decision tree method. Assuming that the continuous attribute a has n different values in the sample set D, the proportion of qualified and unqualified samples is p k (k=1, 2,…,|y|)
①将某个连续属性在样本集上出现的n个不同的取值从小到大进行排序,记为{a1,a2,...,an}① Sort the n different values of a continuous attribute that appear in the sample set from small to large, denoted as {a 1 ,a 2 ,...,a n }
②将某个属性相邻两个取值ai、ai+1之间的中点作为可能的分裂点t,将数据集分为两部分,计算每个可能的分裂点的信息增益:②The midpoint between two adjacent values a i and a i+1 of an attribute As a possible split point t, divide the dataset into two parts and calculate the information gain for each possible split point:
其中信息熵 where information entropy
③选择修正后信息增益最大的分裂点作为该特征的最佳分裂点③ Select the split point with the largest information gain after correction as the best split point for the feature
④计算最佳分裂点的信息增益率并进行修正后作为该属性的信息增益④ Calculate the information gain rate of the best split point and modify it as the information gain of the attribute
其中 in
⑤比较各属性的信息增益率来构造决策树,增益率大的作为根结点,依次将决策树延伸下去并对决策树进行剪枝处理⑤Comparing the information gain rate of each attribute to construct a decision tree, the one with the largest gain rate is used as the root node, and the decision tree is extended in turn and the decision tree is pruned.
命中炉次的处理工艺即为该类钢水在该工序的合理处理工艺模式。将所有k类钢水在某工序的处理工艺模式分析、收集,并录入数据库管理。The treatment process of the hit heat is the reasonable treatment process mode of this type of molten steel in this process. Analyze, collect and record the processing mode of all k-type molten steel in a certain process, and enter it into database management.
(5)工序钢水质量预测(5) Prediction of molten steel quality in process
通过二级系统收集到某个炉次在某个工序的进站钢水温度和成分,如果超出该工序进站控制目标,但是在可控范围之内,则根据其温度和成分取值,将其划入k个类别中的某一类,然后根据该类别的命中率预测该炉次钢水经过RH处理后的钢水温度和成分达到终点控制目标的概率,数值等于命中率。The incoming molten steel temperature and composition of a certain heat in a certain process are collected through the secondary system. If it exceeds the incoming control target of the process, but is within the controllable range, the temperature and composition will be taken according to the value of the temperature and composition. It is classified into one of the k categories, and then predicts the probability that the molten steel temperature and composition of the molten steel after RH treatment in this heat will reach the end point control target according to the hit rate of this category, and the value is equal to the hit rate.
(6)工艺操作方案提供(6) Provide process operation plan
根据某工序的k类进站钢水对应的工艺模式,为该炉次提供合理的工艺操作模式方案,指导工序进行合理的处理,保证最终达到控制目标要求。According to the process mode corresponding to the k-type incoming molten steel of a certain process, a reasonable process operation mode scheme is provided for this heat, and the process is guided to carry out reasonable treatment to ensure that the control target requirements are finally achieved.
(7)预警(7) Early warning
若某炉次的某工序进站钢水温度和成分超出可控范围,则系统报警,警告工序该炉次钢水的温度和成分范围,超出历史数据中所有命中炉次的温度和成分范围,其在该工序处理的命中率很低,需要重点关注,选择非常规处理工艺。If the temperature and composition of the incoming molten steel in a certain process of a certain heat exceeds the controllable range, the system will alarm, warning the process that the temperature and composition of the molten steel in this heat exceeds the temperature and composition range of all hit heats in the historical data. The hit rate of this process is very low, and it is necessary to focus on it and choose an unconventional processing process.
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above are the preferred embodiments of the present invention. It should be pointed out that for those skilled in the art, without departing from the principles of the present invention, several improvements and modifications can be made. These improvements and modifications It should also be regarded as the protection scope of the present invention.
Claims (2)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN201810116387.5A CN108364095B (en) | 2018-02-06 | 2018-02-06 | Diagnosis method of molten steel quality in steelmaking production process based on data mining | 
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN201810116387.5A CN108364095B (en) | 2018-02-06 | 2018-02-06 | Diagnosis method of molten steel quality in steelmaking production process based on data mining | 
Publications (2)
| Publication Number | Publication Date | 
|---|---|
| CN108364095A CN108364095A (en) | 2018-08-03 | 
| CN108364095B true CN108364095B (en) | 2022-05-13 | 
Family
ID=63004547
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date | 
|---|---|---|---|
| CN201810116387.5A Active CN108364095B (en) | 2018-02-06 | 2018-02-06 | Diagnosis method of molten steel quality in steelmaking production process based on data mining | 
Country Status (1)
| Country | Link | 
|---|---|
| CN (1) | CN108364095B (en) | 
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN112685460B (en) * | 2020-12-08 | 2024-06-04 | 联峰钢铁(张家港)有限公司 | Standard analysis and prediction method for steelmaking process temperature | 
| CN115169450A (en) * | 2022-06-30 | 2022-10-11 | 马鞍山钢铁股份有限公司 | Steel grade classification method | 
| CN116127345B (en) * | 2022-12-23 | 2023-11-14 | 北京科技大学 | Converter steelmaking process model design method based on deep clustering generative adversarial network | 
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN101504544A (en) * | 2008-02-01 | 2009-08-12 | 霍尼韦尔国际公司 | Methods and apparatus for an oxygen furnace quality control system | 
| CN103882176A (en) * | 2014-03-25 | 2014-06-25 | 东北大学 | On-line dynamic optimization control method for converter steelmaking process based on data driving | 
| CN104630410A (en) * | 2015-02-10 | 2015-05-20 | 东北大学 | Real-time dynamic converter steelmaking quality prediction method based on data analysis | 
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| US20100292825A1 (en) * | 2006-08-09 | 2010-11-18 | Auckland Uniservices Limited | Process control of an industrial plant | 
- 
        2018
        - 2018-02-06 CN CN201810116387.5A patent/CN108364095B/en active Active
 
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN101504544A (en) * | 2008-02-01 | 2009-08-12 | 霍尼韦尔国际公司 | Methods and apparatus for an oxygen furnace quality control system | 
| CN103882176A (en) * | 2014-03-25 | 2014-06-25 | 东北大学 | On-line dynamic optimization control method for converter steelmaking process based on data driving | 
| CN104630410A (en) * | 2015-02-10 | 2015-05-20 | 东北大学 | Real-time dynamic converter steelmaking quality prediction method based on data analysis | 
Non-Patent Citations (1)
| Title | 
|---|
| 《转炉-连铸区段精细化控制节能潜力研究》;冯凯 等;《重庆大学学报》;20131130;第44-49页 * | 
Also Published As
| Publication number | Publication date | 
|---|---|
| CN108364095A (en) | 2018-08-03 | 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| CN104630410B (en) | A kind of pneumatic steelmaking quality real-time dynamic forecast method based on data parsing | |
| CN108364095B (en) | Diagnosis method of molten steel quality in steelmaking production process based on data mining | |
| CN115630839B (en) | Intelligent feedback production regulation and control system based on data mining | |
| CN114329940A (en) | Continuous casting billet quality prediction method based on extreme learning machine | |
| CN108469180A (en) | The method for building up of sintering end point forecasting system based on big data and machine learning | |
| CN108956041B (en) | Sintering machine air leakage fault online diagnosis method based on single-side detection method of frequency spectrum characteristics | |
| CN115815345A (en) | Mechanism collaborative prediction method and system for predicting the mechanical properties of hot-rolled strip steel in the whole process | |
| CN107619894A (en) | A kind of blast furnace operating type of furnace online management method | |
| Shi et al. | Process metallurgy and data-driven prediction and feedback of blast furnace heat indicators | |
| CN114764547A (en) | Heating furnace steel billet temperature prediction method and device integrating mechanism and deep learning | |
| CN114841277A (en) | A kind of converter tapping alloy yield prediction method and system | |
| CN119289672B (en) | Intelligent control system and method for smelting furnace | |
| Dong et al. | Transfer Learning Deep EDC-Tabnet for Predicting End-Point of BOF Steelmaking Process with Small Samples | |
| Wang et al. | Online monitoring method for multiple operating batch processes based on local collection standardization and multi‐model dynamic PCA | |
| CN112418522B (en) | Industrial heating furnace steel temperature prediction method based on three-branch integrated prediction model | |
| CN116759012A (en) | Molten iron quality prediction method based on batch tracking and space-time conversion | |
| JP3943841B2 (en) | Steel material estimation equipment | |
| CN118114563A (en) | Temperature prediction method and system for steel smelting equipment based on big data | |
| Shi et al. | Process Metallurgy and Genetic Algorithm Multi‐Objective Optimization of Blast Furnace Hearth Working State | |
| CN106778848A (en) | A kind of Zn finery method for diagnosing faults based on multiclass Probit models | |
| CN108123436A (en) | Voltage out-of-limit prediction model based on principal component analysis and multivariate regression algorithm | |
| Jun et al. | Abnormal Diagnosis of Dam Safety Monitoring Data Based on Ensemble Learning | |
| JP4365537B2 (en) | Steel product quality control equipment | |
| NL2030465B1 (en) | Continuous casting slab quality predicting method based on extreme learning machine | |
| CN112149966A (en) | Method for quality judgment based on steelmaking multi-data-source aggregation guidance process | 
Legal Events
| Date | Code | Title | Description | 
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |