ISIJ International
Online ISSN : 1347-5460
Print ISSN : 0915-1559
ISSN-L : 0915-1559
Instrumentation, Control and System Engineering
Fuel Ratio Optimization of Blast Furnace Based on Data Mining
Xiuyun Zhai Mingtong ChenWencong Lu
Author information
JOURNAL OPEN ACCESS FULL-TEXT HTML

2020 Volume 60 Issue 11 Pages 2471-2476

Details
Abstract

Despite the age of the process, the blast furnace (BF) ironmaking is still crucial to iron and steel industry. To improve the competitiveness of enterprises, fuel ratio (FR) in BF ironmaking process needs to be kept its lowest level possibly. In the work, a prediction model was established to predict FR of the BF by using feature selection and support vector regression (SVR). GA-SVR (genetic algorithm - SVR) method was employed to select the most informative five features from the candidate features.The experimental results indicated that the SVR model brought high learning precision and excellent prediction generalization ability. To explore and discover the laws of BF production, the influences of the five features on FR were discussed by simulation analysis of the model. All the calculations were performed on the computational platform of data mining developed by us. The work can provide guides for the operators on modulating input parameters in advance. The methods outlined here can provide valuable hints into revealing mechanisms of BF ironmaking process and realizing controlled production of BF with guidance of quantitative analysis methods.

1. Introduction

Iron and steel making, a typical high-energy-consuming, high-emission and high-pollution industry, is also the fundamental industry for the contemporary world.1,2,3,4) Blast furnace (BF) ironmaking is placed special attention due to consuming more than 70% of the energy in the whole production.5,6,7,8)

The inner of BF composes of five zones from top to bottom: throat, shaft, belly, bosh and hearth (as shown graphically in Fig. 1). The typical BF system mainly includes BF body, ore and coke feeding system, hot blast system, pulverized coal injection system, top gas treatment system and tapping system.9) During the operation of BF, solid raw materials mainly including iron ore and coke are alternately charged from the top of the furnace layer by layer. Simultaneously, high-pressure and high temperature hot blast enriched with oxygen and carrying auxiliary materials such as natural gas or pulverized coal is blown into the bottom of the furnace through tuyeres. Their combustion with coke at the raceway produces a great deal of heat and reducing gas consisting of carbon monoxide and hydrogen.10) The reducing atmosphere and the coke remove the oxygen in the burden to make the pig iron products accompanied by slags, which are regularly tapped out by opening the clay-lined tapholes for the following steelmaking process.

Fig. 1.

Schematic of a typical BF ironmaking process. (Online version in color.)

Energy consumption, resource shortages and environmental problems caused by the development of iron and steel industry are becoming increasingly serious. Low fuel ratio (FR) will result in decreasing the sum of all fuel consumption (including coke, pulverized coal, coke nut etc.) per ton of molten iron produced, and is the root of addressing the above problems.11) Then, it is a necessary work to quickly and efficiently predict FR of BF to achieve low energy consumption production. However, it is a challenge to construct a relational model between FR and the multitudinous parameters because BF is a complex industrial reactor including the interacting effects of multiphases accompanied simultaneously with multiphase coupling and multiphysics field coexisting,12) and so on.

What is encouraging is that data mining applied for research on the BF production has been certified as an effective approach to solve the above problems.13,14,15) Zhou et al. constructed the Hammerstein model to predict the most essential quality indicators of the BF with the least squares support vector machine-based nonlinear subspace identification method.16) The prediction model of BF’s silicon content with support vector regression (SVR) was built by Xu et al.17) In those data-driven models, many data mining methods were used, such as principal component analysis (PCA),18,19) SVR,20,21,22) artificial neural network (ANN),23,24) genetic algorithm (GA),25) and so on. Especially, support vector machine (SVM) have certainly garnered a lot of attention in that it can solve the dimension disaster and avoid overfitting to deal with nonlinear problem. For instance, the smooth support vector regression (SSVR) model constructed by Jian et al.26) was used to predict the trend of the silicon content of BF hot metal. Because there are highly complicated chemistry processes and the transport phenomena in blast furnaces, the data-driven models27) with the assistance of data mining are of vital importance for throwing light on the complex interrelations among variables in ironmaking process.28,29)

The work concentrates on the development of the support vector regression (SVR) model based on data mining to predict and optimize FR of the BF with the optimal feature set including barely five parameters. The quantitative control of FR can been realized by adjusting the parameters. In modeling, GA-SVR method was used to find the optimal feature set. The experimental results indicated that the SVR-RBF (radial basis function) model with high precision and generalization performance can provide the valuable guidance and references for operators and managers to determine the direction of low energy-consumption, low-emission, saving cost, and strengthening the competition power of the enterprises.

2. Materials and Methods

2.1. Dataset and Features

The dataset was built by collecting the historical data throughout the year from the BF (internal volume of 2000 m3) in the Iron and Steel Company of China. The outliers must be excluded because they occurred under abnormal production conditions (such as blowing-down, record fault or overhaul). So, the real dataset for modeling comprises 326 samples. The dataset for testing the model consists of 87 samples was collected from the BF in the first three months of next year. The disciplinarian of BF burden distribution had remained unchanged during the period of data collection.

A lot of process parameters of BF have more or less effects on FR since BF is a complex industrial reactor.30,31) However, not all parameters related to FR can be as the inputs of model because there are strong correlations among some parameters. Too many input variables will lead to a highly complex model. Therefore, it is a requisite work to search for the candidate features from numerous process parameters for modeling. Finally, the thirty-five candidate features (in Table 1) were determined from experiences of the BF experts and correlation analysis. The output of the model is FR of the BF in units of Kg/t. The median value of FR is equal to 561.5 Kg/t.

Table 1. The list of the candidate features for modeling.
No.MeaningsFeaturesNo.MeaningsFeatures
1Grade of Iron (%)X119Iron Losses (%)X19
2Pig Iron [Ti] (%)X220Unit Consumption of Nut Coke (Kg/t)X20
3Pig Iron [Si] (%)X321Small Sinter (Kg/t)X21
4Blast Temperature (°C)X422Comprehensive Ironmaking Strength (t/m3·d)X22
5Top Gas Pressure (MPa)X523Feed batchX23
6Blast Volume (m3/min)X624Gas Utilization Rate (%)X24
7Coke-load (t/t)X725Unit Consumption of Iron Ore (Kg/t)X25
8Utilization Coefficient (%)X826Index Burden Permeability (Q/ΔP)X26
9Slag Iron Ratio (Kg/t)X927Gray Iron Ratio (%)X27
10The Basicity of SlagX1028Top Temperature (°C)X28
11Oxygen-enriched Rate (%)X1129Sinter (t/batch)X29
12Coke Ash (%)X1230Small Sinter (t/batch)X30
13Coke Sulfur (%)X1331Pellet 1 (t/batch)X31
14Coke M40 (%)X1432Pellet 2 (t/batch)X32
15Coke M10 (%)X1533Huili Mine (t/batch)X33
16Coke CSR (%)X1634Batch Weight of Coke (t/batch)X34
17Coke < 25 mm (%)X1735Blast Speed (m/s)X35
18Clinker Rate (%)X18

2.2. Support Vector Regression

A version of SVM for regression, called SVR, was proposed by Vapnik et al.32) in 1996. SVM generalization to SVR is accomplished by introducing a ε-insensitive region around the function, called the ε-tube. Moreover, SVR is formulated as an optimization problem by first defining a convex ε-insensitive loss function to be minimized and finding the flattest tube that contains most of the training samples.33) In SVR, points outside the tube are penalized, but those within the tube, either above or below the function, receive no penalty. SVR problem formulation is derived from a geometrical perspective, using the nonlinear regression example shown in Fig. 2. Adopting a soft-margin approach like that employed in SVM, slack variables ξ, ξ* can be employed to guard against outliers.

Fig. 2.

The diagram of SVR with ε-insensitive approach. (Online version in color.)

Training the original SVR means solving:34)   

minimize                1 2 ||ω| | 2 (1)
  
subject   to   { y i -<ω,    x i >-bε <ω,    x i >+   b- y i ε     (2)
where xi is a training sample with target value yi. The inner product plus intercept <ω, xi> + b is the prediction for the sample, and ε is a free parameter that servers as a threshold: all predictions must be within ε range of the true prediction.

Vapnik introduced the functions into his developed SVM method, making SVM can deal with nonlinear problems by using linear algorithms. The open issue in SVR is the selection of parameter values for the kernel and loss functions.35) In this work, the SVR model employed RBF as the kernel function,36) whose expression is given as:   

K( x i , x j )=exp( -|| x i - x j | | 2 σ 2 ) (3)

2.3. Implement

The calculations were performed on Online Computational Platform of Material Data Mining37,38) developed by us. It can be freely used on the website (http://matdata.shu.edu.cn/ocpmdm/). Its predecessor is HyperMiner software package32,39,40) written by us. Its free version can be downloaded from the website of Laboratory of Materials Data Mining in Shanghai University (http://chemdata.shu.edu.cn:8080/MyLab/Lab/download.jsp).

3. Results and Discussion

3.1. Feature Selection

Feature selection is a key factor to determine whether a successful model could be established. It can reduce the dimension of feature space to further decrease the risk of over fitting, and can better remove features unrelated to target value and noise interference.41,42,43) Meanwhile, it can also make the training time shorten, further promote the prediction ability and generalization performance of the model. In the work, GA-SVR method was employed to screen the subset of features for modeling. Compared with other optimization algorithms, GA has an ability to move from local optima present on the response surface. Accordingly, it is widely used in optimization for its fine global search capability. In the SVR models, root mean square error (RMSE) of 10-fold cross validation was as the fitness function evaluation. The smaller the RMSE is, the better the modeling effect is. RMSE is defined as follows:   

RMSE= i=1 n ( e i - p i ) 2 n (4)
where: ei and pi are experimental value and predictive value of FR of the ith sample, respectively; n is the number of sample points in the sample set.

Figure 3 illustrates how GA can be used to compress the candidate variable set for finding the optimal feature set. It can be seen that the smallest RMSE emerged after GA accomplished 48 generations. Here the optimal input variable set was found including the five features (Blast Temperature (X4), Coke-load (X7), Iron Losses (X19), Pellet 2 (X32) and Batch Weight of Coke (X34)). In accordance with the BF production process, the five features are responsible for FR directly or indirectly. So it is reasonable that they are used as the predictors of the model.

Fig. 3.

RMSE versus generation of evolution in GA. (Online version in color.)

3.2. Model Building

A SVR-RBF model was constructed to make quantitative predictions of unknown samples. The generalization performance of a nonlinear SVR-RBF model relies on the setting values of three hyper-parameters, namely, C, ε and σ. Parameter C is a constant that determines regularized penalty to estimation errors. Parameter ε controls the width of the ε-insensitive zone and is used to fit the training data. Parameter σ is the RBF width parameter that determines how far the effect of a single training example reaches, where the low value means ‘far’, and the high value means ‘close’. When optimizing the three parameters through the grid-search method, they were set in [0.01, 0.1], [1, 30] and [0.5, 1.5], and had steps of 0.02, 2 and 0.22, respectively. Their optimization process is shown in Fig. 4. They are best when RMSE is lowest, namely the best regression performance can be achieved.

Fig. 4.

Three hyper-parameters optimization of the SVR-RBF model. (Online version in color.)

The result of SVR parameter grid search showed that the optimal C, ε and σ were 3, 0.03 and 1.38, respectively. A SVR-RBF model with RCro (correlation coefficient) of 0.916, RMSECro of 5.252 and Q2Cro (determination coefficient) of 0.839 for 10-fold cross validation was constructed by using the optimized parameters. The subscript Cro represents cross-validation. The predictive results of the training set and the test set are shown in Fig. 5. The model is shown as follows:   

y= i n β i exp(-1.38(||x- x i | | 2 )+   0.4506755 (5)
where x is the unknown vector, xi is the support vector in the model, n is the corresponding number, and βi is Lagrange multiplier of the support vectors.
Fig. 5.

The predictive results of the training set and the test set by using the SVR model. (Online version in color.)

RTra, RMSETra and Q2Tra of the model are 0.942, 4.386 and 0.888, and RTes, RMSETes and Q2Tes of the model are 0.923, 5.460 and 0.837, respectively. The subscript Tra and Tes identify the training set and the test set. To further understand the prediction performance of the model, relative deviations (RD) of FR for the training set and the test set were analyzed. The result is shown in Table 2. It can be seen from the table that the model predicts 80.37% data with RD of less than 1%, and 1.23% data have RD of from 2% to 4% in the training set. It can be also observed that 72.41% data are predicted with RD of less than 1%, and no data have RD of more than 2% in the test set. From the above analysis, it can be seen that the model has high prediction accuracy and practical meaning. So, SVR-RBF method is feasible not only in theory but in application.

Table 2. Relative deviation (RD) of predictive values of FR for the training set and test set.
RD (%)% of predictive training valuesCumulative percentage% of predictive test valuesCumulative percentage
RD ≤ 180.3780.3772.4172.41
1< RD ≤218.4098.7727.59100
2 < RD ≤ 41.231000100
Total100100

3.3. Simulation Analysis

The section discusses the effects of various variables on FR by means of regression analysis of the SVR-RBF model. Simulation study can be used to observe the diversification of the target variable on one variable when the other variables are fixed at the mean values. Figure 6 illustrates the diversification of FR with Blast Temperature (X4), Coke-load (X7), Iron Losses (X19), Pellet 2 (X32), and Batch Weight of Coke (X34) respectively when the other four features keep constant. From Fig. 6(a), the increased Blast Temperature translates into lower FR in the zone of low blast temperatures, while FR rises slightly at higher blast temperatures. It is mainly because too much low-quality coal powder are blown into BF when Blast Temperatures are higher. In Fig. 6(b), FR first increases with Coke-load then decreases more sharply. In Figs. 6(c) and 6(e), FR increases with Iron Losses and Batch Weight of Coke, respectively. The opposite trends can be observed in Fig. 6(d). Namely, FR has been in the decreasing trend when Pellet 2 is increased.

Fig. 6.

The diversification of FR with (a) Blast Temperature (X4), (b) Coke-load (X7), (c) Iron Losses (X19), (d) Pellet 2 (X32), and (e) Batch Weight of Coke (X34). (Online version in color.)

4. Conclusions

This work focuses on FR optimization of the BF that is a critical problem directly related to the existence and development of the iron and steel enterprises. To solve the issue, the data mining methods were employed to construct the relationship model between FR and the five technology parameters. Taking the quantitative control and optimization of FR of the BF into account, the SVR-RBF model with RTra of 0.942, RMSETra of 4.386 and Q2Tra of 0.888 was established. The model with high precision and generalization performance can help the BF foremen adjust the parameters in advance to optimize FR, and assist the managers to make the right decisions for improving the competitiveness of the enterprise. All the calculations were performed on the computational platform of data mining developed by us, which is free to do many of the data mining tasks that were previously available only in commercial software packages. The study integrated experiences of the BF foremen and experts, and the data mining techniques to solve the industrial optimization problem with strong noise and multi-variable couple. So the method outlined here can provide valuable hints into the industry optimization with the assistance of data mining.

Acknowledgement

The authors acknowledge the financial support from the National Key Research and Development Program of China (No. 2016YFB0700504).

References
 
© 2020 The Iron and Steel Institute of Japan.

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs license.
https://creativecommons.org/licenses/by-nc-nd/4.0/
feedback
Top