ISIJ International
Online ISSN : 1347-5460
Print ISSN : 0915-1559
ISSN-L : 0915-1559
Regular Article
Artificial Neural Network Modeling of Flow Stress in Hot Rolling
Parya AghasafariHamid AbdiMahmoud Salimi
Author information
JOURNAL OPEN ACCESS FULL-TEXT HTML

2014 Volume 54 Issue 4 Pages 872-879

Details
Abstract

In this study, an artificial neural network model is proposed to predict the flow stress variations during the hot rolling process. Optimization of the proposed neural network with respect to number of neurons within the hidden layer, different training methods and transfer functions of the neural network is performed. The results of the optimal network were compared with those of the conventional analytic method and it is shown that using an optimal neural network the mean calculated error is drastically reduced.

1. Introduction

Hot rolling process is one of the most common processes in forming of metal products with different shape and dimensions. In this process, dimensional accuracy of the workpiece depends on accurate prediction of materials resistance to forming in addition to mill control system. Resistance against deformation mainly depends on surface contact of the workpiece and the rollers, flow stress and friction coefficient. Flow stress in rolling is influenced by various parameters such as grain size, recrystallization, dynamic and static recovery, dislocations distribution, reduction at each stand, temperature and velocity or strain rate. In sheet rolling industries, controlling the strip profile and flatness is an important issue. For this purpose the pressure applied on the work rolls neck to maintain the required force for the rolling process depends on the resistance against deformation. One way to estimate this force is by considering the chemical composition of material for modeling of the workpiece’s mechanical behavior during the deformation. Extensive studies are conducted in this area including several works using Artificial Neural Networks (ANN) to model this process. Zhang et al., considered percentage of Carbon and Manganese content, input-output thickness, temperature and width of the plate as inputs of an ANN and tried to estimate the rolling force.1) Lin et al. modeled an ANN whose inputs were temperature, strain, strain rate, logarithm of deformation’s rate and inter-pass delay time. Flow stress of two-pass hot deformed 42CrMo steel was regarded as network output.2) In Ghaisari et al. model reduction, coiling temperature, finishing rolling temperature and percentage of components were inputs of ANN and outputs were the yield strength, ultimate tensile strength and elongation.3) Bagheripoor and Bisadi regarded impressive parameters of rolling process such as; thickness reduction, initial temperature, friction coefficient and roll speed as inputs of ANN. Calculated rolling forces and rolling torques from the finite element method were used as outputs of the ANN to monitor the dimension and plate output profile accuracy.4) Rao and Prasad tried to investigate the feasibility of utilizing an ANN to extract the complex relationships involved in hot-deformation process modeling.5) From the production point of view it is very important to predict the required roll force as accurate as possible so that the desired outgoing thickness is maintained. The requirement for the large computational time using either accurate analytical solution or finite element method makes it unsuitable for online applications. In addition the most challenging parameter required for accurate prediction of the rolling force by any method is the workpiece resistance to deformation known as the flow stress. In present investigation, an attempt is made to obtain an optimized model for evaluating the flow stress in hot rolling process by using an inverse analysis method and ANN. The ANN results are compared with an analytical method based on the least square method of error between the calculated values and the measured ones.

2. Flow Stress Equation

The flow stress of material is generally dependent on parameters such as the composition (C), temperature (T), strain (ε), strain rate ( ε ˙ ), grain size (d) and previous history. These parameters are not constant within the volume and also vary with time during the hot rolling process. The mechanisms by which the recrystallization and recovery take place are complicated and are not the purpose of this investigation. In this investigation the mean and equivalent values of the above mentioned parameters at each stand are considered to define the flow stress.

Researchers have presented different relations for flow stress. For instance, Ludwik6) presented his model in terms of strain rate. Zener and Holman7,8) and also Alder and Phillips9) expressed their model based on dependence of flow stress on strain and strain rate. Finally, Inouye and some other researchers expressed their model more complete than the early researchers. They considered dependence of flow stress on temperature, strain and strain rate. In flat strip rolling where the plate width remains constant, the plane strain condition applies. Considering a thickness reduction in plane strain flat rolling from h0 to hf, the thickness strain is defined as:   

d ε t = dh h    or    ε t = h 0 h f dε =ln h f h 0 (1)
In this relation h is the instantaneous height of the element in the roll gap.

The effective strain in this condition would be:   

ε= 2 3 ln h f h 0 (2)
The strain rate representing the deformation speed known as the rate of deformation tensor is easily derived from the Eulerian linear strain tensor as follows:   
ε ˙ ij = 1 2 d dt ( u i x j + u j x i ) (3)
Where ui is the velocity vector at position xi.

From the engineering point of view the strain rate is simply defined as follows:   

ε ˙ = 1 h dh dt (4)
The mean strain rate used in this investigation is the one proposed by Ford and Alexander.10)

At any position in the roll gap the strain is defined as:   

ε ˙ x = 1 h dh dx dx dt (5)
Where x is the distance from the exit plane in the deformation zone and h is the strip thickness at distance x. Since the term dx dt is the horizontal velocity of the element at distance x, by referring to geometry of the deformation zone and by integrating the strain over the arc of contact and dividing the result by the projected contact length, the mean strain rate for any pass is obtained as follows:   
ε ˙ m = πN 30 R r r h 0 4-3r (2-r) 2 (6)
In this relation r is the reduction in thickness as given by r= h f - h 0 h 0 and N is the roll speed in RPM.

In this study the mean flow stress in the plastic zone is estimated using the mill data. Inouye’s relation is used to describe the flow behavior of specific steel during hot rolling. The form of the equation is given by:   

σ= σ 0 ε n ε ˙ m exp(A/T) (7)
Where σ0, n, m and A are constants, ε, ε ˙ and σ represent the equivalent or effective strain, effective strain rate and flow stress values respectively and T is the temperature.

The von Mises yield criterion gives:   

σ= 2 3 ( s ij s ij ) 1/2 (8)
Where sij is the deviatoric part of the stress tensor.

Similarly the strain and strain rate values presented in this equation are the effective or the representative values. For example the effective plastic strain increment with respect to the strain increment tensor ij is given by the following relation.   

dε= 2 3 ( d ε ij d ε ij ) 1/2 (9)
To evaluate a relationship between the parameters an ANN is modeled to take strain, strain rate, temperature and a constant coefficient of unity related to σ0 as its inputs and considered the flow stress as its output. The Bland and Ford model is utilized to calculate the flow stress. In this model the rolling condition is considered to be plane –strain, plane sections perpendicular to the direction of rolling remains plane, the friction coefficient is considered to be constant over the arc of contact and the elastic strains of the work-piece is assumed to be negligible. This model is presented for calculation of the roll force and torque as follow:10,11)   
P=w[ 0 φ n p + R dφ+ φ n α p - R dφ] ] (10)
  
G=wμ R R r [ φ n α p + dφ- 0 φ n p - dφ] ] (11)
In these relations w is the plate width μ is the friction coefficient modeled as coulomb model, Rr is the roll radius, R' is the deformed roll radius (the value of which is usually calculated using Hitchcock12) approximate solution), φ is the angular contact position, φn is the angular neutral position and p+ and p are the roll pressure at entry and exit side at the roll gap respectively as given by the following relations:   
p + = 2kh h f ( 1- t f 2k ) e μH (12)
  
p - = 2kh h 0 ( 1- t b 2k ) e μ( H 1 -H) (13)

k is the yield (flow) shear stress of material. According to von Mises yield criterion k=σ/ 3 where σ is the representative flow stress at each stand. tf and tb are the front and back tensions. h, H and H1 appeared in these equations are specified as below:   

h= h f +2 R (1-cosφ) (14)
  
H=2 R h f tan -1 ( R h f ϕ ) (15)
  
H 1 =2 R h tan -1 ( R h f α ) (16)

In these equations hf is the final thickness and α is the total contact angle.

Table 1 gives a sample of rolling data for specific steel grade St37 from the finishing mill at Mobarakeh steel company. By solving nonlinear equations based on the Bland and Ford model, the flow stress and friction coefficient values are obtained as given in Table 2.

Table 1. Data log of the rolled material with w=958 mm.
standsh0
(mm)
r%P
(kN)
vR
(mm/s)
tf
(kN/mm2)
tb
(kN/mm2)
ε
(lnh1/h2)
ε ˙
(1/s)
T
(°K)
G
(kN-mm)
F135.62531.833111459820.00600.38325.90121239.3339160
F224.28444.3181398217090.00850.0060.585515.85351229.1345020
F313.52236.611305627020.00950.00850.455829.08771218.8177390
F48.57229.931058139040.0110.00950.355745.8602120488033
F56.00622.361924851210.0120.0110.253159.46551186.551850
F64.66318.6806163470.0140.0120.205774.68871167.334288
F73.79612.0536297733900.0140.128674.42731146.125576
Table 2. Calculated values for friction coefficient and flow stress.
standsμσ
F10.2540.152486
F20.2570.173910
F30.2590.206636
F40.2590.210688
F50.2580.241464
F60.2590.248264
F70.2570.267512

The main object of this work is to investigate the flow stress variations in the finishing mill. Referring to the slip line field solution provided by Hill,13) the mean pressure factor γm within the roll gap in relation to the plastic zone geometry is given.

Considering the representative strain as given by the following relation   

ε= 2 3 γ m ε t (17)
γm can be regarded as a redundant work factor. The proportion of redundant work is equal to: (γm–1)/γm. In finishing mill however the ratio of thickness of strip to the contact length is less than unity, in this condition the pressure factor γm =1.

As γm in all stands is considered to be unity, the redundant work is negligible and was ignored. Hence the stress values obtained in this work which is related to strain, strain rate and temperature is close to the true stress values of material within the roll gap.

3. Artificial Neural Networks Model

In the last two decades, different types of ANNs have been employed in various applications including NNs for input output mappings. The input output mapping networks are commonly used where the input-output relationship is unknown, complex, or difficult to obtain. This relationship is constructed in the NNs based on the network structure, network transfer functions, and tuning weight and bias parameters of the neurons of the network. A multi-layer Feed Forward ANN (FFANN) that is commonly used for such mapping consists of one input layer, one or more hidden layers, and one output layer.14)

3.1. Methodology

Implementing the methodology of ANN requires gathering and preparing sample data sets in the first stage and design of the network in the next stage. In the first step, it is important to note that the network can only be as accurate as the data because most of the networks are general and do not have any prior information about the data. Furthermore, FFANNs have shown good interpolation ability. However, the accuracy for extrapolating beyond the sample data range is weak.

The design stage normally consists of 1) pre-processing of the data and 2) designing of the network structure and 3) training network weights and biases parameters. Preprocessing includes eliminating wrong data to make sure that the data is correct and normalization of the input and target vectors. For the normalization process, a sample consist of input as xk (n) and output as yl (n) where k and l represent the input and output numbers respectively, and n is the data sample for each input and output. The normalized values for the inputs and outputs are given by:   

x ˆ k (n)= x k (n)- x ¯ k max( x k )-min( x k ) (18)
  
y ˆ l (n)= y l (n)- y ¯ l max( y l )-min( y l ) (19)
where x ¯ k = n=1 N x k (n) N , y ¯ l = n=1 N y l (n) N and N is the total number of data.

For the design stage, FFANN with one hidden layer that is shown in Fig. 1 is used.

Fig. 1.

The proposed FFANN.

The proposed FFANN has initial structure of 4-N-1 consisting of three layers with 4 neurons at the input, N neurons in the hidden and one neuron at the output layer.

For the training stage, once the network structure and input are determined, then the weights and biases are randomly initialized and FFANN can be trained. During the training process, the data set of input and output were used to adjust the network parameters so that outputs for a given input are as close as possible to the desired output. Commonly, during the training process the weights and biases of the network are iteratively adjusted to minimize the mean square error (mse) of the network. In this study the network has only one output and the mean square error would be:   

mse= 1 K n=1 K [ t l (n)- a l (n) ] 2 (20)
tl (n) is a vector of desired values, and al (n) is the vector of the network’s output values and K is the number of training data.

The correlation coefficient R14) is often used to qualify the generalization capability of the training, validation and testing network. Correlation coefficient R for the two variables is defined as the covariance of the two variables divided by the product of their standard deviations. One set of variables in this investigation is considered as the experimental flow stress obtained from the mill data using the Bland and Ford model and the other set of variables is referred to the flow stress predicted by the Neural Networks. In this case the expression for the correlation coefficient is defined as follows:   

R= i=1 N ( σ ei - σ ¯ e ) ( σ Ni - σ ¯ N ) i=1 N ( σ ei - σ ¯ e ) 2 i=1 N ( σ Ni - σ ¯ N ) 2 (21)

Different training algorithms are used for FFANN and they commonly use the gradient of the mean square error function to determine how to adjust the weights. Gradient descent and Gradient descent with momentum are two of the training algorithms for FFANN, however, they are often too slow for practical problems. Some of the other training algorithms are: Variable learning rate gradient descent, Bayesian regularization, Conjugate gradient with Powell/Beale restarts, Fletcher-Powell conjugate gradient, Polak-Ribiére conjugate gradient, One step scan, Scaled conjugate gradient, BFGS Quasi-Newton and Levenberg-Marquardt algorithm.

For the training purpose, the data is divided into three subsets of the “Training set”, which is used for network training, the “Validation set”, which is used to monitor the error during the training process and the “Test set” which was not already used during the training, but it is used to compare different models.

4. Prediction of Flow Stress by FFANN

In this paper, the aim was to obtain an ANN model for flow stress in hot rolling process.

As it was mentioned in a hot rolling process, flow stress is a function of an initial stress (σ0), strain, strain rate and temperature. These parameters are considered as the inputs to a FFANN and the flow stress will be the output of the network. The aim is to obtain an optimal structure, network functions and parameters for a FFANN for this problem. In order to obtain such network, the following studies were made.

a-Different training algorithms for the FFANN with different transfer function and number of neurons in the hidden layer were used and their influences on rolling force accuracy were investigated.

b-Hidden layer neurons number for the best training algorithm and transfer function was changed and its influences on rolling force accuracy were obtained.

c-The optimal FFANN model implemented and trained with the best training algorithm, transfer function and number of neurons in the hidden layer.

4.1. Methodology of Optimal FFANN for Flow Stress Prediction

In this study, a MATLAB code for a 4-N-1 FFANN is developed. 100 datasets information for each rolling stand in finishing rolling mill were collected from the Mobarakeh Steel Company and used to design the network. The nonlinear equations based on Bland and Ford model were solved first and the flow stress as well as the friction coefficient were obtained. The aim is to obtain an optimal network that could be used to estimate the flow stress in terms of σ0, T, ε and ε ˙ . An initial model (4-N-1) for FFANN with (1, T, ε, ε ˙ ) as its inputs and σ as output was considered. To obtain the optimal structure of the FFANN, the training algorithm, hidden layer transfer function and neurons number were changed and the output of the network for each input vector was extracted. Rolling force for each network output based on Bland and Ford model was recalculated and the mean error values for all calculations were compared. Optimal structure for the FFANN based on the minimum of mean error values in rolling force recalculation was obtained. The flowchart of the method used is given in Fig. 2.

Fig. 2.

Methodology flow chart.

As it mentioned there are different training algorithm for FFANN. Also, different transfer functions are used in network layers. In this study the effects of different transfer functions for the hidden layer and different training algorithm are examined and the results are presented as given in Table 3.

Table 3. Effects of different training algorithms and transfer functions on minimum of mean error in rolling force recalculation.
Training algorithmMinimum of mean error
tansigmoidlogsigmoidLinear
Levenberg-Marquardt3.3793313.6434284.946779
BFGS Quasi-Newton4.0695954.0249294.965345
Resilient Backpropagation4.078424.2136654.97255
Scaled Conjugate Gradient4.2849054.3675314.974842
Conjugate Gradient with Powell/Beale Restarts4.048594.3709264.977647
Fletcher-Powell Conjugate Gradient4.1504044.1038114.992811
Polak-Ribiére Conjugate Gradient4.2880984.4271724.961029
One Step Secant4.3094294.3949174.954902
Variable Learning Rate Gradient Descent5.2060615.3098334.980747

As it can be seen Levenberg-Marquardt algorithm and tangential sigmoid function give the lowest error values in rolling force recalculation. Here nine common training algorithms and three common transfer functions are considered and their effects on output accuracy were investigated.

Among the mentioned algorithms, the Levenberg-Marquard (LM) algorithm is widely accepted as the most efficient one in the logic of recognition accuracy. The LM algorithm provides a numerical solution to the problem of minimizing a non-linear function. It is fast and has stable convergence. LM algorithm is basically a Hessian-based algorithm for nonlinear least square optimization. Hessian-based algorithms allow the network to learn more fine features of a complicated mapping. The training process converges quickly as the solution is approached, since the Hessian does not vanish at the solution. Many other methods have already been developed for ANNs training. The steepest descent algorithm is one of them which could be regarded as one of the most significant breakthroughs for training ANNs. LM can be thought of as a combination of steepest descent and the Gauss- Newton method. When the current solution is close to a local minimum it becomes Gauss- Newton method and show fast convergence while the solution is far from local minimum the algorithm behave like a steepest decent method. In the ANNs field, this algorithm is suitable for training problems. The detailed analysis of the LM algorithm is beyond the scope of this paper and interested readers are referred to the paper by Hagan and Mahnaj15)

The choice of transfer functions may strongly influence the performance of ANNs. The ANNs commonly use the sigmoid transfer functions. Sigmoid functions all share a similar S shape that is essentially linear in their center and nonlinear toward their boundaries. The log-sigmoid transfer function only returns positive values. If the ANN needs to return negative numbers, this function will be unsuitable. While tan-sigmoid transfer functions is positive and negative compatible version of the sigmoid function. The sigmoid transfer functions usually are the default choices for the FFANNs. However, it cannot be said for sure such functions should always provide optimal decision margins.

In this study the model performance is evaluated only in terms of its output accuracy and not considering the CPU time. As expected the combination of LM as the more suitable algorithm in training and the tan-sigmoidal transfer function as the most common and fitting transfer function presented the best results.

In next step, effects of different hidden layer neurons number for the best training algorithm and function is investigated. For this purpose, for the 4-N-1 networks the number N is altered from 1 to 14 as presented in Table 4.

Table 4. Effects of hidden layer neurons number on minimum of mean error in rolling force recalculation.
neurons numberMinimum of mean error
14.871778743
23.968010282
34.858454337
44.05080325
54.309615419
64.035406753
74.036083442
84.084914614
93.871305441
103.379330672
114.02898009
123.793245776
134.174313535
144.01370755

It can be seen that 10 neurons numbers gives the best results among the others. Hence, 4-10-1 is chosen to be the best configuration for the FFANN.

The comparisons between the desired flow stress values obtained from the Bland and Ford model and the network output values for the training, validating and testing datasets are implemented.

In Fig. 3, the scatter plots and correlation coefficient for (a) training, (b) validating, (c) testing and (d) overall “training + validating + testing” of the network results are given. For a linear correlation or regression, straight lines of best fit (the trend-lines) which specify the correlation between the variables are presented on the vertical axis. The more the two data sets agree, the more the scatters tend to get closer to the identity line (y=x). Since the values of correlation coefficient “R” in all plots is well above 0.9 and close to unity, indicates a strong positive relationship through a compact linear rule.

Fig. 3.

Scatter plots and correlation coefficient for training (a), validating (b), testing (c) and overall (d) of the network results.

As it can be seen in Fig. 3(d) in which all values (training, validating and testing) are correlated by R=0.9766, indicates close agreement between these values. It seems that satisfactory correlation between desired values and network output suggests that the FFANN is able to successfully predict the flow stress for the specific steel grade St37 from the finishing mill at Mobarakeh steel company.

5. Comparison between the Results of ANN and the Analytical Method

In Figs. 4(a) and 4(b) the results obtained from the FFANN and an analytical method based on a least-square logic is compared.

Fig. 4.

Comparison between analytical results and NN results. a) NN results. b) analytical results.

As it can be seen the FFANN result presents a more accurate prediction for the rolling force. Referring to the result of the analytical method, it can be seen that the percentage error of stands number 1, 5, 6 and 7 are generally lower and the rest are generally higher than the mean error. FFANN gives even distributed variations in percentage error in addition to more accurate results.

6. Sensitivity Analysis

Mathematically sensitivity is defined as the ratio of the reaction or change induced in the output to a stimulus or change in the input. Sensitivity analysis is used to evaluate the influence of model parameters on model predictions. This will help to study a model from different points of view such as: better understanding of the relationships between input and output variables, testing the robustness of the results of a model etc.

There are two methods of sensitivity analysis; local method and global method. In local method variables or parameters are varied one at a time by a small amount around fixed points and the effect on the output is calculated. In global method all variables or parameters are varied simultaneously over their entire feasible space and the effects on the output of individual variables are evaluated. In this study the global method is applied. As the ANN models are fundamentally nonlinear, the sensitivity measurement depends on the input values.

In this study which is related to hot rolling finishing mill, the four parameters or inputs (initial stress, strain, strain rate and temperature) are somewhat independent of each other. Initial stress is related to the material being rolled, strain is related to the roll gap adjustment which is controlled by the mill control unit or manually by an operator, strain rate is directly related to the rolling speed and the roll gap setting controlled by mill control unit and finally the material temperature which is related to heat transfer to rolls and environments (only a small fraction of the heat generated is due to the dissipated plastic work related to strain/strain rate). These four parameters directly affect the flow stress which in turn determines the rolling force and rolling torque required for the process. Hence, in this study for a particular material being rolled, the correlation between the value of flow stress and the three values (strain, strain rate and temperature) is provided.

To investigate the sensitivity of mechanical behavior of special steel grade St37 to different input parameters, graphs of flow stress in terms of each input are plotted. Using data from various datasets of finishing mill provides a wide range of variation in each input parameter.

For all 700 datasets where in early stands the strain and temperature are higher and the strain rate is lower, while in last stands where the strain is lower and the temperature at the end will drop by almost 20% and the work-piece experiencing higher strain rates the outputs are extracted. The scatter plots of flow stress against each variable are obtained and the best curve among the points is sketched by curve fitting. Figures 5, 6 and 7 show that all variables influencing the output and demonstrate suitable sensitivity of flow stress to each variable.

Fig. 5.

Sensitivity of flow stress to strain.

Fig. 6.

Sensitivity of flow stress to strain rate.

Fig. 7.

Sensitivity of flow stress to average temperature.

Since, in finishing mill data logs, high strains are related to early stands and in these stands the workpiece has higher temperatures, in these stands it shows a lower resistance against deformation, as shown in Fig. 5. In Fig. 6 it can be seen that in hot rolling process at low strain rate which is related to early stands at high temperature, material resistance against deformation is less than that of the other stands. In Fig. 7 sensitivity of flow stress to average temperature is given. It is obvious that in early stands where the temperatures are higher, the deformation takes place with lower stresses than the last stands. Thus, according to rolling mechanism all above figures show convenient sensitivity of flow stress to ANN inputs.

7. Conclusion

In this study, an ANN model is proposed that was able to predict the flow stress variations during hot rolling process. The proposed network is a FFANN with one hidden layer and was optimized with respect to number of neurons within the hidden layer, different training Algorithms and transfer functions of the ANN. The result of the optimal network were compared with that of the conventional analytic method and it is shown that by employing an optimal FFANN the mean calculated error is reduced by about 50%. It is concluded that the proposed method provides accurate result with reduced computational time and is a proper method for online control or optimization.

List of symbols

p roll force

G torque per roll

R correlation coefficient

Rr rollers radius

T average temperature at each stand

hf Outgoing strip thickness

h0 Ingoing strip thickness

k Yield shear stress

m sensitivity of flow stress to strain rate

n sensitivity of flow stress to strain

p+ roll pressures at exit zone

p roll pressures at entry zone

tb backward tension

tf forward tension

vR Roll surface speed

w Strip width

α Total contact angle

ε effective strain

ε ˙  effective strain rate

φ contact angle position

φn contact angle at neutral point

μ friction factor

σ flow stress

σe experimental flow stress values

σ ¯ e  Mean experimental flow stress

σN flow stress values predicted by NN

σ ¯ N  mean values of σN

References
 
© 2014 by The Iron and Steel Institute of Japan

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs license.
https://creativecommons.org/licenses/by-nc-nd/4.0/
feedback
Top