Chemical and Pharmaceutical Bulletin
Online ISSN : 1347-5223
Print ISSN : 0009-2363
ISSN-L : 0009-2363
Regular Articles
Scale-Free Soft Sensor for Monitoring of Water Content in Fluid Bed Granulation Process
Keita Yaginuma Shuichi TanabeTakuya MiyanoHiroshi NakagawaSatoshi SuzukiShuichi AndoManabu Kano
著者情報
ジャーナル フリー HTML

2020 年 68 巻 9 号 p. 855-863

詳細
Abstract

In-line monitoring of granule water content during fluid bed granulation is important to control drug product qualities. In this study, a practical scale-free soft sensor to predict water content was proposed to cope with the manufacturing scale changes in drug product development. The proposed method exploits two key ideas to construct a scale-free soft sensor. First, to accommodate the changes in the manufacturing scale, the process parameters (PPs) that are critical to water content at different manufacturing scales were selected as input variables. Second, to construct an accurate statistical model, locally weighted partial least squares regression (LW-PLSR), which can cope with collinearity and nonlinearity, was utilized. The soft sensor was developed using both laboratory (approx. 4 kg) data and pilot (approx. 25 kg) scale data, and the prediction accuracy in the commercial (approx. 100 kg) scale was evaluated based on the assumption that the process was scaled-up from the pilot scale to the commercial scale. The developed soft sensor exhibited a high prediction accuracy, which was equivalent to the commonly used near-infrared (NIR) spectra-based method. The proposed method requires only standard instruments; therefore, it is expected to be a cost-effective alternative to the NIR spectra-based method.

Introduction

Fluid bed granulation has been widely applied to the manufacturing of pharmaceutical solid oral dosage forms such as tablets to improve drug product quality and manufacturability. In a top-spray granulation process, a binder solution is sprayed onto the particles fluidized by the heated inlet air from the bottom of the granulator. In-line monitoring of the granule water content profile during granulation is important because it correlates the granule properties and can affect the tablet properties such as hardness and disintegration time finally.1)

Mass balance and heat transfer models have been traditionally used for predicting the water content of the granules.24) These models typically use not only the process parameters (PPs) monitored by the standard instruments, such as inlet air humidity, spray rate, and exhaust air temperature, but also several unknown parameters, which need to be tuned by using experimental data, such as the heat transfer coefficient,2) particle wetting surface ratio,3) and the ability of powder particles to retain moisture.4) Mass balance and heat transfer models often have a challenge in determining the values of unknown parameters, which can deteriorate the prediction accuracy. The values of unknown parameters can differ depending on various factors such as the manufacturing condition, the scale of a fluid bed granulator, and the granule formulation. Hence, to construct accurate and robust models, it is necessary to calibrate the models by using the experimental data acquired with various manufacturing conditions and manufacturing scales, and especially to tune the unknown parameters accurately, which is practically challenging.

Soft sensors,5) which are statistical models constructed by using the standard PPs, have been utilized to predict the water content of granules. Rambali et al.6) demonstrated an optimization of the fluid bed granulation process based on the multiple regression model. In addition, other regression models such as partial least squares regression (PLSR)7) that can cope with collinearity among PPs have been used to increase the prediction accuracy and the robustness of the statistical model.8,9) However, the soft sensors, as well as the mass balance and heat transfer models, are valid only within the process conditions considered during model development. For example, when the manufacturing scale is changed, the prediction accuracy of the model might decrease because the relationship between the majority of PPs and the water content is generally dependent on the manufacturing scale. Similarly, the existing soft sensors used to predict the particle size of granules were valid only for the same granulator for which experimental data was obtained.9,10) Although the manufacturing scale of the granulation process can be adjusted according to the demand during clinical development and commercial manufacturing, a soft sensor that enables scale-free monitoring of the material attributes of granules during granulation such as water content has not been developed yet.

This study aims to develop a novel robust soft sensor that can counter changes in the manufacturing scale. To develop a scale-free soft sensor for monitoring water content, the effects of the input variables and a regression method considering the accuracy and robustness of the prediction model were evaluated. There are two key ideas for constructing a scale-free soft sensor. First, to accommodate the changes in the manufacturing scale, the input variables were selected based on variable importance in the PLSR model constructed using the experimental data obtained at different manufacturing scales. Second, to construct an accurate statistical model, locally weighted partial least squares regression (LW-PLSR)11) was utilized. The LW-PLSR is a type of just-in-time modeling method that can cope with collinearity and nonlinearity. Recently, LW-PLSR has been applied to various industrial processes such as pharmaceutical,1113) petrochemical,14,15) and semiconductor production.15,16) The LW-PLSR was adopted since the relationship between basic PPs and water content could be nonlinear. In this study, soft sensors were developed using both laboratory (approx. 4 kg) data and pilot (approx. 25 kg) scale data; the prediction accuracy in the commercial (approx. 100 kg) scale was evaluated based on the assumption that the process was scaled-up from the pilot scale to the commercial scale. A direct, near-infrared (NIR) spectra-based prediction model17,18) was also constructed as a reference. While this approach requires a high initial investment for equipment such as the NIR spectrometer, probe, spectroscopy software, and their incorporation into the fluid bed granulation process, the NIR spectra-based monitoring methods are generally robust enough to counter changes in the manufacturing scale. Both the soft sensor and the NIR spectra-based models were constructed using a dataset obtained from the same experiments, and their prediction accuracies were compared.

Experimental

Materials

A formulation composed of the drug substance (Daiichi Sankyo Co., Ltd., Japan) and several excipients were granulated using the following three different fluid bed granulators: NFLO-5 (Freund Corp., Japan) for the approx. 4-kg scale, GPCG-30 (Powrex Corp., Japan) for the approx. 25-kg scale, and WSG-120 (Powrex Corp.) for the approx. 100-kg scale. In this study, the manufacturing scales were defined as follows: approx. 4 kg is the laboratory scale, approx. 25 kg is the pilot scale, and approx. 100 kg is the commercial scale. A schematic diagram of the fluid bed granulator is provided in Fig. 1.

Fig. 1. Schematic Diagram of a Fluid Bed Granulator

Granules were sampled during the granulation process at the time points provided in Table 1. In this study, as provided in Table 1, the calibration and validation datasets consist of only spraying process data because the number of PPs in the spraying process is different from that in the drying process; spray rate (g/min) and spray air volume (NL/min) are PPs specific to the spraying process. The spraying process, where liquid supply and evaporation occur simultaneously, is complex and difficult to control water content accurately compared to the drying process. Besides, the scale-up is generally performed based on the water content profile during the spraying process. It is desirable that PPs are determined to keep the granule water content during the spraying process constant at different manufacturing scales.19) Thus, in-line monitoring of water content in the spraying process is especially important, and the present research focused on the spraying process.

Table 1. Calibration and Validation Datasets
DatasetEquipmentBatch No.TimePP1PP2PP3PP4PP5PP6PP7PP8WC
CalibrationNFLO-5102.676.310.05014941.339.7202.2
CalibrationNFLO-5152.582.810.05215039.238.3402.9
CalibrationNFLO-51102.584.710.15515036.736.6483.5
CalibrationNFLO-51152.084.710.05315035.135.5504.3
CalibrationNFLO-51202.584.19.36914935.834.3553.5
CalibrationNFLO-51252.584.89.55314936.234.1564.0
CalibrationNFLO-51302.585.19.74914836.734.1583.8
CalibrationNFLO-5202.667.410.07014742.042.6161.9
CalibrationNFLO-52152.070.610.18114829.331.6659.5
CalibrationNFLO-52201.570.110.28114928.230.36812.5
CalibrationNFLO-52251.570.110.17114827.729.46915.6
CalibrationNFLO-5352.569.39.26614735.541.6343.7
CalibrationNFLO-53102.569.99.56514730.636.8464.7
CalibrationNFLO-53152.570.09.86714729.234.1537.9
CalibrationNFLO-53202.569.68.86914828.530.4707.7
CalibrationNFLO-53252.669.99.46714928.429.6739.2
CalibrationNFLO-53292.570.09.66614828.429.17510.2
CalibrationGPCG-304108.087.05.118017841.241.8343.1
CalibrationGPCG-304207.987.95.218017538.038.9443.8
CalibrationGPCG-304308.088.05.317917936.537.3514.1
CalibrationGPCG-304407.987.95.418018135.536.1574.5
CalibrationGPCG-304507.987.65.518018234.935.2614.4
CalibrationGPCG-304597.987.85.518018134.634.6654.8
CalibrationGPCG-305107.987.56.515114844.842.9332.8
CalibrationGPCG-305208.088.76.615015043.041.3383.0
CalibrationGPCG-305307.987.96.714914842.140.4413.2
CalibrationGPCG-305408.088.76.314915141.640.0433.3
CalibrationGPCG-305507.988.06.714915041.039.6443.5
CalibrationGPCG-305607.988.06.515014840.739.3463.4
CalibrationGPCG-305727.987.96.714915140.338.8473.5
ValidationWSG-12011030.089.92.369971541.340.1513.6
ValidationWSG-12012030.190.02.269969238.738.1544.0
ValidationWSG-12013029.990.12.469869837.336.9594.3
ValidationWSG-12014030.090.02.369969836.536.1644.7
ValidationWSG-12015030.090.12.269870035.935.5674.9
ValidationWSG-12016230.090.02.369769935.535.1695.0
ValidationWSG-12021027.485.04.084865035.737.5454.3
ValidationWSG-12022028.185.03.884964332.233.9596.2
ValidationWSG-12023028.185.03.884964930.932.2687.6
ValidationWSG-12024028.185.03.984864630.331.2749.3
ValidationWSG-12025128.085.03.984864730.130.87810.8
ValidationWSG-12031031.994.93.854873049.847.0262.6
ValidationWSG-12032032.195.13.855071250.348.4232.6
ValidationWSG-12033031.595.14.154874650.849.2222.8
ValidationWSG-12034031.995.13.854974351.049.5212.6
ValidationWSG-12035032.095.03.954974250.949.6212.8
ValidationWSG-12036031.295.04.154974351.049.7212.7
ValidationWSG-12037031.895.03.954974151.049.7212.6
ValidationWSG-12037931.695.13.854874151.149.7202.5

Time: spraying time (min), PP1: inlet air volume (m3/min), PP2: inlet air temperature (°C), PP3: inlet air humidity (g-water/kg-air), PP4: spray rate (g/min), PP5: spray air volume (NL/min), PP6: product temperature (°C), PP7: exhaust air temperature (°C), PP8: exhaust air humidity (%RH), and WC: water content of granules (%).

In addition, in all process conditions investigated in this study, it was confirmed that granule growth was proceeding during the spraying process according to the particle size of granules after drying. The particle size of granule after drying obtained through the experiment of WSG-120 Batch No.3, which is the driest process condition, was 95.8 µm and larger than that of the mixture with drug substance and several excipients, i.e., 36.6 µm.

Measurement

The measured values of PPs, NIR spectra of granules, and water content of granules, which were measured at the time points provided in Table 1, were utilized for calibration or validation.

Process Parameters (PPs)

The following eight PPs were measured during the granulation process: inlet air volume (m3/min), inlet air temperature (°C), inlet air humidity (g-water/kg-air), spray rate (g/min), spray air volume (NL/min), product temperature (°C), exhaust air temperature (°C), and exhaust air humidity (%RH). Among the eight PPs, inlet air volume (m3/min), inlet air temperature (°C), spray rate (g/min), and spray air volume (NL/min) are operable PPs. All the PPs were acquired using standard instruments such as a thermometer, hygrometer, flowmeter, and electric balance.

NIR Spectra

The NIR spectra of the granules were measured during the granulation process using a Fourier-transform NIR spectrometer MPA (Bruker Optik GmbH, Germany) or Matrix-F (Bruker Optik GmbH) through a fiber-optic probe attached to the fluid bed granulators. Because the MPA and Matrix-F use an equivalent optical device, the NIR spectra obtained are comparable.

Reference

The water content of the granules sampled during the granulation process was measured by the following loss on drying (LOD) device: HR73 (Mettler-Toledo K.K., Japan) or its equivalent HR83 (Mettler-Toledo K.K.).

Calibration Dataset

In this study, a scenario in which the granulation process was scaled-up from a pilot scale to a commercial scale was investigated. As shown in Table 1, the calibration dataset for both the soft sensors and NIR spectra-based models consists of 30 samples, which were acquired at the laboratory and pilot scale: NFLO-5 (3 lots) and GPCG-30 (2 lots).

Preprocessing

The PPs were normalized, that is, their means were equal to 0 and the variances were equal to 1. For the NIR spectra, the wavenumber region ranging from 8007 to 4258 cm−1 was used to exclude any noisy regions, and the prediction models were constructed using the following five preprocessing methods: first derivative, second derivative, standard normal variate (SNV),20) first derivative and SNV, and second derivative and SNV. The method presenting the highest prediction accuracy was then selected. Each preprocessing method was performed using the OPUS software (Bruker Optik GmbH).

Input Variable Selection

Process Parameters (PPs)

To predict the water content at a commercial scale without using the commercial scale data in the calibration dataset, critical PPs regardless of the manufacturing scales were selected as follows. First, a PLSR model was constructed using all the PPs provided in the calibration dataset. Then, variable importance in the projection (VIP) scores, which indicate contributions of the input variables to the prediction value, were calculated. The VIP score for the j-th variable is defined as follows:

  
(1)

where M is the number of input variables, R is the number of latent variables (LVs), qr is the r-th regression coefficient vector, tr is the r-th score vector, and wjr is the j-th element of the r-th weighting vector wr. Here, R was determined to minimize the prediction error sums of squares of the leave-one-out cross validation (LOOCV). In general, the input variables whose VIP scores are greater than or equal to one are considered critical to the prediction value.21) In this study, PPs with VIP scores greater than or equal to one were selected as the input variables.

NIR Spectra

The spectral fluctuation dividing (SFD)-nearest correlation spectral clustering (NCSC)-PLSR22) was adopted for wavenumber selection using the Python software (Python Software Foundation). SFD-NCSC-PLSR was reported to achieve a higher prediction accuracy than commonly used variable selection methods such as VIP, PLSR-beta, and interval PLSR.

Regression Methods

PLSR and LW-PLSR were used to construct the prediction model using the selected input variables. To evaluate the effect of the input variable selection on the prediction performance, a PLSR model using all the PPs was constructed and evaluated. The main difference between PLSR and LW-PLSR is the weighting rule for the calibration samples. In PLSR, a prediction model is constructed using fixed weighting values for the calibration samples. In contrast, in LW-PLSR, which is a type of just-in-time modeling method, the weighting values are updated according to the distances between a query and the calibration samples whenever a prediction is required for a query, and a local PLSR model is constructed. That is, LW-PLSR includes PLSR as a special case.11)

The PLSR and LW-PLSR models were constructed using MATLAB® software (MathWorks, Inc., U.S.A.). LOOCV was performed to determine the tuning parameters of PLSR and LW-PLSR: the number of LVs for PLSR and the combination of the number of LVs and the localization parameter for LW-PLSR. The root mean square error of cross validation (RMSECV) and the correlation coefficient (RCV) were calculated to evaluate the prediction accuracy as follows:

  
(2)

where N is the number of samples, and ŷn and yn are the prediction value and reference value of the water content for the n-th sample, respectively;

  
(3)

where ŷ̅ a̅nd y̅ are the mean values of ŷ̅n and yn, respectively.

Validation

As shown in Table 1, the validation dataset consists of 19 samples acquired at the commercial scale: WSG-120 (3 lots), which is not included in the calibration dataset. The validation dataset was used to evaluate the prediction accuracy of the constructed models based on the root mean square error of prediction (RMSEP) and RP.

  
(4)
  
(5)

To assess the validity of the LW-PLSR model as the soft sensor, two distance criteria were used: Hotelling’s T2 and square prediction error (SPE), also known as Q residual.23)

  
(6)
  
(7)

where tr, query is the r-th LV score of the query, t̅r, calibration is the mean value of tr, calibration, which is the r-th score vector of the calibration dataset samples, s2tr, calibration is the variance of tr, calibration, xm, query and x̂m, query are the query’s experimental value and reconstructed value of the m-th input variable.

Muteki et al.24) demonstrated that two distance criteria, T2 and Q, could test the validity of the PLSR model for the query. In this study, a 99% confidence limit was adopted as the threshold of T2 and Q to test whether the LW-PLSR model was valid for the query.

Results

Soft Sensors

Input Variable Selection

Figure 2 shows the VIP scores of all the PPs, which were calculated using the PLSR model after normalization. The results show that the VIP scores of four PPs, i.e., inlet air temperature (°C), product temperature (°C), exhaust air temperature (°C), and exhaust air humidity (%RH), were greater than one. Therefore, these four PPs were selected as the input variables.

Fig. 2. Importance of PPs Based on VIP Scores

All PPs were normalized before PLSR modeling.

RMSEPs of the PLSR models are provided in Figs. 3A and 3B. RMSEP was significantly reduced from 27.2 to 3.5% by using the four selected PPs as input variables. In addition, RMSECVs are provided in Figs. 3C and 3D. The PLSR model with the four selected PPs showed a notably smaller difference between RMSEP and RMSECV than that with all PPs; their differences were 1.7 and 25.6%, respectively. These results confirm that the proposed method constructed a robust soft sensor to counter the manufacturing scale change.

Fig. 3. Prediction Accuracy of PLSR Models Using the Four Selected PPs (A, C) and All PPs (B, D)

(A, B) RMSEPs and RP values, (C, D) RMSECVs and RCV values.

Regression Method

LW-PLSR and PLSR were compared in terms of prediction accuracy. The four selected PPs were used as input variables in both models; RMSEPs were calculated using the validation dataset shown in Table 1. As shown in Figs. 3A and 4A, the RMSEP of 0.5% for the LW-PLSR model was significantly smaller than that of 3.5% for the PLSR model. In addition, the difference between the RMSEP and RMSECV of −0.2% for the LW-PLSR model was smaller than that of 1.7% for the PLSR model as shown in Figs. 3 and 4. These results indicate that the LW-PLSR model is more accurate and robust than the PLSR model. The LW-PLSR model with the four selected PPs was confirmed to show the highest prediction accuracy among the three soft sensors.

Fig. 4. Prediction Accuracy of LW-PLSR Model Using the Four Selected PPs

(A) RMSEP and RP value, (B) RMSECV and RCV value.

The number of LVs in the LW-PLSR model is the same as that of the input variables. This indicates that dimension reduction was not performed, and Q was constant at 0. In addition, Hotelling’s T2 values of all validation dataset samples were lower than the 99% confidence limit.

NIR Spectra-Based Models

Preprocessing

Table 2 shows the RMSECVs and RCV values of the five PLSR models constructed by applying various preprocessing methods. As a result, SNV, which showed the highest prediction accuracy, was selected as the preprocessing method for constructing the prediction model using the NIR spectra.

Table 2. Comparison of Five Preprocessing Methods in RMSECVs and RCV Values
Preprocessing methodRMSECV (%)RCV
First derivative0.60.98
Second derivative0.70.97
SNV0.50.99
First derivative + SNV0.70.98
Second derivative + SNV0.80.97

Wavenumber Selection

Figure 5 shows the wavenumber region selected by SFD-NCSC-PLSR, i.e., ranging from 7212 to 6935 cm−1, which includes the first overtone wavenumber region for H2O.

Fig. 5. NIR Spectra, Preprocessed Using SNV, of Granules during Fluid Bed Granulation at the Wavenumber Region Ranging from 8007 to 4258 cm−1

Regression Method

Two models based on LW-PLSR and PLSR were constructed after applying SNV to the calibration set of the NIR spectra with a wavenumber region ranging from 7212 to 6935 cm−1. The RP values for both models were evaluated using the validation dataset shown in Table 1. The RP value of 0.98 for the LW-PLSR model was higher than that of 0.93 for the PLSR model, as observed from the results shown in Figs. 6A and 6B. Moreover, Fig. 6 shows the 0.5% difference between the RMSEP and RMSECV for the LW-PLSR model, which was equivalent to that of 0.6% for the PLSR model. Thus, the LW-PLSR model utilizing the NIR spectra was confirmed to be the better estimator of the water content compared to the PLSR model.

Fig. 6. Prediction Accuracy of LW-PLSR (A, C) and PLSR (B, D) Models Using Absorbance at the Wavelength Selected by SFD-NCSC-PLSR

(A, B) RMSEPs and RP values, (C, D) RMSECVs and RCV values.

Comparison of the Prediction Accuracy

The best soft sensor being the LW-PLSR model with the four PPs, and the best NIR spectra-based model being the LW-PLSR model with a wavenumber region from 7212 to 6935 cm−1 were compared. As shown in Figs. 4A and 6A, the 0.5% RMSEP for the soft sensor was slightly smaller than that of 1.0% for the NIR spectra-based model. In addition, Figs. 4A, 4B, 6A, and 6C show the −0.2% difference between the RMSEP and RMSECV for the soft sensor, which was equivalent to that of 0.5% for the NIR spectra-based model. These results indicate that the soft sensor was as accurate as the NIR spectra-based model and robust enough to counter manufacturing scale changes.

Discussion

In this study, soft sensors were developed using both laboratory (approx. 4 kg) data and pilot (approx. 25 kg) scale data; the prediction accuracy on a commercial scale (approx. 100 kg) was evaluated on the assumption that the process was scaled-up from the pilot scale to the commercial scale. The developed soft sensor achieved a high prediction accuracy equivalent to that of the NIR spectra-based model and was robust enough to counter the manufacturing scale change, only when an appropriate selection of the input variables and the regression method selection were carried out.

As shown in Fig. 2, the inlet air temperature (°C), product temperature (°C), exhaust air temperature (°C), and exhaust air humidity (%RH) were selected as the input variables of the soft sensor because the VIP scores were larger than 1.0. The soft sensor utilizing the four PPs showed higher prediction accuracy compared to the models using different PPs selected by the various VIP criteria, as shown in Fig. 7. The difference between the RMSEP and RMSECV became remarkably smaller when the VIP criterion was 1.0 or greater. Furthermore, the RMSEP and RMSECV of the model, which used the four PPs selected according to the VIP criterion of 1.0, were smaller than those of the model constructed using the PPs selected at the VIP criterion of 1.2. Based on the results, the four selected PPs were considered critical for constructing a robust soft sensor to counter manufacturing scale changes. As shown in Fig. 8, the four selected PPs were in similar ranges regardless of the manufacturing scale, while the parameter ranges of the other four PPs, i.e., inlet air volume (m3/min), inlet air humidity (g-water/kg-air), spray rate (g/min), and spray air volume (NL/min), were significantly different depending on the manufacturing scale. Reflecting the ranges of the four PPs used in the soft sensor, the values of Hotelling’s T2 and Q of all validation dataset samples were within the 99% confidence limit, which contributed to the validity of the LW-PLSR models for the query. The four selected PPs include one operable PP, i.e., inlet air temperature (°C). Thus, constructed soft sensors are expected to be useful for not only in-line monitoring but also water content control by adjusting inlet air temperature.

Fig. 7. Difference between RMSEPs and RMSECVs of LW-PLSR Models Depending on VIP Score Criteria
Fig. 8. Ranges of Input Variables at Three Fluid Bed Granulators: NFLO-5, GPCG-30, and WSG-120

Furthermore, LW-PLSR was effective in improving the prediction accuracy because the relationship between the input and output variables was nonlinear. As shown in Fig. 9, the four selected input variables represent the nonlinearity of the output variable of the water content. In LW-PLSR, a local PLSR model is constructed per a query using a weighted calibration dataset based on the distances between the query and the calibration samples. Because of its applicability to the nonlinearity in the dataset, the LW-PLSR model achieved a better prediction performance and robustness compared to the PLSR model.

Fig. 9. Relationship between Input Variables and an Output Variable (i.e., Water Content) in the Calibration Dataset

Conclusion

In this study, we demonstrated that the proposed soft sensor could predict water content accurately beyond the manufacturing scale range of the calibration dataset. There are two key ideas for constructing a scale-free soft sensor. First, to accommodate the changes in the manufacturing scale, PPs that are critical to water content at different manufacturing scales were used as the input variables. Second, to construct an accurate statistical model, LW-PLSR was utilized to cope with collinearity and nonlinearity. The experiments demonstrated that the soft sensor based on the proposed method exhibited a high prediction accuracy, which was equivalent to the commonly used NIR spectra-based method. Unlike the NIR-based method, the proposed method requires only standard instruments. Therefore, the proposed method is expected to be a cost-effective alternative to the existing NIR spectra-based monitoring method and has the ability to enhance the implementation of in-line monitoring of the fluid bed granulation processes.

Conflict of Interest

The authors declare no conflict of interest.

References
 
© 2020 The Pharmaceutical Society of Japan
feedback
Top