2020 Volume 45 Issue 12 Pages 737-750
2,4-Dichlorophenoxyacetic acid (2,4-D) is one of the most widely used herbicides in the world, but its mutagenic and carcinogenic potential is still controversial. We simulated environmental exposure to 2,4-D, with the objective of evaluating the genotoxic effect of acute and chronic exposure to 2,4-D in rodents. We also evaluated the performance of machine learning algorithms in detecting differences in exposure groups through recognition performed from genotoxic characteristics. In the acute phase, 88 Swiss mice were used, distributed in five groups and exposed to nebulizations at different time intervals (24, 48, 72 and 192 hr). In the chronic phase, 88 Wistar rats were used, distributed in two groups (inhaled and oral) and exposed for six months. Femoral bone marrow cells were collected for a micronucleus test and comet assay. Data were evaluated by pattern recognition algorithms. In acute exposure, medium and high concentrations induced DNA damage in the comet assay, but these concentrations did not increase micronucleated cells. In the chronic exposure, there was an increase in micronuclei and DNA damage in the comet assay in all exposed groups regardless of the exposure route. The data showed a robust pattern of distinction between exposed and nonexposed groups to 2,4-D. Our data showed that both acute inhalation exposure and chronic oral and inhalation exposure to 2,4-D can cause genotoxic effects regardless of concentration. Machine learning showed a clear distinction between the control groups and those exposed to 2,4-D, and the effects of exposure are not concentration-dependent.
The 2,4-dichlorophenoxyacetic acid (2,4-D) herbicide is a synthetic auxin belonging to the group of phenoxyacetic acids developed during World War II with the aim of controlling broadleaf weeds (Sbano et al., 2013). 2,4-D is considered one of the most widely used herbicides in the world due to its low cost, high selectiveness, systemic effects, ease of transport by the plant and inhibition of plant growth (Song, 2014).
Human exposure to pesticides represents a large public health problem due to the possible induction of mutations in genetic material (Siqueira et al., 2012).
The 2,4-D has been placed in group 2B (possibly carcinogenic to humans) by the International Agency for Research on Cancer (IARC) (IARC, 2015) and in group D (not classifiable as to human carcinogenicity) by U.S. Environmental Protection Agency (US EPA) (RED, 2005). Genotoxicity to 2,4-D is controversial in the literature. A previous study has shown that breakage of the primary DNA strand in fish occurs after 7 days of exposure to 2,4-D (Ruiz de Arcaute et al., 2018). Other studies have shown that 2,4-D is not genotoxic and has no evidence of carcinogenicity in laboratory animals (Charles et al., 1999; Gollapudi et al., 1999; Bus and Hammond, 2007). However, the carcinogenicity of 2,4-D in humans has been widely debated due to the appearance of non-Hodgkin’s lymphoma in farmers using this herbicide (Hoar et al., 1986).
Machine learning has advanced considerably in recent years as it is a practical and broad technology (Burrel, 2016). The use of machine learning algorithms in the medical field has provided important results. This type of learning belongs to Artificial Intelligence because it is an automation process that generalizes the experience expressed in a set of data (Carvalho and Dallagassa, 2014).
Although there are several studies that have evaluated the genetic damage induced by pesticides in different experimental models, there is no data available for relevant concentrations to human exposures of the 2,4-D. In the present study, we adapted the environmental exposure of 2,4-D, using concentrations of exposure in crops, to assess the genotoxic effect of acute and chronic exposure to 2,4-D in rodent bone marrow cells. In addition, we evaluated the performance of machine learning algorithms in detecting differences in exposure groups through recognition performed from genotoxic characteristics.
This study was approved by the Ethics Committee on the Use of Animals of the Universidade do Oeste Paulista (Protocols 3140 and 3761).
Exposure to 2,4-dichlorophenoxyacetic acid (2,4-D)The animals were exposed to the 2,4-D (Nortox® SA, Arapongas, Paraná, Brazil) with the following composition: 806 g/L dimethylamine salt of 2,4-dichlorophenoxyacetic (80.6% m/v), 670 g/L 2,4-D acid equivalent (67.0% m/v) and 424 g/L inert ingredients (42.4% m/v), and the concentration of total dioxins is 0.00001 g/kg (= 0.01 ppm).
For each type of crop, a specific concentration of 2,4-D is required. The choice of doses was based on the lowest dose, the intermediate dose and the highest dose of 2,4-D used for spraying crops. Thus, the following concentrations were used based on the manufacturer’s recommendation and adjusted to the size of the experimentation box to simulate the environmental exposure [12]: 3.71 x 10-3 grams of active ingredient per hectare (g.a.i./ha) corresponding to 187.17 mg/m3, 6.19 x 10-3 g.a.i./ha corresponding to 313.31 mg/m3 and 9.28 x 10-3 g.a.i./ha corresponding to 467.93 mg/m3.
Nebulization protocolFor nebulization in the acute phase, 2,4-D was diluted in 10 mL of 0.9% sodium chloride solution (NaCl), and for nebulization in the chronic phase, 2,4-D was diluted in 10 mL of distilled water (Mello et al., 2018).
The nebulization protocol had two boxes (32 x 24 x 32 cm), and each box was coupled to an ultrasonic nebulizer (Pulmosonic Star®, Soniclear Ind. Com. Imp. E Exp. Ltda., São Paulo, Brazil) (Mello et al., 2018). For simulation of an environmental exposure, holes were made manually in both boxes, allowing air to be exchanged between the internal and external environment of each box. The nebulization time was 15 min (time needed for the entire solution to be nebulized) per day.
Animal protocolFor acute exposure, 88 adult male Swiss mice (initial age of 60 days) were used and for chronic exposure, 88 adult male albino Wistar rats (initial age of 90 days) were used (El Kabbaoui et al., 2017).
The animals were placed in collective plastic cages measuring 30 x 16 x 19 cm and stored in a controlled environment (acclimatized vivarium) with photoperiods of 12 hr of light and 12 hr of darkness at an ambient temperature of 22 ± 2ºC (National Research Council, 2011). The animals received filtered water and balanced commercial feed (Supralab®, Alisul, Brazil) ad libitum.
All groups were exposed at the same time in the acute and in the chronic phases.
Design of acute exposure (inhalation only)The animals were randomly assigned to five groups as follows: C group (saline group), exposed to nebulization with 10 mL of sodium chloride solution (0.9% NaCl) (n = 20); BC group, exposed to nebulization with low concentration of 2,4-D (n = 20); MC group, exposed to nebulization with a mean concentration of 2,4-D (n = 20); AC group, exposed to nebulization with a high concentration of 2,4-D (n = 20); CPa group (positive control of acute exposure), exposed to cyclophosphamide (Genuxal, Baxter Oncology GmbH, Halle, Westfalen, Germany) in a single subcutaneous dose (50 mg/kg) on the first day of the experiment (n = 8) (MacGregor et al., 1987).
The animals of groups C, BC, MC and AC were exposed repeatedly to nebulizations to the herbicide at different time intervals (24, 48 and 72 hr) with 5 animals for 24 and 48 hr and 10 animals for 72 hr. The animals were euthanized 24 hr after the last nebulization, and five animals nebulized three times were euthanized 192 hr after the last nebulization (Mello et al., 2018) (Fig. 1). The animals in the CPa group were euthanized 24 hr after the beginning of the experiment (MacGregor et al., 1987).
Design of acute exposure. C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure.
The animals were randomly divided into two major groups, namely, inhalation exposure and oral exposure, with respective subgroups (Table 1).
Exposure intensity | Inhalation exposure | Oral exposure | Sample |
---|---|---|---|
Control | CI | CO | n = 10/group |
Low concentration | BCI | BCO | n = 10/group |
Medium concentration | MCI | MCO | n = 10/group |
High concentration | ACI | ACO | n = 10/group |
CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group.
The CPc group (positive control of chronic exposure) was exposed to cyclophosphamide (Genuxal, Baxter Oncology GmbH) in a single subcutaneous dose (50 mg/kg) on the first day of the experiment (n = 8) (MacGregor et al., 1987).
The animals exposed by inhalation were nebulized for five days in a week to simulate the occupational exposure of the rural worker. The feed of the animals in the oral group was nebulized one day before being offered to them, being changed every two days. The residual feed was weighed at each change.
Animals in all chronic exposure groups were exposed for 6 months and then euthanized 24 hr after the last nebulization. The animals in the CPc group were euthanized 24 hr after the beginning of the experiment (MacGregor et al., 1987).
Euthanasia of all animals (acute and chronic exposure) was performed with sodium thiopental (Syntec, Santana do Parnaíba, SP, Brazil) at a dose of 100 mg/kg of weight administered in the peritoneal cavity (Paiva et al., 2005; National Research Council, 2011).
Micronucleus testFor the micronucleus test, bone marrow material was collected from one of the femurs of each animal at the time of euthanasia, and two slides per animal were prepared (MacGregor et al., 1987). Slides were stained with Giemsa stain (Dolles, São Paulo, Brazil). To determine the number of micronuclei, 2000 polychromatic erythrocytes were counted per animal using an optical microscope at 400x magnification (MacGregor et al., 1987). Micronuclei were characterized according to Tolbert et al. (1992). The analysis of the slides was blind and performed by one researcher (GAN).
Comet assayBone marrow samples were collected from a femur of each animal. Cells were resuspended in storage medium composed of fetal bovine serum (Cultilab, São Paulo, SP, Brazil), RPMI 1640 medium (Sigma, San Luis, MO, USA) and dimethyl sulfoxide (DMSO; Merck, Darmstadt, Germany), and they were stored at -80°C.
After defrosting the sample, for viability analysis, 10 µL of the bone marrow cell solution was mixed with 10 µL of trypan blue (Thermo Scientific, South Loga, UT, USA) and 200 cells were analyzed per treatment. The minimum recommended bioavailability for Comet assay is 75% (Singh et al., 1988).
Slides were prepared, placed in a horizontal electrophoresis chamber, covered with electrophoresis buffer, and incubated for 20 min. Electrophoresis was then performed in alkaline conditions (pH > 13) at 25 V and a current of 300 mA for one hour. After electrophoresis, samples were neutralized with Tris buffer (Tris 0.4 M; pH 7.5). Subsequently, slides were placed in a fixative solution [containing trichloroacetic acid, zinc sulfate (heptahydrate) and glycerol] and then washed. Samples were then stained in a silver solution at 37°C and stirred until the solution started to darken. Slides were then washed and placed in a stop solution for five minutes and washed again. After this process, slides were dried at room temperature and covered with a coverslip for analysis (Scherer and Strohschoen, 2013).
One hundred cells from each animal were chosen at random and analyzed under an optical microscope with 400x magnification. Cells were classified according to the shape and size of the tail into four classes of DNA damage as follows: 0 (no damage), 1, 2, 3 and 4 (highest degree of damage). “Hedgehog” cells were excluded from analysis (Villela et al., 2006). The analysis of the slides was blind and performed by one researcher (FMA).
Statistical analysisBecause the variables did not show normality by the Kolmogorov-Smirnov test or homogeneity of variances by the Levene test, the Kruskal-Wallis test was applied followed by the Dunn test for multiple comparisons.
SPSS V. 23 software was used, and the level of significance used for each test was 5%.
Machine learning algorithmsThe machine learning analysis experiments were performed as described by Tomiazzi et al. (2018).
To evaluate the hypothesis that acute and chronic exposure to 2,4-D causes a genotoxic effect on bone marrow cells, machine learning algorithms were used. Pattern recognition was performed based on the following genotoxic characteristics: (a) comets and (AA) micronuclei. The hypothesis was confirmed if the classification algorithm indicated the group (control or exposure) according to the parameters presented in each group.
The confusion or classification matrix was created from a set of data divided into categories, which determined whether the expected values correspond to the real value, with the total of these results displayed in a matrix.
The classification algorithms learned from a first set of examples, in which the group was previously known, which is called a training set. In this set, the algorithms are responsible for identifying the different types that can be used to complete the classification task. After training, the samples that were obtained by the algorithms and by the real group were tested. The hypothesis was proven if the methods used for the classification identified the groups with high precision.
For computational experiments, the following five versions of the collected dataset were considered:
• Dataset1: Groups subjected to acute exposure (C, BC, MC, AC and CPa) in the micronucleus test and in the comet assay.
• Dataset2: Groups subjected to acute exposure (C, BC+MC+AC and CPa) in the micronucleus test and in the comet assay.
• Dataset3: Groups subjected to chronic exposure (CI, CO, BCI, BCO, MCI, MCO, ACI, ACO and CPc) in the micronucleus test and the comet test.
• Dataset4: Groups subjected to chronic exposure (CI, CO, BCI+MCI+ACI, BCO+MCO+ACO and CPc) in the micronucleus test and in the comet assay.
• Dataset5: Groups subjected to chronic exposure (CI+CO, BCI+MCI+ACI+BCO+ MCO+ACO and CPc) in the micronucleus test and the comet assay.
For the size of each training set and each dataset, 20 runs were performed. All experiments were performed on a computer with an Intel i5 6500U processor with 8 GB of RAM using Linux as the operating system. The samples and test sets were part of a set of resources that outlined the measurements of mice and rats, and they were labeled according to their respective classes (C, BC, MC, AC, CPa, CI, BCI, MCI, ACI, CO, BCO, MCO, ACO and CPc). Five different sizes of training sets were used (50, 60, 70, 80 and 90%) with the aim of investigating the learning rate of the classification algorithms and datasets of all training sets and remaining tests.
Classification algorithms K-Nearest neighborsThe first machine learning algorithm used in computational analysis was the k-Nearest Neighbors (k-NN). k-NN is a simple and effective technique, and it is known for making use of all samples, which were used for training as a prototype through the input parameter k. The classification of a sample is achieved considering the nearest k training samples. For the value of k = 1, the test sample ends up being classified according to the class of the training sample that is closest to it (Pereira et al., 2018).
Support vector machineThe second machine learning algorithm used in computational analysis was the support vector machine (SVM) approach, which is based on the reduction of structural risks (Vapnik, 1999). The objective of SVM is to establish a function that discriminates between two classes of standards. During the analysis of the results, the kernel used was the Radical Base Function (RBF). The SIGMA and C parameters were determined using a grid search in the interval [2-15,215] with a power step of 2.
Decision TreeThe third machine learning algorithm used in computational analysis was the Decision Tree (DT). This model uses a decision tree with possible results, including accidental ones. This algorithm is commonly used in operational research, with emphasis on decision analysis, to find strategies that reach the goal with greater probability.
MetricsIn the context of this work, we used accuracy as a performance metric. Given a set with n test samples, we calculated the accuracy (Acc) as follows: Acc = hits / n, where hits denote the amount of correct ratings.
Training data setsThe training data sets ranged from 0 (zero) to 90 (ninety), that is, the matrices are equivalent to 90%. Therefore, the number of training data sets is equal to 90 times the size of the data set.
The parameters used in the experiments were defined empirically, they are: for the K-NN classifier we used the number of neighbors as 3; for the SVM we used the RBF kernel and gamma as 2 and C as 1; and for Decision Tree we used the maximum depth as 5.
The median micronuclei in the negative control group (C), BC group, MC group, AC group and CPa (positive control) groups was 0, 0, 0, 1 and 11.5, respectively. There was a difference between the groups studied (p < 0.0001) and when the CPa group was compared to the C, BC, MC and AC groups (p < 0.05), but there was no difference regarding the exposure times (p > 0.05) (Table 2, Fig. 2).
Groups | Exposure time | |||
---|---|---|---|---|
24 hr | 48 hr | 72 hr | 192 hr | |
C | 0-0Aa | 0-0Aa | 0-0Aa | 0-1Aa |
BC | 0-3Aa | 0-0Aa | 0-1Aa | 0-1Aa |
MC | 0-3Aa | 0-0Aa | 0-3Aa | 0-1Aa |
AC | 0-3Aa | 0-2Aa | 1-5Aa | 0-2Aa |
CPa | 7-15B,b |
C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. Capital letters compare the same group in the same line. Lower case letters compare groups at the same time and in the same column. Different letters means p-value < 0.05, where: A ≠ B; and a ≠ b.
Micronucleus count per study group regardless of exposure time (median and interquartile range) with acute exposure. C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. a x b: p-value < 0,0001 (Dunn test).
The minimum bioavailability of 86% was verified in the samples.
Regarding cells with a score of 0, there was a difference between the C group and the MC, AC and CPa groups as well as between the BC group and the MC, AC and CPa groups (p < 0.0001). Regarding cells with a score of 1, there was no difference between groups (p = 0.2013). The number of cells showing a score pf 2 differed between the BC and MC groups (p = 0.0180). Regarding cells with a score of 3, there was a difference between the C group and the MC and AC groups as well as the BC group and AC group (p < 0.0001). For cells with a score of 4, there was a difference between the C group and the MC, AC and CPa groups as well as the BC group and the MC, AC and CPa groups (p < 0.0001). There was a difference in relation to the exposure times only for cells with a score of 2 with an increase in the number of cells with a score of 2 for the control group and for groups exposed after 192 hr compared to the control groups and groups exposed for 24 and 48 hr (p < 0.001) (Table 3 and Fig. 3).
Groups | Score | Exposure time | |||
---|---|---|---|---|---|
24 hr | 48 hr | 72 hr | 192 hr | ||
C | Escore 0 | 41-96A | 9-85A | 5-39B | 15-78A |
Escore 1 | 3-57A | 5-53A | 4-32A | 4-26A | |
Escore 2 | 0-2A | 5-16B | 6-21B | 4-35B | |
Escore 3 | 0-0A | 1-17B | 7-30B | 6-26B | |
Escore 4 | 0-0A | 0-62B | 17-55B | 8-26C | |
BC | Escore 0 | 2-94A | 19-87A | 15-40A | 5-27B |
Escore 1 | 1-19A | 2-28A | 7-20A | 13-38A | |
Escore 2 | 1-15A | 2-13A | 5-14A | 2-30A | |
Escore 3 | 1-29A | 3-16A | 12-25A | 2-27A | |
Escore 4 | 1-55A | 6-31A | 31-50A | 5-43A | |
MC | Escore 0 | 1-20A | 0-13A | 2-24A | 1-10A |
Escore 1 | 1-32A | 4-30A | 10-26A | 9-21A | |
Escore 2 | 12-18A | 10-16A | 8-25A | 17-25A | |
Escore 3 | 7-27A | 9-27A | 16-27A | 14-23A | |
Escore 4 | 35-66A | 36-60A | 26-48A | 35-47A | |
AC | Escore 0 | 1-17A | 2-5A | 2-11A | 1-5A |
Escore 1 | 0-34A | 4-20A | 5-19A | 3-9B | |
Escore 2 | 0-26A | 5-12A | 7-39A | 12-39A | |
Escore 3 | 1-42A | 12-22A | 17-27A | 25-33A | |
Escore 4 | 19-98A | 48-68A | 23-62A | 30-48A | |
CPa | Escore 0 | 32-44 | |||
Escore 1 | 8-27 | ||||
Escore 2 | 17-27 | ||||
Escore 3 | 11-21 | ||||
Escore 4 | 8-13 |
C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. Capital letters compare the same group in the same line. Different letters means p-value < 0.05, where: A ≠ B.
Comet assay with acute exposure. A - Number of cells with a score of 0 in each study group regardless of exposure time (a x b: p-value < 0,0001 - Dunn test). B - Number of cells with a score of 1 in each study group regardless of exposure time (a x b: p-value = 0.2013 - Dunn test). C - Number of cells with a score of 2 in each study group regardless of exposure time (a x b: p-value = 0,0180 - Dunn test). D - Number of cells with a score of 3 in each study group regardless of exposure time (a x b and b x c: p-value < 0,0001 - Dunn test). E - Number of cells with a score of 4 in each study group regardless of exposure time (a x b: p-value < 0,0001 - Dunn test). C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. °: outlier and animal number.
There was no difference in feed intake between exposed and unexposed groups (p > 0.05).
Micronucleus testThe median micronuclei in the inhaled control (CI), oral control (CO), BCI, BCO, MCI, MCO, ACI, ACO and CPc (positive control) groups was 0, 0, 1.5, 3, 4, 6, 5, 5 and 9, respectively. Figure 4 shows that the following groups had differences (p < 0.0001): between the CI group and the ACI and CPc groups; between the CO group and the MCO, ACO and CPc groups; and between the CPc group and the BCI and BCO groups. There was no difference between the routes of exposure (oral and inhaled) (p > 0.05).
Count of micronuclei per study group (median and interquartile range) with chronic exposure. CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group; CPc: positive control of chronic exposure. °,*: outlier and animal number. a x c and b x d: p-value < 0,0001 – (Dunn test).
The minimum bioavailability of 90% was verified in the samples.
Regarding cells with a score of 0, the following groups showed differences (p < 0.0001): between the CI group and the MCI and ACI groups; between the CO group and the MCO and ACO groups; between the BCO and MCO groups; and between the CPc group and the MCO, ACI and ACO groups. Regarding cells with a score of 1, the following groups showed differences (p < 0.0001): between the CI and ACI groups; between the BCI and ACI groups; between the BCO groups and the MCO and ACO groups; and between the CPc group and the ACI and ACO groups. Regarding cells with a score of 2, there was a difference between the CI and ACI groups as well as between the CO group and the ACO and CPc groups (p < 0.0001). For cells with a score of 3, the following groups showed differences (p < 0.0001): between the CI group and the MCI and ACI groups; between the CO group and the MCO and ACO groups; and between the BCO group and the MCO and ACO groups. For cells with a score or 4, the following groups showed differences (p <0.0001): between the CO group and the MCO and ACO groups; between the BCO and MCO groups; and between the CPc groups and the MCO and ACO groups. There was no difference between the routes of exposure (oral and inhaled) (p > 0.05) (Fig. 5).
Comet assay with chronic exposure. A - Number of cells with a score of 0 in each study group regardless of exposure time (a x c; b x d; and c, d x e: p-value < 0,0001 - Dunn test). B - Number of cells with a score of 1 in each study group regardless of exposure time (a x c; b x d; and c, d x e: p-value < 0,0001 - Dunn test). C - Number of cells with a score of 2 in each study group regardless of exposure time (a x c and b x d: p-value < 0,0001 - Dunn test). D - Number of cells with a score of 3 in each study group regardless of exposure time (a x c and b x d: p-value < 0,0001 - Dunn test). E - Number of cells with a score of 4 in each study group regardless of exposure time (b x c and c x d: p-value < 0,0001 - Dunn test). CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group; CPc: positive control of chronic exposure. °,*: outlier and animal number.
Confusion matrix analyses were performed individually according to the following genotoxic characteristics: comet and micronucleus (Supplemental data). Different machine learning algorithms (k-NN, SVM and DT) were used due to the accuracy of the training sample size effect. The DT algorithm was selected for the analysis and construction of the confusion matrix in most datasets as it was the one with the best values in relation to the size effect of the training sample (accuracy oscillated between 40 and 80%) (Fig. 6).
Accuracy of classification and different sizes of the training set: Dataset2. k-NN: K-Nearest neighbors; SVM: Support Vector Machine; DT: Decision Tree.
Table 4 lists the confusion matrix for Dataset1 where reasonable accuracy was observed for group C (56%). This conclusion occurred due to the 5 classes present in the analyzed dataset (C, BC, MC, AC and CPa) where a random classification process would have an accuracy of approximately 20% (100% / 5 classes = 20%). The BC class presented low accuracy (29%), and the MC class presented very good accuracy (73%). The AC class presented reasonable accuracy (47%), and CPa showed excellent accuracy (95%).
True class (%) | Classified as (%) | ||||
---|---|---|---|---|---|
C | BC | MC | AC | CPa | |
C | 56 | 26 | 8 | 9 | 0 |
BC | 30 | 29 | 19 | 22 | 0 |
MC | 7 | 9 | 73 | 11 | 0 |
AC | 6 | 21 | 25 | 47 | 0 |
CPa | 0 | 4 | 1 | 0 | 95 |
C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. DT: Decision Tree.
Regarding the experiment setup for Dataset2 (Table 5), very good accuracy was observed for BC+MC+AC (81%). A random classification was demonstrated around an accuracy of approximately 33.33%. As a result, class C presented reasonably low accuracy (47%), and CPa presented excellent accuracy (93%).
True class (%) | Classified as (%) | ||
---|---|---|---|
C | BC+MC+AC | CPa | |
C | 47 | 53 | 0 |
BC+MC+AC | 19 | 81 | 0 |
CPa | 0 | 7 | 93 |
C: saline group; BC: low 2,4-D concentration group; MC: middle 2,4-D concentration group; AC: high 2,4-D concentration group; CPa: positive control of acute exposure. DT: Decision Tree.
In Dataset3, k-NN was used as this algorithm presented the best value in relation to the training sample size effect (accuracy oscillated between 40 and 60%) (Fig. 7).
Accuracy of classification and different sizes of the training set: Dataset3. k-NN: K-Nearest neighbors; SVM: Support Vector Machine; DT: Decision Tree.
Excellent accuracy was noted in CO (100%) and CI (84%). A random classification was demonstrated around an accuracy of approximately 11.11%. The BCI (67%), BCO (57%) and ACI (55%) classes showed reasonable accuracy. The MCI (17%), MCO (24%) and ACO (15%) classes showed reasonably low accuracy, and CPc showed excellent accuracy (100%) (Table 6).
True class (%) | Classified as (%) | ||||||||
---|---|---|---|---|---|---|---|---|---|
CI | CO | BCI | BCO | MCI | MCO | ACI | ACO | CPc | |
CI | 84 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
CO | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
BCI | 0 | 0 | 67 | 0 | 6 | 21 | 3 | 3 | 0 |
BCO | 5 | 0 | 22 | 57 | 11 | 0 | 3 | 0 | 3 |
MCI | 0 | 0 | 38 | 3 | 17 | 10 | 24 | 7 | 0 |
MCO | 0 | 0 | 0 | 0 | 17 | 24 | 31 | 28 | 0 |
ACI | 0 | 0 | 0 | 0 | 3 | 30 | 55 | 12 | 0 |
ACO | 0 | 0 | 0 | 0 | 30 | 11 | 44 | 15 | 0 |
CPc | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group; CPc: positive control of chronic exposure. k-NN: k-Nearest neighbors.
Regarding the confusion matrix for Dataset4, there was excellent accuracy in CO (100%) and very good accuracy in CI (88%) and CPc (85%). The BCI+MCI+ACI (64%) and BCO+MCO+ACO (61%) classes showed good accuracy (Table 7).
True class (%) | Classified as (%) | ||||
---|---|---|---|---|---|
CI | CO | BCI+MCI+ACI | BCO+MCO+ACO | CPc | |
CI | 88 | 3 | 6 | 3 | 0 |
CO | 0 | 100 | 0 | 0 | 0 |
BCI+MCI+ACI | 0 | 0 | 64 | 36 | 0 |
BCO+MCO+ACO | 4 | 0 | 31 | 61 | 3 |
CPc | 0 | 0 | 0 | 15 | 85 |
CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group; CPc: positive control of chronic exposure. DT: Decision Tree.
When analyzing the data in isolation, the BCI+MCI+ACI and BCO+MCO+ACO classes showed confusion between them. Thus, a grouped analysis was performed as shown in Dataset5.
In the confusion matrix for Dataset5, there was good accuracy for CI+CO (78%) and excellent accuracy for BCI+MCI+ACI+BCO+MCO+ACO (98%) and CPc (100%) (Table 8).
True class (%) | Classified as (%) | ||
---|---|---|---|
CI+CO | BCI+MCI+ACI+BCO+MCO+ACO | CPc | |
CI+CO | 78 | 22 | 0 |
BCI+MCI+ACI+BCO+MCO+ACO | 2 | 98 | 0 |
CPc | 0 | 0 | 100 |
CI: inhalation control group; CO: oral control group; BCI: inhalation low 2,4-D concentration group; BCO: oral low concentration group; MCI: inhalation medium concentration group; MCO: oral medium concentration group; ACI: inhalation high concentration group; ACO: oral high concentration group; CPc: positive control of chronic exposure. DT: Decision Tree.
In the present study, we observed that acute inhalation exposure to the 2,4-D caused DNA damage at the level of a single cell in the comet assay and that chronic exposure resulted in an increase in the number of micronuclei and DNA damage at the level of a single cell regardless of the route of exposure. Machine learning showed a distinction effect between the control groups and those exposed to 2,4-D. Our study added one more data favoring the genotoxicity of 2,4-D. And this data was obtained under conditions and concentrations of environmental exposure (oral and inhaled) to 2,4-D.
Inhalation exposure is the main route of occupational exposure, while the oral route, through contaminated food or water, is the main route of non-occupational exposure, not only for exposure to the 2,4-D, but also to other pesticides (RED, 2005). Therefore, in this study, we chose to evaluate these two routes of exposure.
The significant increase in the use of pesticides has caused numerous harmful effects on the health of individuals and animals (Siqueira et al., 2012). The micronucleus test and comet assay are techniques used to assess the genotoxic and mutagenic changes that may precede the development of cancer (Hovhannisyan, 2010).
In acute exposure, the different concentrations of 2,4-D tested did not increase the micronucleus count in the mice compared to the control group. In a previous study that treated male and female mice with 2,4-D by gavage at different concentrations (300, 675, 1050 and 1800 mg/kg) in a single time interval, there was also no increase in micronuclei in the bone marrow cells (Charles et al., 1999).
Although we did not observe an increase in the number of micronuclei in acute exposure, exposure to medium and high concentrations of 2,4-D induced DNA damage in the comet assay (score 4) regardless of the number of nebulizations. The same results were observed in a previous study using Chinese hamster ovary cells and the comet assay. In this previous study, ovary cells were treated with 2.6 and 10 μL/mL 2,4-D in a single dose, resulting in a dose-dependent increase in the frequency of DNA strand breaks due to the herbicide (González et al., 2005). In another study using mice treated at doses of 1.7, 3.3 and 33 mg/kg of body weight by gavage for 3 and 5 consecutive days, 2,4-D showed a genotoxic effect (Amer and Aly, 2001). However, these studies were performed using high doses of 2,4-D and by routes that do not reflect human exposure.
More than one genotoxicity or mutagenicity test must be performed to assess the presence of DNA damage as well as to assess the DNA damage type (Hovhannisyan, 2010). The micronucleus test and the comet assay assess different damage. The micronucleus test measures chromosomal loss and/or breakage of a chromosome. Therefore, the micronucleus test evaluates clastogenic substances (which cause chromosome breakage) or aneugenic substances (which lead to whole chromosomes not completing anaphasic migration of cell division). The comet assay detects breaks in a DNA strand and alkali-labile sites, breaks in both strands of DNA as well as breaks in sites where there has been incomplete repair involving excision of the bases. In the comet assay, the amount of DNA in the tail is proportional to the number of strand breaks. Thus, larger comet tails indicate greater breaks (Hovhannisyan, 2010). This difference between the tests justifies the observation of no increase in the number of micronuclei but an increase in damage in the comet assay with acute exposure to 2,4-D. The observation of micronuclei induction suggests that at least some of the damage induced by environmental exposure to medium and high concentration 2,4-D is not repaired.
There are few studies in the literature that have performed chronic experimental exposure to pesticides. With chronic exposure, we observed a significant increase in micronuclei in all exposed groups regardless of the route of exposure. In addition, we observed a significant increase in the score of 4 (greater damage) in the comet assay in all exposed groups, except in the group exposed to low oral concentration of 2,4-D, regardless of the route of exposure. Previous studies with long-term human exposure to pesticides have shown an increase in the tail length of comets in the exposed groups (Ali et al., 2018) and an increase in the number of micronuclei in exfoliated oral epithelial cells (Tomiazzi et al., 2018). Although it was not statistically significant, we observed a slight increase in micronucleus number in acute exposure of animals exposed to high concentration in the 72 hr interval but with damage recovery after 192 hr of exposure. These data agreed with the findings of the chronic phase, indicating a higher probability of longer exposure time resulting in the formation of micronuclei. Our work corroborates the genotoxic potential of pesticides, specifically 2,4-D, mainly in chronic exposure.
The carcinogenicity of 2,4-D may be associated with a concentration of dioxins. Some studies have shown a small increase in the risk of cancer in workers exposed to herbicides contaminated with higher chlorinated dioxins (Kogevinas et al., 1997). However, other studies show that there is no increased risk for all types of cancer at lower levels of exposure to dioxins (McBride et al., 2018). In our study, the 2,4-D formulation is in accordance with the concentration of dioxins assessed as permissible by IARC (IARC, 2015). Thus, the genotoxic effects that we observed do not appear to be associated with dioxins.
Although we observed greater DNA damage both to the micronucleus test and the comet assay in both phases of exposure (acute and chronic) at higher doses of 2,4-D, this increase in damage was not statistically significant. This data shows the possibility that 2,4-D has a dose at which it has maximum genotoxic effects. And from this dose, the increase in damage is small to the point of not being significantly different upon exposure to higher doses. Studies that evaluate the internal dose of 2,4-D for each animal exposed to the concentrations of environmental exposure may provide a better understanding of the possible dose-response effects.
Recognition pattern is one of the most prominent functions in Artificial Intelligence (Papa et al., 2012). Different studies have shown the effectiveness of using machine learning in the biological and medical fields (Yılmaz et al., 2016; Tomiazzi et al., 2018; Vieira et al., 2019). The evaluation of data made through machine learning, makes the analysis non-subjective, which can happen in relation to statistical analysis.
In the acute phase, the machine learning analysis identified a pattern in the distribution of genotoxic effects in groups exposed to the herbicide at different concentrations (Dataset2) with the classification performance of groups exposed to 2,4-D (BC+MC+AC) being 81%. This finding indicated that there is a clear difference between groups exposed and not exposed to 2,4-D. This difference is probably associated with the results of the comet assay, in which DNA damage was observed in groups exposed to 2,4-D.
In the same dataset, the accuracy of the control group (C) was equal to 47%, but 53% were confused with groups exposed to 2,4-D (BC+MC+AC). This confusion between the control group exposed to saline solution (C) and the groups exposed to 2,4-D may be associated with the fact that we did not find any difference in the micronucleus count between these two groups.
In the chronic phase with the first analysis (Dataset3), the control groups showed excellent accuracy ranging from 84 to 100%, but the groups exposed to 2,4-D showed confusion. When we grouped based on inhalation exposure to 2,4-D (BCI+MCI+ACI) and oral exposure to 2,4-D (BCO+MCO+ACO), the accuracy of these groups was good (64 and 61%, respectively), showing no difference between concentrations. However, these groups were still confused. When unifying all groups exposed to 2,4-D (BCI+MCI+ACI+BCO+MCO+ACO) (Dataset5), the accuracy was excellent (98%). The algorithm recognized them more robustly (98%), showing that chronic exposure to 2,4-D can cause a genotoxic effect regardless of the route of exposure and concentration.
The machine learning analysis reinforced the results found in the statistical analysis of both the acute and chronic phases of the experiment. In addition, machine learning analysis showed to be an effective tool in identifying patterns in the results of genotoxic characteristics.
The present study concluded that 2,4-D causes both clastogenic or aneugenic damage as well as DNA strand breakage through acute and chronic exposure to environmental concentrations relevant to human exposure. Machine learning showed that the effects of exposure are neither concentration-dependent nor route-dependent but that there is a clear distinction between the control groups and those exposed to 2,4-D. Thus, greater care is needed in the application of 2,4-D in crops because its use can cause health risk.
This work was funded by research funds from the Universidade do Oeste Paulista (UNOESTE) and carried out with support from the Coordination for the Improvement of Higher Education Personnel - Brazil (CAPES) - Financing Code 001. F. M. Azevedo was a CAPES fellow. P. H. N. Chagas was a fellow in the Scientific Initiation Scholarship Program of the National Council for Scientific and Technological Development (PIBIC / CNPq). The authors would like to thank the technician from the Genetics Laboratory of the Universidade do Oeste Paulista (UNOESTE), Mayara de Oliveira Vidotto Figueiredo, for the help in carrying out the Comet Assay.
Conflict of interestThe authors declare that there is no conflict of interest.