Human Health

Chromosomal aberration

Endpoint

In vitro chromosomal aberrations (CA) test identifies agents that cause structural chromosome aberrations in cultured mammalian cells [1]. Chromosomal aberrations are changes affecting chromosome morphology. In general, CA involves breaking of chromosome segments, their loss or union with same or different chromosomes.

Data

Collected are 1077 chemicals with documented CA data without accounting for S9 metabolic activation [2]. Of them, positive as parents are 438 chemicals, whereas 639 chemicals are CA negative. Chemicals which are positive as parents have been used for deriving the alerts (see next section). CA data obtained after S9 metabolic activation have been collected for another set of chemicals (397). Of them, positive are 99 chemicals (which are negative as parents) and 298 chemicals are CA negative (as parents and metabolites). These two types of training sets are considered as biologically dissimilar in the modeling process; i.e., chemicals being mutagenic as parents are distinguished from chemicals, which are mutagenic after metabolic activation. These subsets are subsequently modeled separately by the TIMES structural alerts approach. In this respect, there are two CA models: with and without S9 metabolic activation included in the TIMES system.

Alerts in CA models

Direct binding of chemicals to DNA is one of the underlying mechanisms that is responsible for CA mutagenicity [3]. Disturbance of protein synthesis due to inhibition of topoisomerases and interaction of chemicals with nuclear proteins associated with DNA (e.g., histone proteins) are identified as additional mechanisms also leading to positive CA effect. Based on that, the toxicodynamic component of the CA models, i.e. alerts (functionalities responsible for eliciting CA) are derived to account for interactions with DNA and/or proteins. Alerts are developed by chemicals which are positive as parents (one should be reminded that chemicals which are positive only after metabolic activation are negative as parents). Alerts associated with the CA models consist of boundaries, which provide the minimum structural requirements for interacting with DNA and/or proteins. Usually, additional structural and parametric boundaries are required for completing definition of alerts.

A subset of the training set is associated with each alert. In fact, these are chemicals used to define alerts boundaries. Cardinality of these local training sets depends on the number of substances captured by the alert boundaries. Only those alerts are included in the model for which a clear interpretation of the molecular interaction mechanism could be provided. Currently, the total number of alerting groups included in the TIMES CA models is 124, where 91 of them are associated with DNA binding and 33 are associated with protein binding [4].

CA model without accounting for metabolic activation (TIMES_CA (-S9))

The training set of the CA model (-S9) comprises chemicals which are positive (438) and negative (639) as parents. In this respect, the model predicts DNA and/or protein damaging effect of parent chemicals, only. Positive predictions are obtained when the complete set of structural/parametric boundaries of the alert(s) are met in the parent chemicals.

According to the performance of TIMES_CA model (-S9) when applied on the training set chemicals, correct predictions are provided for:

• 362 out of 438 observed positive parents (Sensitivity = 83%)
• 580 out of 639 observed negative parents (Specificity = 91%)

CA model accounting for metabolic activation (TIMES_CA (+S9))

The training set of the CA model (+S9) comprises substances which are positive (99) and negative (298) after metabolic activation. Since both CA models (-S9 and +S9) share the same alerts, the CA model (+S9) predicts DNA and/or protein damaging effect of parent chemicals and their metabolic products, obtained after S9 metabolic activation. The toxicokinetic component of the model consists of a metabolic simulator which is trained to reproduce documented maps for mammalian liver metabolism of 261 chemicals. The number of generated metabolites is manageable which allows combining in a single modeling platform metabolic simulation of chemicals and their interaction with target macromolecules. The S9 metabolic simulator has its own training set which includes 542 metabolic reactions:

• Phase I transformations, such as aliphatic C-oxidation, aromatic C-hydroxylation, oxidative N- and O-dealkylation, epoxidation, ester and amide hydrolysis, carbonyl group reduction, nitro group reduction, N-hydroxylation, etc.

• Phase II transformations, such as glucuronidation, sulfation, glutathione conjugation, N-acetylation, etc.

A characteristic feature of the in vitro metabolic simulator is the significant prevalence of Phase I reactions (85-90%) over Phase II reactions, which reflects the specificity of the in vitro experimental systems with respect to their metabolic capacity [5]. The metabolic simulator cannot estimate the reaction rates (e.g. kinetic constants, half-lives, etc.) but is able to provide tentative information for their probability. The logic of the in vitro CA model (+S9) is illustrated in Figure 1:

Figure 1. Illustration of the integral CA mutagenicity prediction based on metabolic simulator and defined structure-activity relationships (SARs).

Parent chemicals and each of the generated metabolites are submitted to a battery of SARs (alerts) to screen for a general CA effect and mutagenicity mechanisms. Thus, chemicals are predicted to be mutagenic as parents only, parents and metabolites, and metabolites only.

According to the performance of TIMES_CA model (+S9) when applied on the training set chemicals, correct predictions are provided for:

• 71 out of 99 observed positive after metabolic activation (Sensitivity = 72%)
• 223 out of 298 observed negative after metabolic activation (Specificity = 75%)

Domain

Each of the TIMES CA models (-S9 and +S9) has its own model applicability domain, which supports the CA predictions [6]. The applicability domains consist of three sub-domain layers: general parametric requirements, structural features and alert(s) reliability. Two chemical subsets are used for deriving the model domains. The first subset includes the training set chemicals which are correctly predicted by the models, whereas the second subset comprises training set chemicals which are incorrectly predicted by the models.
The correct chemical subset is used for defining the general parametric requirements. Extracted are specific ranges of the molecular weight (MW) and the 1-octanol/water partition coefficient (log KOW):

• Molecular weight MW (in Da) ϵ [44; 1,416],
• log KOW ϵ [-10, 30].

The atom-centered fragments extracted from the correct subset of chemicals are used to define the structural domain. Briefly, the structural domain is assessed based on atom-centered fragments, extracted from correctly and incorrectly predicted (i.e., false positives and false negatives) substances from the model training sets by accounting for the atom type, hybridization and attached H-atoms of the central atom and its first neighbours. If the neighbour is a heteroatom then the diameter of the fragment is increased up to three consecutive heteroatoms or to the first carbon atoms in sp3 hybridization. In order to assess if a new chemical belongs to the structural domain, the system partitions the chemical to atom-centered fragments, which are then matched to the fragments extracted from the correct and incorrect chemical subsets. The new chemical is estimated to belong to the structural domain only when its atom-centered fragments are found in the list of correct fragments.

The third level of the domain accounts for reliability of alerts. Currently, information for alerts reliability is provided in the model reports.

Reporting

The predictions of in vitro Chromosomal aberrations models could be reported as:

• A tab delimited text file providing the following information for chemical: identity (CAS number, Name, SMILES), observed CA mutagenicity, predicted CA mutagenicity, information for the alert(s) responsible for DNA and/or protein interaction, alert reliability and performance, applicability domain, etc.

• QSAR Prediction Reporting Format (QPRF) is available, generating a report for one chemical or group of chemicals.

The TIMES CA (+S9) model provides information for:

• Simulated metabolic map,
• Positive parents and/or metabolites,
• Mechanisms of DNA and/or protein damage,
• Local training set of alerts,
• Performance of alerts,
• Different types of reporting
• …..

Decision support character of TIMES_CA models

Users of the TIMES model may assess reliability of the predictions by evaluating the amount of mechanistic justification supporting the ultimate prediction. In this respect, the modeling results could be considered as decision supporting rather than decision making outcome.

References

1. Evans, H.J. 1976. Cytological Methods for Detecting Chemical Mutagens. In: Chemical Mutagens, Principles and Methods for their Detection, Vol. 4, Hollaender, A. (ed) Plenum Press, New York and London, pp. 1-29.
2. Sofuni, T., Ed. 1998. Data Book of Chromosomal Aberration Test in vitro, Revised Edition. Life-Science Information Center, Tokyo, Japan.
3. Mekenyan, О., Todorov, М., Serafimova, Р., Stoeva, С., Aptula, А., Finking, Р., Jacob, Е. 2007. Identifying the structural requirements for chromosomal aberration by incorporating molecular flexibility and metabolic activation of chemicals. Chem. Res. Toxicol. Vol. 20, pp. 1927−1941.
4. Petkov, P.I., Schultz, T.W., Donner, E.M., Honma, M., Morita, T., Hamada, Sh., Wakata, A., Mishima, M., Maniwa, J., Todorov, M., Kaloyanova, E., Kotov, S., Mekanyan, O. 2016. Integrated approach to testing and assessment for predicting rodent genotoxic carcinogenicity, J. Appl. Toxicol. 36(12): 1536-1550.
5. Mekenyan, O., Dimitrov, S., Pavlov, T., Dimitrova, G., Todorov, M., Petkov, P., Kotov, S. 2012. Simulation of chemical metabolism for fate and hazard assessment. V. Mammalian hazard assessment, SAR and QSAR in Environmental Research, 23:5-6, 553-606.
6. Dimitrov, S., Dimitrova, G., Pavlov, T., Dimitrova, N., Patlewicz, G., Niemela J., Mekenyan, O. 2005. J. Chem. Inf. Model., Vol. 45, pp. 839-849. .

Chromosomal Aberratitions

Model Features

Simulated metabolic map

Highlighting positive parents and/or metabolites

Mechanisms of DNA and/or protein interaction associated with each of the alerts

Local training set of alerts

Performance of alerts