TY - JOUR
T1 - CERAPP: Collaborative estrogen receptor activity prediction project
AU - Mansouri, Kamel
AU - Abdelaziz, Ahmed
AU - Rybacka, Aleksandra
AU - Roncaglioni, Alessandra
AU - Tropsha, Alexander
AU - Varnek, Alexandre
AU - Zakharov, Alexey
AU - Worth, Andrew
AU - Richard, Ann M.
AU - Grulke, Christopher M.
AU - Trisciuzzi, Daniela
AU - Fourches, Denis
AU - Horvath, Dragos
AU - Benfenati, Emilio
AU - Muratov, Eugene
AU - Wedebye, Eva Bay
AU - Grisoni, Francesca
AU - Mangiatordi, Giuseppe F.
AU - Incisivo, Giuseppina M.
AU - Hong, Huixiao
AU - Ng, Hui W.
AU - Tetko, Igor V.
AU - Balabin, Ilya
AU - Kancherla, Jayaram
AU - Shen, Jie
AU - Burton, Julien
AU - Nicklaus, Marc
AU - Cassotti, Matteo
AU - Nikolov, Nikolai Georgiev
AU - Nicolotti, Orazio
AU - Andersson, Patrik L.
AU - Zang, Qingda
AU - Politi, Regina
AU - Beger, Richard D.
AU - Todeschini, Roberto
AU - Huang, Ruili
AU - Farag, Sherif
AU - Abildgaard Rosenberg, Sine
AU - Slavov, Svetoslav
AU - Hu, Xin
AU - Judson, Richard S.
PY - 2016
Y1 - 2016
N2 - Background: Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program. oBjectives: We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing. Methods: CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies. results: Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing. conclusion: This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points.
AB - Background: Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program. oBjectives: We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing. Methods: CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies. results: Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing. conclusion: This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points.
U2 - 10.1289/ehp.1510267
DO - 10.1289/ehp.1510267
M3 - Journal article
C2 - 26908244
SN - 0091-6765
VL - 124
SP - 1023
EP - 1033
JO - Environmental Health Perspectives
JF - Environmental Health Perspectives
IS - 7
ER -