Lhasa Limited shared knowledge shared progress
  • Publisher:
    Lhasa Limited
  • Publication Date:
    Jun 2016
  • Reference:
  • DOI:
  • PMID:
  • Publication Type:
  • Related Products:
  • Scientific Area:
  • Endpoint:
  • Industry Type:
  • Related event:

Data Dos and Don'ts in Building Statistical Models For Ames Mutagenicity

pdf fileCayley A; Hanser T; Vessey J;

This Poster was presented by AlexCayley at the 2016 QSAR International Conference on Quantitative Structure-Activity Relationships


The ICH M7 guidance relating to the detection and control of potentially mutagenic impurities in drug substances places statistically-based QSAR systems in a key position in the decision-making process. It is, therefore, extremely important that any predictions made by models used in this role show high levels of accuracy and transparency. The underlying algorithms used to build these models undoubtedly play a major role in determining their accuracy and interpretability. However, the data used to train them will also have a significant effect on how well they perform. Consequently, the quality and quantity of this data should be carefully considered before any model is built.
With this in mind we undertook investigations into the various aspects of data which can affect statistical model performance and how these can be optimised to improve a statistical model. The statistical system, Sarah Nexus, produced by Lhasa Limited, was used in these studies along with a large training set built from publicly available Ames mutagenicity data in order to make the findings as general as possible. Two key aspects of the data sets used to build the models were investigated. Firstly, the structural representation used to define the substance which has been tested was considered. In addition, the quality of the biological results associated with each substance in the training set was also assessed.


© 2018 Lhasa Limited | Registered office: Granary Wharf House, 2 Canal Wharf, Leeds, LS11 5PS, UK Tel: +44 (0)113 394 6020
VAT number 396 8737 77 | Lhasa Limited is registered as a charity (290866)| Company Registration Number 01765239 (England and Wales).

Thanks to QuestionPro's generosity, we now have survey software that powers our data intelligence.