Informatics
study if the doses are too high, allowing for an early remediation if needed during the study conduct.
Example 4: Advanced analytics and predictive modelling A variety of opportunities to perform more advanced analytics and predictive modelling emerge if the challenges of study formatting and data cap- ture into large databases are overcome. These opportunities range from implementing traditional statistical methods in an automated manner, to more sophisticated data-led and machine learning tech- niques. From a single study perspective, novel imple- mentations could include automated methods to surface anomalous findings, generating dose response models for anomalous findings, and fitting simple pharmacokinetic and pharmacodynamic models to study data. When multiple studies are available, then emphasis can shift to either analysis of multiple compounds against any given study end point, or to a study end point specific focus, where the behaviour of endpoints can be investigated. Hierarchical dose response models, that fit end point specific data for multiple compounds in a sin- gle model and allow subsequent ordering of potency have been developed but are not yet available com- mercially. In addition, endpoint specific analysis that reveals, for instance, the background incidence rate of histopathology findings or putative maximum physiological responses for laboratory tests, have already been performed on large datasets of preclin- ical data, but not yet on data formatted in the SEND standard (see Figure 4). With the emergence of these approaches,
enabled by adoption of SEND as an exchange for- mat, the field will move towards increasing automation and more data-led toxicological pre- dictions. Since data will be stored consistently in databases, it will become easier to generate data representations for any given compound that link compound with target (primary and secondary) and with perturbed preclinical endpoints. This enables toxicology to engage with system biology approaches that offer another approach to the pre- diction of clinical adverse events.
Barriers to advanced analytics The tools are available to leverage cloud resources to share standard formatted data across research sites and partners. However, challenges remain in leveraging SEND datasets to their full capacity. Sponsors may lack the infrastructure (eg
resources, budget) to maintain a database and visu- alisation tools. Also, instituting that infrastructure often requires staff to change business processes to
Drug Discovery World Winter 2018/19
connect data science with operations, particularly when moving from fragmented, locally-owned data sources to centralised databases that offer richer data but more complexity. As a result, shared data that is available may be overanalysed or even misinterpreted. In addition, there are key data omissions from
the SEND standard that will limit its utility in terms of advanced analytics. For instance, there is no means to capture compound structure within the standard. In order to develop Quantitative Structure Activity Relationship (QSAR)-based models from the datasets, compound structure data must be linked with a SEND data warehouse. Furthermore, there remain issues with the homo-
geneity by which key information is captured in the standard by different partners. There is arguably too great a flexibility in how the representation of study design, dose group identification and study elements are captured in the format. However, these issues could be overcome with a focused effort by stakeholders.
Precompetitive SEND use cases for safety/tox Widespread use of the SEND standard will facilitate precompetitive collaborations by providing a stan- dard format for sharing preclinical data and reducing the amount of harmonisation needed in order to cre- ate integrated datasets. The Innovative Medicines Initiative (IMI) project, Enhancing TRANslational SAFEty Assessment through Integrative Knowledge Management (eTRANSAFE)5, aims to develop just such a dataset, utilising data from 12 pharmaceutical companies for translational safety analysis. In partic- ular, it plans to address the following questions:
lWhat is the relevance of preclinical safety studies for clinical study safety assessment? l Can we refine preclinical experiments to better predict clinical safety? l Can we build predictive models of safety out- comes from the data?
By supplementing the SEND data with informa-
tion on the structure and pharmacology of the drug, the dataset will support translational research analytics, such as determining the possible liabilities for new drug candidates or identifying mechanistic hypotheses for adverse effects. As the amount of data in SEND format evolves,
and by combining it with legacy data in the same format, it will be possible to identify the back- ground incidence of histopathological findings along with the most common drug-related
59
Page 1 |
Page 2 |
Page 3 |
Page 4 |
Page 5 |
Page 6 |
Page 7 |
Page 8 |
Page 9 |
Page 10 |
Page 11 |
Page 12 |
Page 13 |
Page 14 |
Page 15 |
Page 16 |
Page 17 |
Page 18 |
Page 19 |
Page 20 |
Page 21 |
Page 22 |
Page 23 |
Page 24 |
Page 25 |
Page 26 |
Page 27 |
Page 28 |
Page 29 |
Page 30 |
Page 31 |
Page 32 |
Page 33 |
Page 34 |
Page 35 |
Page 36 |
Page 37 |
Page 38 |
Page 39 |
Page 40 |
Page 41 |
Page 42 |
Page 43 |
Page 44 |
Page 45 |
Page 46 |
Page 47 |
Page 48 |
Page 49 |
Page 50 |
Page 51 |
Page 52 |
Page 53 |
Page 54 |
Page 55 |
Page 56 |
Page 57 |
Page 58 |
Page 59 |
Page 60 |
Page 61 |
Page 62 |
Page 63 |
Page 64 |
Page 65 |
Page 66 |
Page 67 |
Page 68