We have developed a machine learning and rule-based tool using natural language processing which allows a user to upload a trial protocol, and which categorises the protocol as high, medium or low risk of ending uninformatively. The tool is at app.clinicaltrialrisk.org and is open-sourced on Github. You can read an explanation of how the tool works here, and a description of how we validated its accuracy here.
There are several indicators of high risk of uninformativeness which can be identified in a protocol, such as a lack of and or an inadequate statistical analysis plan, use of non-standard endpoints, or the use of cluster randomisation. One of the most common causes of a trial ending uninformatively is underpowering. Low-risk trials are often run by well-known institutions with external funding and an international or intercontinental array of sites. These indicators can be referred to as features or parameters.
This project is an initial Proof of Concept (POC) which to showcase what is possible with natural language processing, with a view to moving towards a more comprehensive main project which may identify a more complete set of cost, complexity, or uninformativeness risk factors.
The tool is designed with a feedback form so that inaccurate data extractions can be reported back to the developers.
In addition the MIT License means that you are free to add features or extend the scope of the tool.
We hope that researchers who are considering submitting a protocol of a trial to a prospective source of funding will be able to use the tool as a kind of checklist to ensure that their trial is designed to reduce risk and increase the prospects of being funded.
On 8 October, Thomas Wood of Fast Data Science presented the Clinical Trial Risk Tool, along with the Harmony project, at the AI and Deep Learning for Enterprise (AI|DL) meetup sponsored by Daemon. You can now watch the recording of the live stream on AI|DL’s YouTube channel below: The Clinical Trial Risk Tool leverages natural language processing to identify risk factors in clinical trial protocols. The initial prototype Clinical Trial Risk Tool is online at https://app.
Shining a Light on Clinical Trial Risk: Exploring Clinical Trial Protocol Analysis Software Clinical trials are the backbone of medical progress, but navigating their design and execution can be complex. Fast Data Science is dedicated to helping researchers by analysing clinical trial protocols through the power of Natural Language Processing (NLP). We are presenting a selection of software which can be used for clinical trial protocol analysis or clinical trial cost prediction and risk assessment.
Understand your clinical trials Are your clinical trials risky? Are costs running away? It’s very tricky to estimate clinical trial costs before a trial is run. Try our free cost calculator. This is a regression model, trained on real clinical trial data. Trial is for condition HIV Tuberculosis COVID Influenza Malaria Enteric and diarrheal diseases Neglected tropical diseases Polio Diabetes Pneumonia Hypertension (see full product) Motor neurone disease (see full product) Multiple sclerosis (see full product) Obesity (see full product) Sickle cell anemia (see full product) Stroke (see full product) Cystic fibrosis (see full product) Cancer (see full product) Other (see full product) Phase Early Phase 1 1 1.