
We have developed a machine learning and rule-based tool using natural language processing which allows a user to upload a trial protocol, and which categorises the protocol as high, medium or low risk of ending uninformatively. The tool is at https://clinicaltrialrisk.org/tool and is open-sourced on Github. You can read an explanation of how the tool works here, and a description of how we validated its accuracy here.
There are several indicators of high risk of uninformativeness which can be identified in a protocol, such as a lack of and or an inadequate statistical analysis plan, use of non-standard endpoints, or the use of cluster randomisation. One of the most common causes of a trial ending uninformatively is underpowering. Low-risk trials are often run by well-known institutions with external funding and an international or intercontinental array of sites. These indicators can be referred to as features or parameters.
This project is an initial Proof of Concept (POC) which to showcase what is possible with natural language processing, with a view to moving towards a more comprehensive main project which may identify a more complete set of cost, complexity, or uninformativeness risk factors.
The tool is designed with a feedback form so that inaccurate data extractions can be reported back to the developers.
In addition the MIT License means that you are free to add features or extend the scope of the tool.
We hope that researchers who are considering submitting a protocol of a trial to a prospective source of funding will be able to use the tool as a kind of checklist to ensure that their trial is designed to reduce risk and increase the prospects of being funded.

Thomas Wood presents the Clinical Trial Risk Tool at the Clinical AI Interest Group at Alan Turing Institute The Clinical AI Interest group is a community of health professionals from a broad range of backgrounds with an interest in Clinical AI, organised by the Alan Turing Institute. In the group’s November 2025 meeting, the talk was given by Dr Jeff Hogg, Programme Director, MSc AI Implementation (Healthcare), University of Birmingham and Clinical Innovation Officer in AI, University Hospitals Birmingham NHSFT, titled AI Readiness for Health and Care Provider Organisations.
Guest post by Safeer Khan, Lecturer at Department of Pharmaceutical Sciences, Government College University, Lahore, Pakistan Multi-Arm & Multi-Stage (MAMS) Clinical Trials Design Tips The design of clinical trials is increasingly challenged by the Rising Costs, limited availability of eligible patient populations, and the growing demand for timely therapeutic evaluation. Traditional parallel-group designs, which typically compare a single intervention to a control, are often insufficient to meet these pressures in terms of speed, efficiency, and resource utilization.

You can use the t-test when you want to compare the means (averages) of continuous data between two groups, such as blood pressure or maximum concentration of a drug in urine (Cmax). If you have data with a dichotomous outcome, you can use the Chi-Squared test instead - please try our Chi-Squared sample size calculator. The calculator below will calculate the minimum sample size for you. Your expected effect size d is the standardised effect size according to Cohen’s definition.