search.noResults

search.searching

saml.title
dataCollection.invalidEmail
note.createNoteMessage

search.noResults

search.searching

orderForm.title

orderForm.productCode
orderForm.description
orderForm.quantity
orderForm.itemPrice
orderForm.price
orderForm.totalPrice
orderForm.deliveryDetails.billingAddress
orderForm.deliveryDetails.deliveryAddress
orderForm.noItems
STATISTICS


Common mistake Starting with the test


Treating hypothesis testing as the whole analysis


Confusing difference from a target with acceptability


Ignoring paired data


Using repeated t-tests for multiple groups


Treating non-parametric tests as assumption-free


Reporting only P-values


Interpreting ‘not significant’ as ‘no difference’


Forgeting practical significance Using normality tests mechanically Leting software defaults decide


Choosing a one-tailed test after seeing the result


Why it maters


The analysis may not answer the laboratory question.


A P-value does not replace plots, summaries, effect sizes, confidence intervals, or judgement.


‘Significantly different’ does not always mean


unacceptable; ‘not significant’ does not prove acceptable. Treating linked results as independent can give


misleading conclusions.


Multiple testing increases the risk of false-positive findings.


Rank-based tests still have assumptions and may answer a different question.


Statistical significance does not show the size or importance of an effect.


A small or noisy study may simply lack power.


A statistically significant result may be too small to mater.


Normality testing alone should not dictate the method.


Software cannot know the laboratory question or data structure.


This can make evidence appear stronger than it is. Table 3. Some possible pitfalls if correct choice of statistical test is not undertaken.


produce higher values than another, but it may not directly estimate the mean concentration difference that maters for a specific analytical claim. Similarly, a statistically significant ANOVA result may show evidence of difference somewhere among the groups, but it does not identify the cause or determine whether the difference is operationally acceptable (Table 3).


Calculators


Here are some calculators to help with some of your analytical work.


htps://


pathologyuncertainty.com/ calculators/


Further reading MacDonald S. Distribution, outliers and


confidence intervals in laboratory data. Pathology in Practice 2026 May; 27 (3): 22–6.


Dr Stephen MacDonald is Consultant Clinical Scientist, The Specialist Haemostasis Unit, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Hills Road, Cambridge CB2 0QQ.


+44 (0)1223 216746 20 WWW.PATHOLOGYINPRACTICE.COM June 2026


Conclusions It is clear that software-driven statistics can be very helpful (when looking at the maths involved if we had to do it by hand! – Fig 1), but come at a cost that can quickly outdo the benefit of point and click interfaces. Just because we can run a statistical test, doesn’t mean we should. Statistical assumptions, study design and data quality are central to us making


the correct decision for what test to run, if at all! This is equally important for operational data in the laboratory, as it is in the research seting. If you are doing a research project, don’t collect your data then ask your colleagues what test you should be running – by that time it is too late.


It’s not to say that software is a bad thing – of course it isn’t, and in an ironic twist I have again developed a learning tool for one sample t-test that shows the complexity of the underlying assumptions with what is a simple test to run normally. If you are bored, feel free to have a play about with the app and see if it helps – there is lots of information about the mechanics of the calculations available in there. It is available at: htps:// pathologyuncertainty.com/calculators/ PPi


Beter approach


Define the question, comparison, outcome, and design first.


Use the test as one part of interpretation.


Compare results with predefined acceptability limits and uncertainty.


Use paired analysis for split samples, before/after data, or repeated measurements.


Use ANOVA, Welch ANOVA, Kruskal–Wallis, or planned comparisons as appropriate.


Choose them only when their interpretation matches the laboratory question.


Report the observed difference and confidence interval.


Treat non-significance as inconclusive unless equivalence was specifically tested.


Interpret findings against clinical, operational, or predefined criteria.


Use plots, study design, sample size, outliers, and assumptions together.


Select the test deliberately based on design and interpretation.


Use one-tailed tests only when justified and specified in advance.


Page 1  |  Page 2  |  Page 3  |  Page 4  |  Page 5  |  Page 6  |  Page 7  |  Page 8  |  Page 9  |  Page 10  |  Page 11  |  Page 12  |  Page 13  |  Page 14  |  Page 15  |  Page 16  |  Page 17  |  Page 18  |  Page 19  |  Page 20  |  Page 21  |  Page 22  |  Page 23  |  Page 24  |  Page 25  |  Page 26  |  Page 27  |  Page 28  |  Page 29  |  Page 30  |  Page 31  |  Page 32  |  Page 33  |  Page 34  |  Page 35  |  Page 36  |  Page 37  |  Page 38  |  Page 39  |  Page 40  |  Page 41  |  Page 42  |  Page 43  |  Page 44  |  Page 45  |  Page 46  |  Page 47  |  Page 48  |  Page 49  |  Page 50  |  Page 51  |  Page 52  |  Page 53  |  Page 54  |  Page 55  |  Page 56