
We should approach clinical articles with critical thinking rather than blind trust. Every scientific study should be critically appraised for its validity, reliability, and applicability before it is integrated into clinical practice. I was a clinical researcher and hold a PhD in clinical medicine. I have conducted many clinical studies and always teach my students how important "EBM" (Evidence-Based Medicine) is. Surprisingly, a lot of dental students in the U.S. don't know what "EBM" is.
Determining the Level of Evidence and Grading a Scientific Article for Clinical Recommendation in Evidence-Based Medicine (EBM)
Evidence-Based Medicine (EBM) integrates clinical expertise, patient values, and the best available research evidence to guide clinical decision-making. The ability to assess the level of evidence and grade recommendations based on scientific articles is crucial for ensuring high-quality patient care.
Levels of Evidence (The Pyramid)
Levels of evidence classify clinical research studies based on their methodological rigor, reliability, and applicability to clinical practice. Various hierarchies exist, but a commonly used system follows the general structure shown in the picture:
Level I: Systematic Reviews and Meta-Analyses of Randomized Controlled Trials (RCTs)
- Considered the highest level of evidence
- Combines multiple high-quality studies to provide a comprehensive conclusion
Level II: Randomized Controlled Trials (RCTs)
- The gold standard for assessing interventions
- Minimizes bias through randomization and control groups
Level III: Cohort and Case-Control Studies
- Observational studies useful for assessing associations and risk factors
- Cohort studies follow subjects over time, while case-control studies compare groups retrospectively
Level IV: Case Series and Case Reports
- Descriptive studies without control groups
- Provide insights into rare conditions but lack generalizability
Level V: Expert Opinions
- Based on clinical expertise or theoretical models
- Least rigorous due to the absence of empirical data
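The five-level hierarchy above is essentially a lookup from study design to pyramid level. As a minimal sketch (the dictionary keys and function name are illustrative, not a standard API), it might look like:

```python
# Hypothetical sketch: the evidence pyramid above encoded as a lookup.
# Design names and level numbers follow the five-level hierarchy
# described in this article; lower numbers mean stronger evidence.

EVIDENCE_PYRAMID = {
    "systematic_review": 1,  # Level I: systematic reviews / meta-analyses of RCTs
    "meta_analysis": 1,
    "rct": 2,                # Level II: randomized controlled trials
    "cohort": 3,             # Level III: observational analytic studies
    "case_control": 3,
    "case_series": 4,        # Level IV: descriptive studies, no control group
    "case_report": 4,
    "expert_opinion": 5,     # Level V: no empirical data
}

def evidence_level(study_design: str) -> int:
    """Return the pyramid level (1 = strongest) for a study design."""
    key = study_design.lower().replace(" ", "_")
    if key not in EVIDENCE_PYRAMID:
        raise ValueError(f"Unknown study design: {study_design!r}")
    return EVIDENCE_PYRAMID[key]

print(evidence_level("RCT"))          # prints 2
print(evidence_level("case report"))  # prints 4
```

In practice, of course, classifying a study is a judgment call (e.g., a poorly conducted RCT may be weaker than a large, careful cohort study); the lookup only captures the nominal hierarchy.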
Grading Clinical Recommendations
Once the level of evidence is determined, recommendations are graded based on the quality, consistency, and clinical applicability of the evidence. The GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) system is widely used for this purpose.
The primary factors used in grading are (1) Quality, (2) Consistency, and (3) Clinical applicability of the evidence:
(1) The Quality of evidence reflects how confident we are that the true effect is close to the estimated effect. It is categorized into four levels: High, Moderate, Low, and Very Low. The assessment is based on the following criteria: risk of bias (flaws in study design or execution, lack of randomization, etc.), inconsistency (variation in results across studies), indirectness (evidence that does not directly apply to the population, intervention, or outcomes of interest), imprecision (wide confidence intervals, small sample sizes, etc.), and publication bias. In many systematic reviews, this assessment appears as a green-yellow-red "traffic light" table.
(2) Consistency refers to the reproducibility of results across multiple studies.
(3) Clinical Applicability (External Validity): The applicability of evidence is evaluated by considering:
- Relevance to the patient population
- Intervention feasibility (the intervention should be practical and accessible in real-world settings)
- Balance of benefits and harms
- Cost and resource considerations (the affordability and cost-effectiveness of the intervention should factor into decision-making)
- Patient values and preferences
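The quality-rating logic of GRADE can be sketched mechanically: evidence from RCTs starts at "High", observational evidence starts at "Low", and each serious concern among the five criteria above downgrades the rating one step. The function and parameter names below are illustrative, not part of any official GRADE tool, and upgrading factors (e.g., a very large effect in observational studies) are omitted for brevity:

```python
# Hypothetical sketch of GRADE-style quality rating: RCT evidence starts
# "High", observational evidence starts "Low", and each serious concern
# (risk of bias, inconsistency, indirectness, imprecision, publication
# bias) downgrades the rating one step. Upgrading factors are omitted.

QUALITY_LEVELS = ["Very Low", "Low", "Moderate", "High"]

VALID_CONCERNS = {"risk_of_bias", "inconsistency", "indirectness",
                  "imprecision", "publication_bias"}

def grade_quality(randomized: bool, concerns: list[str]) -> str:
    """Return a GRADE-style quality level for a body of evidence."""
    unknown = set(concerns) - VALID_CONCERNS
    if unknown:
        raise ValueError(f"Unknown concerns: {unknown}")
    start = 3 if randomized else 1  # index of "High" vs. "Low"
    return QUALITY_LEVELS[max(0, start - len(concerns))]

print(grade_quality(True, []))                                # High
print(grade_quality(True, ["imprecision"]))                   # Moderate
print(grade_quality(False, ["risk_of_bias", "imprecision"]))  # Very Low
```

This mirrors how GRADE summary-of-findings tables are built, though real appraisal also weighs how serious each concern is rather than simply counting them.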
By systematically assessing these factors, the evidence is translated into graded recommendations. (Strictly speaking, GRADE itself rates recommendations as strong or weak/conditional; letter grades like those below come from related grading schemes, but the underlying logic is the same.) The grades are commonly expressed as follows:
- Strong Recommendation (Grade A): High-quality evidence from RCTs or systematic reviews with consistent findings.
- Moderate Recommendation (Grade B): Evidence from well-conducted cohort or case-control studies.
- Weak Recommendation (Grade C): Based on lower-quality studies, expert opinions, or inconsistent findings.
- Insufficient Evidence (Grade D): Lacks substantial support for a definitive recommendation.
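Putting the pieces together, the letter-grade scheme above maps an evidence-quality rating and the consistency of findings to a recommendation grade. The thresholds in this sketch simply mirror the article's descriptions and are illustrative only:

```python
# Hypothetical sketch applying the letter-grade scheme above: map an
# evidence-quality rating and consistency of findings to a grade.
# Thresholds are illustrative, not an official grading algorithm.

def recommendation_grade(quality: str, consistent: bool) -> str:
    """Return a letter grade (A-D) for a body of evidence."""
    if quality == "High" and consistent:
        return "A"  # strong: consistent high-quality RCT/systematic-review evidence
    if quality == "Moderate":
        return "B"  # moderate: well-conducted observational evidence
    if quality in ("High", "Low"):
        return "C"  # weak: lower-quality or inconsistent findings
    return "D"      # insufficient evidence for a definitive recommendation

print(recommendation_grade("High", True))       # A
print(recommendation_grade("Moderate", True))   # B
print(recommendation_grade("Low", False))       # C
print(recommendation_grade("Very Low", False))  # D
```

Note how high-quality but inconsistent evidence falls to grade C in this sketch, reflecting the point above that inconsistency alone can weaken a recommendation.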
This systematic approach helps clinicians and policymakers make well-informed, transparent, and evidence-based healthcare decisions.