Rating Scale Comparison
Rating Scale Comparison
Quick Reference Tool | PsychHQ Source: Module 2, Assessment & Differential Diagnosis | Last Updated: February 2026
Foundational Principles
Rating scales are one component of a comprehensive ADHD evaluation — they are not the evaluation itself. No single rating scale, used in isolation, should be the basis for confirming or ruling out ADHD. (Pelham et al., 2005, J Clin Child Adolesc Psychol)
Think of rating scales like a thermometer: a high reading tells you something is going on and you need to look further, but it does not tell you exactly what is causing the fever. A high score on an ADHD rating scale means the next step is a full evaluation, not that the diagnosis is settled.
Head-to-Head Comparison Table
| Feature | Vanderbilt (VADRS) | Conners 4 | SNAP-IV | BRIEF-2 | CBCL | ASRS |
|---|---|---|---|---|---|---|
| Ages | 6–12 | 6–18 | 6–18 | 5–18 | 1.5–18 | Adults (18+) |
| Cost | Free | Licensed (per-use) | Free | Licensed | Licensed | Free |
| Items | 55 (Parent), 43 (Teacher) | Up to 118 (Full), ~50 (Short), 12 (Index) | 26 | 63 (Parent/Teacher), 55 (Self-Report) | 113 (Parent), 120 (Teacher) | 18 (Full), 6 (Screener) |
| Informants | Parent + Teacher | Parent + Teacher + Youth Self-Report (8–18) | Parent + Teacher | Parent + Teacher + Self-Report (11–18) | Parent + Teacher | Self-Report |
| ADHD symptom coverage | DSM-mapped inattention + hyperactivity-impulsivity | Inattention/Executive Dysfunction, Hyperactivity, Impulsivity subscales with age/sex-normed T-scores | Direct DSM symptom wording (18 ADHD + 8 ODD items) | Not an ADHD diagnostic tool; measures executive function | Attention Problems scale (not ADHD-specific) | DSM-mapped ADHD symptoms |
| Comorbidity screening | ODD, conduct disorder, anxiety, depression | Emotional Dysregulation, Depressed Mood, Anxious Thoughts, severe indicator items (self-harm, substance use) | ODD only | N/A (EF profile, not diagnostic) | Comprehensive: internalizing, externalizing, DSM-oriented | N/A |
| Performance/impairment items | Yes (academic + social) | Yes (dedicated Impairment + Functional Outcome Scales) | No | Yes (ecological EF) | Yes | No |
| Sensitivity | ~0.80 | ~0.83 (ADHD Index at T≥65) | Good (research standard) | N/A | N/A | High |
| Specificity | ~0.75 | ~0.85 (ADHD Index at T≥65) | Good | N/A | N/A | Low in general population |
| NPV (negative predictive value) | 0.98 (reliably rules out) | High | Good | N/A | N/A | High |
| PPV (positive predictive value) | 0.19 (most positives are false) | Higher than Vanderbilt | Moderate | N/A | N/A | Very low (~10% true positives in general screening) |
| Treatment monitoring | Yes (follow-up forms with side effect tracking) | Yes (short form) | Yes (primary use case) | Yes (track EF changes) | Not ideal (too long for repeated use) | Yes |
| Best use case | Primary care screening; treatment monitoring; free, fast, validated | Comprehensive clinical evaluation; broadest clinical coverage; multi-dimensional profile | Treatment monitoring; research; DSM symptom tracking | Executive function profiling; intervention planning | Broad psychopathology screening; detecting comorbidities | Adult ADHD screening (not diagnosis) |
Instrument-by-Instrument Clinical Guide
Vanderbilt ADHD Diagnostic Rating Scale (VADRS)
Reach for this when: You need a quick, free, validated screen in a busy pediatric practice. Also ideal for treatment monitoring since the follow-up forms include medication side effect tracking.
Psychometric highlights:
- Internal consistency: Cronbach's α 0.91–0.94 across subscales
- Test-retest reliability: >0.80 over two weeks
- NPV of 0.98: a negative Vanderbilt reliably rules out ADHD
- PPV of only 0.19: out of every 100 children who score positive, only ~19 actually have ADHD. This is not a flaw — it reflects the mathematics of screening in populations where the base rate is relatively low
Key limitation: Weak for detecting internalizing comorbidities (sensitivity ~0.37 for anxiety/depression modules). If you suspect internalizing comorbidity, supplement with a broadband measure — a scale that screens across many diagnostic categories rather than just ADHD (CBCL or Conners 4).
Age range limitation: Validated for ages 6–12 only. For adolescents, use Conners 4 or SNAP-IV.
(Wolraich et al., 2003, Pediatrics; Bard et al., 2013, J Dev Behav Pediatr)
Conners 4th Edition (Conners 4)
Reach for this when: You need the most comprehensive clinical picture — age/sex-normed T-scores, multi-dimensional symptom profile, and comorbidity screening all in one instrument.
What the 4th Edition adds:
- Modern normative data (2022 standardization)
- Digital administration option
- Emotional Dysregulation scale (captures the "hot" executive function deficits)
- Severe indicator items flagging conduct problems, self-harm risk, and substance use (immediate safety alerts)
- Dedicated Impairment and Functional Outcome Scales
T-score interpretation:
- 50 = average; each 10 points = one standard deviation
- T ≥ 65 = top ~7% for symptom severity compared to same-age, same-sex peers
- ADHD Index at T ≥ 65: sensitivity ~0.83, specificity ~0.85
Note: Psychometric data are from the publisher's standardization sample. Independent validation in community samples may yield different performance.
Key limitation: Per-use licensing cost. For high-volume primary care screening, the free Vanderbilt may be more practical.
(Conners, 2022)
SNAP-IV (Swanson, Nolan, and Pelham-IV)
Reach for this when: You need to track treatment response over time using direct DSM symptom wording. This is the scale the landmark MTA study used.
Clinical strengths:
- Direct translation of DSM criteria into a rating format (exact symptom wording)
- 26 items: 18 ADHD symptoms + 8 ODD symptoms, rated 0–3
- Free and fast
- Ideal for measuring medication response at each titration step
Key limitation: Uses DSM wording rather than behaviorally anchored descriptions, making it somewhat less informative than the Conners or Vanderbilt for initial evaluation. Does not include impairment/performance items.
(Swanson et al., 2001, Educ Psychol Meas)
BRIEF-2 (Behavior Rating Inventory of Executive Function, 2nd Edition)
Reach for this when: You need to map the executive function profile for intervention planning — particularly when standard neuropsych testing shows no deficit but the child is clearly struggling in daily life.
Why it matters: Roughly 50% of children with confirmed ADHD show no deficit on standard neuropsych batteries (Coghill, Seth & Matthews, 2014, Psychol Med). The BRIEF-2 captures the real-world expression of executive dysfunction — the kid who cannot organize their backpack, start homework independently, or shift between tasks, even when they perform fine in a structured testing room.
Yields:
- Global Executive Composite (overall EF score)
- Behavioral Regulation Index (impulse control, self-monitoring)
- Emotion Regulation Index (managing emotional responses)
- Cognitive Regulation Index (working memory, planning, mental flexibility)
Clinical application: If a child's BRIEF-2 shows elevated Working Memory and Plan/Organize indices but normal Inhibit and Shift indices, that tells you something different about their intervention needs than the reverse pattern.
Key limitation: Not a diagnostic instrument for ADHD. Supplemental to ADHD-specific scales.
(Gioia et al., 2015)
Child Behavior Checklist (CBCL)
Reach for this when: You need to screen broadly for psychopathology beyond ADHD — especially when internalizing comorbidities (anxiety, depression, somatic complaints) are suspected.
Structure:
- Parent form: ages 6–18; Preschool version: ages 1.5–5
- Internalizing composite (anxiety, depression, withdrawal)
- Externalizing composite (aggression, defiance, hyperactivity)
- DSM-oriented scales including Attention Problems
Best used as: The broadband complement to a narrowband ADHD scale. The Vanderbilt or Conners tells you about ADHD severity; the CBCL tells you what else is going on.
Key limitation: Not designed as an ADHD-specific tool. The Attention Problems scale is less precise for ADHD diagnosis than purpose-built instruments.
(Achenbach & Rescorla, 2001)
Adult ADHD Self-Report Scale (ASRS)
Reach for this when: Screening an adult for possible ADHD. The 6-item screener is fast; the 18-item version maps DSM criteria.
Critical limitation: The ASRS is highly sensitive but has low specificity in general populations. Using it as a diagnostic tool rather than a screening prompt can produce a 7-to-10-fold over-identification rate, with roughly 90% of positive screens representing false positives when subjected to comprehensive multi-informant evaluation. (Sibley et al., 2021, Am J Psychiatry)
Bottom line: A positive ASRS means "evaluate further." It does not mean "diagnose ADHD." Adult diagnosis requires retrospective childhood history + collateral informant + comprehensive clinical interview.
(Kessler et al., 2005; Sibley et al., 2021)
Clinical Decision Framework
| Clinical Situation | Recommended Scale(s) |
|---|---|
| Primary care screening (ages 6–12) | Vanderbilt (free, validated, includes comorbidity screen and follow-up forms) |
| Comprehensive clinical evaluation | Conners 4 (broadest coverage) + CBCL (if internalizing suspected) |
| Treatment monitoring / titration | SNAP-IV (free, DSM-mapped, fast) or Vanderbilt follow-up forms |
| Executive function profiling | BRIEF-2 (especially when neuropsych testing is normal but daily function is impaired) |
| Adolescent evaluation (13–18) | Conners 4 (includes Youth Self-Report) + adolescent self-report |
| Adult screening | ASRS-6 screener → if positive, comprehensive clinical interview with collateral informant |
| Suspected internalizing comorbidity | Add CBCL to any ADHD-specific scale |
| Research / clinical trials | SNAP-IV (MTA standard) |
| Budget-constrained practice | Vanderbilt (screening) + SNAP-IV (monitoring) — both free |
When Parent and Teacher Ratings Disagree
Cross-informant agreement for ADHD symptoms is consistently modest (correlations ranging from near zero to ~0.44 on hyperactivity-impulsivity). This is not a measurement failure. Different informants observe different demands:
| Informant | What They Observe | What They May Miss |
|---|---|---|
| Parent | Unstructured time, emotional regulation, homework, family interactions, weekend behavior | Sustained cognitive demands, peer comparison, classroom behavior |
| Teacher | Academic performance, sustained attention demands, peer interactions, classroom behavior | Home behavior, emotional regulation outside school, medication wear-off |
| Adolescent self-report | Internal experiences, subjective distress, medication effects, peer context | May minimize symptoms (poor insight or desire to appear normal) or maximize (seeking diagnosis) |
| Collateral informant (adult dx) | Childhood behavior, long-term patterns, relationship impact | May have recall bias or limited contact |
When ratings diverge, the clinical question is not "who is right?" but "what is different about these two environments?" Where is the child scaffolded, and where are they not? A child who scores low on teacher Vanderbilt but high on parent Vanderbilt may have strong classroom structure compensating for deficits that emerge in unstructured home settings — or the reverse.
Psychometric Concepts Translated
| Term | What It Means | Why It Matters |
|---|---|---|
| Sensitivity | How well the test catches real cases (true positive rate) | High sensitivity = few missed cases. Important for screening |
| Specificity | How well the test avoids false alarms (true negative rate) | High specificity = few false positives. Important for diagnosis |
| Positive Predictive Value (PPV) | If the test is positive, what is the probability the patient actually has the condition? | Depends heavily on the base rate in the population being tested. Low PPV in general screening is expected, not a flaw |
| Negative Predictive Value (NPV) | If the test is negative, what is the probability the patient does NOT have the condition? | High NPV = a negative result reliably rules out. Vanderbilt's NPV of 0.98 means a negative screen is very reassuring |
| T-score | Standardized score: 50 = average, each 10 points = one standard deviation | Allows comparison to same-age, same-sex peers. T ≥ 65 = top ~7% |
| Cronbach's alpha | Internal consistency — do the items measure the same construct? | α > 0.80 = good; α > 0.90 = excellent |
This quick reference tool is extracted from Module 2: Assessment & Differential Diagnosis. For full evidence review, sensitivity/specificity data, clinical vignettes, and the complete differential diagnosis framework, see the full clinical module.
For the full clinical curriculum, visit psychhq.com
Key References: Wolraich et al. (2003) Pediatrics; Bard et al. (2013) J Dev Behav Pediatr; Conners (2022); Swanson et al. (2001) Educ Psychol Meas; Gioia et al. (2015); Achenbach & Rescorla (2001); Pelham et al. (2005) J Clin Child Adolesc Psychol; Sibley et al. (2021) Am J Psychiatry; Kessler et al. (2005)