We studied over 140 peer-reviewed papers across cognitive science, neuropsychology, and psychometrics. Then we built every test from scratch, grounded in the Cattell-Horn-Carroll framework and aligned with clinical gold standards like the WAIS-V.
The battery is organized according to the Cattell-Horn-Carroll (CHC) theory of cognitive ability (Carroll, 1993; McGrew, 2009), the predominant framework in modern cognitive assessment. CHC integrates Cattell's fluid-crystallized distinction, Horn's multi-factor model, and Carroll's massive factor-analytic synthesis of decades of psychometric data.
We chose CHC because it offers empirical integration (70+ years of factor-analytic research), clinical utility (aligns with practitioner expectations and existing instruments like the WAIS-V), and measurement transparency (clear definitions of narrow abilities that guide principled test selection).
| Domain | CHC Broad Ability | WAIS-V Index | Narrow Abilities |
|---|---|---|---|
| Processing Speed | Gs | PSI | Perceptual Speed, Inspection Time |
| Working Memory | Gsm | WMI | Memory Span, WM Capacity, Attention |
| Fluid Reasoning | Gf | PRI | Inductive Reasoning, Spatial Relations |
| Verbal Comprehension | Gc | VCI | Vocabulary, Verbal Knowledge |
| Executive Function | Gf + Gs + Attn | Composite | Inhibition, Flexibility, Planning |
| Attention | Gs (Narrow) | PSI | Sustained, Selective, Vigilance |
| Dyscalculia Screening | Gq | N/A | Number Sense, Subitizing, Arithmetic |
The full battery requires approximately 120 minutes. All tests are delivered via jsPsych (de Leeuw, 2015) with millisecond-precision timing and run entirely in the browser.
| Test | Domain | Paradigm Source | Primary Measure | Time |
|---|---|---|---|---|
| Symbol Coding | Speed | SDMT (Smith, 1973) | Correct pairs / 90s | ~3 min |
| Simple Reaction Time | Speed | Jensen (2006) | Median RT (ms) | ~3 min |
| Digit Span Forward | WM | WAIS-V (Wechsler, 2024) | Span length | ~4 min |
| Digit Span Backward | WM | WAIS-V (Wechsler, 2024) | Span length | ~4 min |
| Corsi Block-Tapping | WM | Corsi (1972) | Spatial span | ~4 min |
| N-Back (2-Back) | WM | Owen et al. (2005) | d-prime | ~5 min |
| Operation Span | WM | Turner & Engle (1989) | Words recalled | ~6 min |
| Matrix Reasoning | Fluid | Raven (1938) | Correct / 20 | ~5 min |
| Pattern Completion | Fluid | KABC-II (Kaufman, 2004) | Correct / 18 | ~4 min |
| Vocabulary | Verbal | WAIS-V (Wechsler, 2024) | Correct / 25 | ~4 min |
| Verbal Similarities | Verbal | WAIS-V (Wechsler, 2024) | Score (0-40) | ~4 min |
| Verbal Analogies | Verbal | Miller Analogies (1967) | Correct / 18 | ~4 min |
| Category Fluency | Verbal | Troyer et al. (1998) | Words / 60s | ~4 min |
| Stroop | Exec | Stroop (1935) | Interference (ms) | ~5 min |
| Trail Making A & B | Exec | Partington & Leiter (1949) | Completion time (s) | ~4 min |
| WCST Analogue | Exec | Grant & Berg (1948) | Perseverative errors | ~6 min |
| Tower of London | Exec | Shallice (1982) | Excess moves | ~5 min |
| Flanker Task | Attention | Eriksen & Eriksen (1974) | Flanker effect (ms) | ~4 min |
| Go/No-Go | Attention | Aron et al. (2007) | False alarm rate | ~3 min |
| Stop-Signal Task | Attention | Logan (1994) | SSRT (ms) | ~5 min |
| CPT Analogue | Attention | Rosvold et al. (1956) | d-prime, vigilance | ~12 min |
| Subitizing | Dyscalculia | Kaufman et al. (1949) | Accuracy, slope | ~3 min |
| Magnitude Comparison | Dyscalculia | Moyer & Landauer (1967) | Distance effect (ms) | ~3 min |
| Dot Comparison | Dyscalculia | Piazza et al. (2010) | Weber fraction | ~3 min |
| Arithmetic Fluency | Dyscalculia | WRAT-5 (Wilkinson, 2017) | Correct / 60s | ~5 min |
| Number Line Estimation | Dyscalculia | Siegler & Booth (2004) | % Absolute Error | ~3 min |
Every test below includes its CHC alignment, task structure, scoring methodology, the published paradigm it's based on, and known limitations. Expand a domain to see the full specifications.
Within each domain, subtest raw scores are converted to z-scores using preliminary normative data, then averaged into domain composites. Direction is adjusted so higher always means better (reaction-time scores are reversed). Domain z-scores are converted to T-scores (M=50, SD=10) for interpretability.
Scores are referenced against published normative datasets, selected by the user's language and region. Two normative populations are currently supported:
International norms (English): Derived from published studies including WAIS-IV/V (Wechsler, 2008; 2024), Tombaugh (2004), Strauss, Sherman & Spreen (2006), and NIH Toolbox standardization samples. Applied to English-language administrations.
NEURONORMA Spanish norms: For Spanish-language administrations, 8 tests are normed against the NEURONORMA project (Peña-Casanova et al., 2009–2013), the gold-standard normative dataset for Spanish adults. NEURONORMA tested 1,365 cognitively healthy, community-dwelling Spanish adults aged 18–90 across multiple centers, providing age- and education-adjusted norms. Tests with NEURONORMA norms: Digit Span, Corsi Blocks, Stroop, Trail Making, Tower of London, DSST, Word Fluency (P/M/R), and Wisconsin Card Sorting. Additional Spanish norms are being integrated from the NEURONORMA-Plus project (2024).
A domain composite requires completion of at least 50% of subtests. Missing subtests are imputed using the mean z-score of completed subtests. Domains below the threshold are flagged as incomplete.
Interpretation guidance: Focus on relative strengths and weaknesses within the individual's profile rather than absolute T-scores, which are provisional until formal norms are available. Scores from this battery are not diagnostic and should never be used for clinical decisions without professional evaluation.
Unlike clinical evaluations, there is no examiner present to observe behavior, assess effort, or note concerning presentations. Behavioral observations are unavailable.
Participants may test in suboptimal environments (noisy rooms, interruptions, poor lighting). There is no way to ensure standardized conditions.
No embedded validity indicators or effort-correction indices. Individuals with low motivation cannot be reliably distinguished from those with true cognitive deficits.
Current norms are based on a small, non-representative sample. Absolute T-scores should be interpreted cautiously. Formal normative data will be available in Version 1.1 (late 2026).
Convergent validity (vs WAIS-V), discriminant validity, and predictive validity have not yet been established. Validation studies are in progress.
Visuospatial abilities are underrepresented. Long-term declarative memory, expressive language, naming, and constructional abilities are not assessed. Individuals with suspected neurocognitive disorders need comprehensive evaluation including these missing domains.
English norms validated for adults aged 18–65. Spanish (NEURONORMA) norms cover ages 18–90 for 8 tests; the remaining tests use international reference data. Adolescent norms and norms for languages other than English and Spanish are not yet available.
Dominguez, C. (2026). Neuropsych Cognitive Battery: Technical Manual, Version 1.0. Retrieved from neuropsych.io