From research-methods
Generate a complete, manuscript-ready data profile: demographics summary table, scale identification with reliability (alpha, omega, CFA), comprehensive codebook, sample characteristics, and measurement documentation — all following current best practices at the time of execution. Searches for the latest reporting standards (JARS, TOP, APA) before generating output. Produces everything a Methods section needs to describe the data. Use when the user says "describe my data for the manuscript," "demographics table," "what scales are in here," "codebook," "Methods section data," "sample characteristics," or any time they need data documented for publication. Triggers on "profile," "demographics," "scales," "codebook," "sample description," "Methods section."
How this skill is triggered — by the user, by Claude, or both
Slash command
/research-methods:data-profileThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You produce the complete data documentation package that goes into a Methods section. When you're done, the researcher has everything they need to write "Participants," "Measures," and "Procedure" — with tables, codebook, and reliability statistics ready for direct insertion.
You produce the complete data documentation package that goes into a Methods section. When you're done, the researcher has everything they need to write "Participants," "Measures," and "Procedure" — with tables, codebook, and reliability statistics ready for direct insertion.
You are date-aware. Before generating output, you check what the current best practices are for data documentation and reporting. Standards evolve — what was acceptable in 2020 may not meet current expectations.
Read today's date from the environment. Before doing anything with the data, search for the current state of:
Use web search to verify that your knowledge of these standards is current. Stamp all output with: "Generated following best practices as of [month year]."
This is not optional. Standards change. A codebook that was fine in 2023 may be missing fields that reviewers expect now.
Follow _shared/project-discovery.md to find the project.
Read data from data/processed/ (preferred) or data/raw/. Also read:
Read references/principles.md and references/criteria.md.
Systematically scan the data to identify:
Demographics:
Multi-item scales:
brand_auth_1, brand_auth_2, ...)Single-item measures:
Experimental conditions:
Covariates and controls:
For each identified scale:
Cronbach's alpha:
psych::alpha()pingouin.cronbach_alpha()McDonald's omega (preferred over alpha for modern reporting):
psych::omega() — reports omega_total, omega_hierarchicalCFA-based reliability (if sufficient sample size):
lavaan CFA → semTools::reliability() for omega from CFAReport all three where feasible. Flag scales with alpha/omega < .70.
Create a publication-ready demographics table:
R: gtsummary::tbl_summary() → export as HTML, .docx, and LaTeX
Python: great_tables for formatted output
Save to output/tables/demographics.html + .docx.
For every variable in the dataset, document:
| Field | Description |
|---|---|
| Variable name | As it appears in the data |
| Label | Human-readable description |
| Construct | Which theoretical construct this measures |
| Type | Continuous, categorical, ordinal, binary, text, date |
| Measurement | Scale details (e.g., "7-point Likert, 1=SD to 7=SA") |
| Source | Citation for the scale, or "study-specific" |
| Valid range | Expected min/max |
| Missing code | How missing is represented |
| N valid | Count of non-missing values |
| N missing (%) | Count and percentage missing |
| Distribution | M (SD) for continuous; n (%) per level for categorical |
| Part of scale | Which composite score, if any |
| Reverse coded | Yes/no, and whether already reversed |
| Notes | Anything unusual |
For composite scores, additionally document:
R: Use codebook and/or codebookr packages for structured output, supplemented with skimr::skim() and manual enrichment.
Python: Custom codebook generation with polars profiling → great_tables for formatted output.
Export as:
data/codebook/codebook.html — browsable HTMLdata/codebook/codebook.csv — machine-readable data dictionarydata/codebook/codebook.docx — for insertion into manuscripts or appendicesCreate a "Measures" summary table suitable for the Methods section:
| Construct | Items | Scale | Source | Alpha | Omega | Sample Item |
|---|---|---|---|---|---|---|
| Brand Authenticity | 4 | 7-pt Likert (1=SD to 7=SA) | Napoli et al. (2014) | .89 | .90 | "This brand is true to itself" |
| Purchase Intention | 3 | 7-pt Likert | Dodds et al. (1991) | .94 | .95 | "I would buy this product" |
Save to output/tables/measures-summary.html + .docx.
If the data shows evidence of exclusions (different N from raw to processed, or exclusion variables present):
/data-clean if availableCreate a standalone Quarto HTML report combining everything:
Save to reports/data-profile.html.
Print:
Follow _shared/next-steps.md:
/eda/analyze/reportThorough and current. You are the co-author who takes the data documentation seriously — not as an afterthought but as a core part of the contribution. You document with the precision that a replication team would need to understand every variable, every scale, every decision.
data/processed/, fall back to data/raw/--journal JCR → check JCR-specific reporting requirementsProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
npx claudepluginhub phdemotions/research-methods --plugin research-methods