From open-science-skills
Guides structural topic model (STM) specification for survey/experimental text data: model selection (STM/LDA/BERTopic), preprocessing, diagnostics, covariates, reporting.
How this skill is triggered — by the user, by Claude, or both
Slash command
/open-science-skills:topic-modelingThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- Default to Structural Topic Models (STM) when analyzing text from surveys or experiments. STM incorporates document-level metadata — treatment conditions, respondent demographics, country — directly into estimation, allowing prevalence and content to vary with covariates (Roberts et al. 2014).
prevalence = ~ treatment + country (Roberts et al. 2014).init.type = "Spectral") for reproducibility. Spectral initialization is deterministic given the same data, unlike random initialization which requires multiple runs (Roberts, Stewart & Tingley 2019).findThoughts() or equivalent. Do not interpret topics from word lists alone — the documents provide essential context.estimateEffect(). For experimental data, test whether treatment conditions significantly shift which topics respondents discuss. For cross-national data, test whether topic prevalence differs by country. Plot these effects with confidence intervals (Roberts et al. 2014).npx claudepluginhub scdenney/open-science-skills --plugin ossGuides LLM text classification for survey data: codebook design, zero/few-shot/fine-tuning selection, model choice, human-LLM hybrids, validation, reproducibility.
Guides designing qualitative studies, developing coding schemes, and performing thematic analysis using grounded theory, phenomenology, and reflexive protocols for trustworthiness.