From market-scout
Advanced product/market research — find "the best X on the market" in any category (best laptop, best phone, best 5G router, best headphones, best monitor, etc.) across major retailers (Amazon, Best Buy, Walmart, Newegg, B&H, Target, manufacturer). Runs a multi-agent pipeline: expert-review consensus + contrarian/owner-complaint mining + live cross-retailer pricing + spec verification, then a DETERMINISTIC weighted decision-matrix scoring engine, adversarial red-team verification, and a wiki-filed ranked recommendation with segment winners (best overall / value / performance / budget) and a buy sequence. TRIGGERS: "best <product> on the market", "what <product> should I buy", "find me the best <category>", "compare <products> and recommend", "market-scout", "scout the market for", "buying guide for". Integrates with your Obsidian wiki, ultradeep agents, llm-council, agentmemory, and search stack. DO NOT use for: a single known URL price check (just fetch it), or non-product research (use /ultradeep).
How this skill is triggered — by the user, by Claude, or both
Slash command
/market-scout:market-scoutThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> The brain that decides **what is the best thing on the market** in a category —
The brain that decides what is the best thing on the market in a category — evidence-gathered, math-scored, red-teamed, and filed into the wiki. Built to the same architecture as
/ultradeep(tunableprogram.md+ orchestrator + explorer/red-team agents + wiki self-learning), specialized for buying decisions where the failure mode is fabricated confidence.
Read program.md (in this skill folder) before every run — it holds the
tunable defaults (depth, scoring profiles, source policy, retailer set, wiki
paths, self-learning Domain Notes). Edit that file, not this one, to tune.
A naive "best laptop?" answer is an LLM repeating listicles. market-scout separates the two things that decision needs:
scripts/score.py. The ranking cannot be
hallucinated; the LLM only writes the prose around it.The deliverable always takes a position (per program.md decisiveness rule):
a #1 pick + segment winners + a buy sequence, with confidence labels.
Create a TodoWrite item per phase. Scratch dir: ~/research/<category-slug>-<YYYY-MM-DD>/.
Pin the decision before researching. Capture (infer sane defaults; do NOT block
the user if a /goal is active — assume and state):
criteria.json block; if none, propose adding one).default / value / performance / travel / category-specific).Per the standard wiki traversal (hot → index → drill):
~/agentic-wiki/hot.md, then index.md.wiki/research-reports/ for a prior scout on this category — if found,
factor it in ("we covered X; this run updates prices + new models Y, Z").mcp__agentmemory__memory_recall for prior buying decisions / preferences.claude-obsidian:wiki-query is available, semantic-query the vault.Load references/criteria.json for the category. State the evaluation criteria,
their weights (active profile), and metric directions. This is the rubric the
whole run is graded against — make it explicit so the red-team can attack it.
If the category isn't modeled, add a block to criteria.json (copy _generic)
and justify the weights.
Dispatch 4–6 explorers in parallel (Agent tool, deep-research-explorer
subagent, or a dynamic workflow if >10). Give each the 4-field spec
(Objective / Output format / Tools & sources / Boundaries) + SCRATCH_DIR.
Standard angle set (tune per category):
| Explorer | Angle | Primary sources |
|---|---|---|
| A. Expert consensus | Who do the testing labs crown? | RTINGS, Wirecutter, Tom's Hardware/Guide, PCMag, Consumer Reports, category specialists (e.g. Dong Knows Tech, RVMobileInternet for cellular) |
| B. Contrarian / reliability | What breaks, what disappoints, who regrets it? | Reddit (category subs), forums, long-term reviews, owner complaints, RMA threads |
| C. Live cross-retailer pricing | Real street price + stock across retailers | Amazon, Best Buy, Walmart, Newegg, B&H, Target, manufacturer; product MCPs if installed (see references/integration.md) |
| D. Spec / datasheet verification | Ground every spec in the manufacturer source | Official product pages, datasheets, regulatory filings |
| E. (opt) New/discontinued tracker | Newer model imminent? EOL? successor? | Manufacturer roadmap, news, release calendars |
| F. (opt) Regional availability | Can you actually buy it (your region / globally)? | Local retailers, import/customs, regional pricing |
Anti-bias rule: at least one explorer must hunt for the non-obvious pick and
for reasons NOT to buy the front-runner. See references/sources.md for the
trusted-source registry and tiered Exa routing (keyed→shared→searxng→WebSearch,
per program.md).
candidates.jsonSynthesize explorer notes into the run file the engine consumes:
{ "category": "5g-router", "query": "best 5G router 2026", "generated": "YYYY-MM-DD",
"weights_profile": "default",
"candidates": [ { "name": "...", "model": "...", "form_factor": "...",
"price_usd": 499, "metrics": { "expert_rating": 9.2, "max_5g_downlink_gbps": 3.4, ... },
"review_consensus": "one-paragraph judgment", "sources": ["url", ...] } ] }
Rules: triangulate every quantitative claim across ≥2 unrelated sources; use the
cheapest verified street price as price_usd; normalize expert_rating to a
/10 from each source's scale; leave a metric null rather than guessing
(completeness penalty handles it honestly).
python3 scripts/score.py SCRATCH/candidates.json \
--criteria references/criteria.json \
--profile <profile> \
--out SCRATCH/results.json --md SCRATCH/matrix.md
Returns ranking + segment winners (best overall / value / performance / budget) +
per-metric normals + confidence labels. Re-run with --profile value /
performance / travel to show how the pick shifts by priority — surfacing this
sensitivity is a feature, not noise.
Dispatch deep-research-redteam on the draft + scratch notes. It must attack:
unsupported specs, single-source prices, an over-stated #1, missing strong
candidates, recency rot (newer model out?), and adverse-interest sourcing
(a vendor/affiliate is not neutral on "X is best"). Escalate to a 2nd cycle on any
Critical finding (per program.md). Apply fixes; re-score if specs/prices change.
If the decision is a genuine coin-flip whose decisive variable is user-specific
and not in the evidence (e.g. "does the user value global eSIM over raw speed"),
run llm-council AFTER the red-team to calibrate the question and convert it
into a decision RULE / buy sequence. Skip for clear-cut categories.
Write the judgment prose: 3-sentence exec summary, the decisive recommendation (take a position), segment winners with reasoning, the buy sequence (what to buy first / what to verify before buying), and honest caveats with confidence labels. Declarative, present tense, no hedging unless evidence demands it.
python3 scripts/report.py SCRATCH/results.json SCRATCH/candidates.json \
--summary "<exec summary>" --verdict "<verdict para>" \
--out ~/agentic-wiki/research-reports/<slug>.md
Then (see references/integration.md for exact steps):
wiki/log.md: ## [YYYY-MM-DD] market-scout | <query> → [[research-reports/<slug>]]wiki/hot.md.wiki/index.md section with [[wikilinks]].mcp__agentmemory__memory_save the decision (type: decision).program.md → "Domain Notes" (Phase 7.5
self-learning — this is how the tool compounds, exactly like ultradeep's 9.5).graphify on the scratch dir; (optional) cross-file to ~/llm-wiki/.deep-research-explorer (fan-out), deep-research-redteam (verify);
Trend Researcher / Tool Evaluator agents are good optional explorers.exa-key deep → shared exa → searxng unlimited →
WebSearch) + jina/searxng URL read + claude-obsidian:defuddle on noisy pages.references/integration.md. If none
installed, web research is the floor and the run still completes.~/llm-wiki cross-file.llm-council (Phase 5.5), graphify (Phase 7).Full detail: references/integration.md. Source registry: references/sources.md.
Human-readable criteria rationale: references/criteria.md.
Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
npx claudepluginhub justsima/agentic-stack --plugin market-scout