Skill

restaurant-ranker

Rank a city's restaurants from one prompt — "rank the best restaurants in {city}", "which restaurants in {city} are worth it", "best places to eat in {city}", "top restaurants in {city}". Fuses editorial prestige + Google reviews + LIVE booking-demand into a Bayesian ranking with honest uncertainty intervals. Read-only — reads reservation calendars to measure demand, never books anything. Orchestrates the full pipeline: corpus → reviews → booking → model → render.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/restaurant-ranker:restaurant-ranker

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Fuses four independent signals — editorial prestige (best-of lists, Michelin), Google reviews

Supporting Files

SKILL.md

362 lines · ~4.5k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitJun 8, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

restaurant-ranker

Fuses four independent signals — editorial prestige (best-of lists, Michelin), Google reviews (rating + count), live booking-demand (revealed preference read off reservation calendars), and booking friction (phone-only / lottery gate level) — into a single Bayesian latent-quality ranking. Produces a <slug>-board.md with the ranked list, 94% HDI uncertainty intervals, per-channel coverage flags, and honest treatment of off-platform venues. Read-only; it never books anything.

Mechanical chain detail: references/pipeline.md.

THE READ-ONLY LAW (non-negotiable)

Read availability only. NEVER submit, confirm, or complete a reservation.

Push through gates to reach and read the slot list (accept T&C, set party size, pick a default course menu, dismiss cookie banners). Then STOP. Reading the offered slots IS the probe. Clicking a slot, a "Next"/"Book"/"Confirm"/"Complete" button, or entering payment / deposit details is a FAILURE of the task — it creates a real reservation, blocks a real table, and violates lakbai's core anti-goal.

Forbidden, no exceptions:

Don't click a time-slot button "just to see what happens" — it advances to checkout.
Don't fill the final contact/payment form, even with fake data, even if "I'll stop before submit."
Don't proceed past a deposit / credit-card-hold gate. Record the deposit requirement and stop.
"Verifying it's really bookable" is NOT a reason to click through. Offered slot = the datapoint.

Red flags — STOP immediately if you think any of these:

"Let me click one slot to confirm the flow works."
"I'll fill the form but not hit submit."
"The deposit step is fine, I'll just look at the next screen."
"It's a test reservation, I'll cancel it after."

All of these mean: you are about to create a real booking. Back out. The slot list is enough.

This law governs the booking step (Step C') and any delegation to the booking-probe sibling skill. It is a security invariant (T-03-01).

Setup (run once, before first use)

Model venv (required — system python3 cannot run model.py):

# Install uv if absent
command -v uv >/dev/null || { curl -LsSf https://astral.sh/uv/install.sh | sh; export PATH="$HOME/.local/bin:$PATH"; }

# Create the 3.12 venv
uv venv /tmp/ranker-venv --python 3.12

# Install the model stack
uv pip install --python /tmp/ranker-venv/bin/python pymc arviz numpy h5netcdf

System python3 is 3.10 and has no pymc/numpy — model.py raises ImportError on it. All other steps use system python3. See references/pipeline.md §VENV SETUP for detail.

API key (optional — reviews channel degrades gracefully if absent):

Load from the uploaded .env file if present, then fall back to whatever is already in env:

[ -f /mnt/session/uploads/.env ] && set -a && source /mnt/session/uploads/.env && set +a

If the key is absent, the reviews channel is skipped and flagged in the coverage report. The ranking still runs on prestige + booking-demand alone.

THE WORKFLOW

Each step is a single action; do not batch. Observe the outcome of each step before proceeding. Run all commands from this skill's own directory (the directory containing scripts/ and cities/).

Step A — Corpus (Seam 1: agent gathers venues via web_search)

Warning: do NOT just shell python3 scripts/corpus.py --city <city>. Its gather_source() returns [] in the stdlib environment — running it alone writes an empty corpus.

The agent assembles the corpus itself:

Load the city config to find prestige_sources:
```
import sys; sys.path.insert(0, "scripts")
from cityconfig import load_city
cfg = load_city("<city>")
```
If UnknownCityError is raised, STOP (see Checkpoint 1 below).

For each source in cfg["prestige_sources"], run web_search("<source_name> best restaurants <city>") and collect venues. Build one row per venue:

row = {
    "venue": str,                     # canonical name
    "area": str | None,               # neighbourhood, or None
    "cuisine_raw": str,               # raw editorial cuisine label
    "prestige_sources": [str, ...],   # which source(s) named it
}

Dedup and score:
```
from corpus import dedup, bucket_cuisine, normalize_name, _venue_slug
deduped = dedup(rows)   # adds prestige_score, sources_n (NOT slug, NOT cuisine — add those explicitly after)
for v in deduped:
    v["slug"] = _venue_slug(v["venue"])
    v["cuisine"] = bucket_cuisine(v.get("cuisine_raw"), cfg["cuisine_taxonomy"])
```
Why this loop is required: dedup() returns records without slug or cuisine. model.prepare_model_data filters [r for r in corpus if r.get("slug")], so any corpus record missing slug is silently discarded — meaning zero venues would be modeled without this loop. The cfg variable is already bound above by cfg = load_city("<city>").
Write deduped to <slug>-corpus.json in the caller's CWD.

Checkpoint 2: Report "N venues gathered from S sources" before proceeding.

Step B — Reviews

python3 scripts/reviews.py --city <city> --corpus <slug>-corpus.json

Writes <slug>-reviews.json. If GOOGLE_MAPS_API_KEY is absent, the channel is skipped and flagged (status: "skipped", reason: key not set). No action needed — proceed to Step C.

Step C — Engine map (Seam 2: agent discovers booking engines)

The agent authors <slug>-venues.json. No script produces this file.

For every venue in <slug>-corpus.json, you must actively TRY to find a booking engine — do not assume it has none.

🚫 "Off-platform" is a verdict you EARN by exhausting every avenue — never a default. A notable urban restaurant almost always books through some engine. Before a venue may be omitted/off-platform, you must have actually searched for it on each engine and come up empty. Skipping discovery and calling a venue "phone/email-only" because you didn't look is the single biggest cause of a falsely empty booking channel. (Proof: a Manila run that did the discovery found 10 readable venues on SevenRooms/Eatigo over plain HTTP — venues a lazy pass would have written off as off-platform.)

Per-venue discovery loop — exhaust ALL of these before giving up on a venue:

Search the venue + each engine by name. For each venue run searches like "<venue>" sevenrooms, "<venue>" eatigo, "<venue>" covermanager, "<venue>" reservation, then the anti-bot engines (tablecheck, chope, opentable, tock, resy, zenchef, autoreserve, …). Many venues book through an engine even when their own site hides it.
Resolve the venue's own website → its embedded engine. Open the reservation/"Book" button on the official site and read the final host (SevenRooms widget, Eatigo iframe, etc.) — method in ../booking-probe/references/engine-routing.md.
Capture the engine-specific identifier (SevenRooms slug, Eatigo branch id, CoverManager slug) so booking.py can probe it.
Only after all of the above turn up nothing for a venue may it be omitted as off-platform.

Hard/anti-bot engines (TableCheck, Chope, OpenTable, Tock, Resy, Zenchef, …) are NOT a dead end — they go to Tier C'-2 escalation (below), via the booking-probe sibling skill. Map them in the venues file; do not drop them.

File shape:

{
  "helm":        {"engine": "sevenrooms", "identifier": "helmbgc"},
  "toyo-eatery": {"engine": "eatigo",     "identifier": "12345"}
}

CRITICAL — slug mismatch silently zeros fill: keys MUST be the slug field from each record in <slug>-corpus.json (the _venue_slug output), never the display name. A slug mismatch causes the booking join to fail — every mismatched venue appears as missing fill in the model, producing wide uncertainty that the data does not support. Always copy the slug value verbatim from the corpus JSON.

Valid engines: sevenrooms, eatigo, covermanager (booking.py PROBERS), or any engine listed in cities/<slug>.json under booking_platforms. A venue is omitted/off-platform ONLY after the per-venue discovery loop above genuinely exhausted every engine — then booking.py flags it gated/off_platform=True with fill=None (honesty rail BOOK-03). The honesty rail cuts both ways: never fabricate a fill, AND never fabricate an absence of one by failing to look. If you mark a venue off-platform, you must be able to name which engines you searched.

Checkpoint 3: Report the engine resolved for EVERY venue. For any venue you're calling off-platform, state which engines you searched (so "off-platform" is auditable, not assumed). If the readable-engine count looks low for a major city, you almost certainly under-searched — go back and finish the discovery loop before running booking.

Step C' — Booking (two tiers: HTTP fast-probe → CloakBrowser escalation)

Live booking-demand is a co-equal pillar of the ranking, not a nice-to-have. Half of a city's top venues sit behind anti-bot booking engines (TableCheck, OpenTable, DinnerBooking, Superb, ...). Reading them is what makes β_b identifiable. The booking step is therefore two tiers, and you run BOTH — stopping at tier 1 leaves most demand signal on the floor.

Tier C'-1 — HTTP fast-probe (cheap, no browser):

python3 scripts/booking.py --city <city> --venues <slug>-venues.json

Writes <slug>-booking.json. Reads the JSON-endpoint engines (SevenRooms, Eatigo, CoverManager) over plain HTTP. Prints === FILL EVIDENCE === and Off-platform / no-signal venues: K/total. Anti-bot engines come back channel="blocked-here", fill=None — an honest gap, not a verdict. Do NOT stop here.

Tier C'-2 — escalate EVERY blocked-here venue. This is not optional.

🚫 There is no "demo" or "full-run" exception. A run that stops at blocked-here and only describes what escalation "would" do is an incomplete run that you must not present as a result — it silently drops the booking pillar. Never write "in a full/production run I would escalate", "would normally escalate via CloakBrowser", or "(skipped for this demo)". If you catch yourself narrating escalation in the conditional tense, STOP and actually do it now. You are already in the full run. There is no other run.

Work each blocked-here venue, cheapest rung first — most cost you nothing:

C'-2a — just fetch the page (no browser, do this first). web_fetch (or curl) the venue's booking URL. Many Tock / Resy / TableCheck / Zenchef pages render the slot times directly in the HTML/JSON. If you see times, that is a live fill observation — you are done with this venue; capture it (below). Do NOT escalate further and do NOT defer it. (Real failure mode this prevents: fetching a Tock page, seeing "6:00 PM, 6:15 PM…", and then throwing it away while saying you "would" escalate. The times you just read ARE the data.)
C'-2b — CloakBrowser, only if C'-2a returned no times. Drive the real widget with the booking-probe sibling skill — CloakBrowser (patchright + xvfb + humanize) clears Turnstile / DataDome / ALTCHA / CDP walls. Verified recipes per engine: ../booking-probe/recipes/<engine>.md; the tiered playbook (which tier each wall needs, and the rule that a 403 from vanilla CloakBrowser means wrong tier, not "blocked") is the ../cloakbrowser/SKILL.md skill — read it before deciding a venue is unreadable.
C'-2c — only now may a venue be unresolved. A venue is honestly unreadable ONLY after C'-2a returned no times AND the correct CloakBrowser tier was actually attempted and blocked (e.g. PerimeterX / residential-IP walls). "I didn't try" is never a reason to mark unresolved.

THE READ-ONLY LAW governs every fetch/click — reach and read the slot list, then STOP. Do not rationalize a coverage gap as "architectural" before confirming you ran C'-2a and the correct CloakBrowser tier.

Capturing a read (C'-2a or C'-2b): write one resolved venue JSON per restaurant to ./<city>-booking-probe/clickthrough/<slug>.json with the units shape {slug, platform, units:[{date, party_size, status, slots:[{time, state}], note}]} (booking-probe writes this for you; for a C'-2a hand-fetch, write it yourself from the times you read). A venue you genuinely cannot read stays unresolved — never faked.

Merge the live reads back into the booking channel:

python3 scripts/merge_fill.py --city <city> --booking <slug>-booking.json

Folds every genuinely-resolved clickthrough read into <slug>-booking.json as channel="readable" (source="cloakbrowser"), with fill computed by the same fill_from_units the HTTP path uses. Prints an upgraded/inserted/skipped summary. Honesty rail intact: only resolved grids upgrade; unresolved reads stay fill=None. THE READ-ONLY LAW applies to the whole step.

Step D — Model

Checkpoint 4 (before running): Confirm "corpus + reviews + booking ready; X readable venues. Fitting the Bayesian model now (this takes a few minutes)."

/tmp/ranker-venv/bin/python scripts/model.py \
  --city <city> \
  --corpus <slug>-corpus.json \
  --reviews <slug>-reviews.json \
  --booking <slug>-booking.json

Never run model.py with python3 — it will raise ImportError. Always use the venv interpreter.

Writes <slug>-results.json and <slug>-posterior.nc. Prints venues: N (R/T readable) and divergences: D. Aim for ≥10 readable venues so the booking-demand coefficient (β_b) is cleanly identified. The CloakBrowser escalation (C'-2) is normally what carries you over this floor — HTTP-only rarely clears 10 readable in a top-venue city. Fewer than 10 readable is still valid — the board reports this honestly (render.py _COVERAGE_FLOOR).

Step E — Render

python3 scripts/render.py --city <city> --results <slug>-results.json

Writes <slug>-board.md. The board already contains the §1 coverage line (N/T read live), §3 honesty note (reviews-skipped notice, β_b verdict, off-platform flags), and identifiability-floor status.

Checkpoint 5: Present the board path and the coverage report (see below).

STEP DISCIPLINE (SKILL-03)

Each step is a single action; do not batch. Observe the outcome before acting again. This is the snapshot → reason → act → re-snapshot loop — not autonomous multi-step automation.

Read-only steps (B, C', E) proceed without a y/n permission gate, but each is still a discrete action whose output is observed before the next. The 5 named checkpoints require a confirmation or report before proceeding:

#	When	Action
1	After city config loaded	If `UnknownCityError`: relay the actionable error (name the exact `cities/<slug>.json` path and `cities/_SCHEMA.md` reference) and STOP. Do not proceed to corpus.
2	After corpus assembled	Report "N venues gathered from S sources." Proceed to reviews.
3	After engine map built	Report the engine resolved for every venue; for any off-platform venue, name the engines searched. Low readable count for a major city = under-searched → finish the discovery loop. Proceed to booking.
3'	After C'-1 HTTP probe	Report "K blocked-here venues to escalate via CloakBrowser." Run C'-2 + merge_fill, then report readable count after merge.
4	Before model fit	Confirm "corpus + reviews + booking ready, X readable (after CloakBrowser merge); fitting now." Proceed to model.
5	After render	Present board path + coverage report. Done.

Narrate progress between checkpoints. Do NOT add a y/n gate before every micro-action. Do NOT batch multiple steps without observing intermediate output.

COVERAGE REPORT (SKILL-04)

After Step E, emit a one-line-per-channel ran-vs-skipped table built from each step's already-emitted status signals. The render board (§1 + §3) already bakes the honest coverage — this chat summary mirrors it. No new code required.

Channel    | Status
-----------|-----------------------------------------------------------------
corpus     | ran — N venues (S sources)
reviews    | ran — ok (N/T enriched)         OR  skipped (GOOGLE_MAPS_API_KEY not set)
booking    | ran — K/T off-platform; readable=R
gate       | ran — always present (derived from corpus/booking metadata)
model      | ran — R/T readable; divergences=D; rhat_max=X
render     | ran — board: <slug>-board.md; coverage floor (≥10) cleared / not cleared

Signals come from each step's existing stdout:

corpus: venues: N (deduped) + len of corpus JSON. N == 0 → Seam 1 not done.
reviews: channel_status printed as reviews: ok|skipped (n/T enriched). Skipped reason names GOOGLE_MAPS_API_KEY (REV-02).
booking: === FILL EVIDENCE === table + Off-platform / no-signal venues: K/total. Per-record channel ∈ {readable, online-noslots, blocked-here, gated}.
model: results.json.channel_coverage = {readable, total, reviews_status}; diagnostics = {divergences, rhat_max, ess_min}. Printed as venues: N (R/T readable) + divergences: D.
render: <slug>-board.md §1 coverage line + §3 honesty note contain N/T-read-live, reviews-skipped notice, and ≥10 identifiability-floor cleared/not.

skills/booking-probe/ — engine discovery recipes + THE READ-ONLY LAW origin; delegate hard engines here.
references/pipeline.md — exact invocation chain, both seam contracts, venv setup detail.

restaurant-ranker

Invocation

Context Preview

Supporting Files

SKILL.md

restaurant-ranker

Invocation

Context Preview

Supporting Files

SKILL.md

restaurant-ranker

THE READ-ONLY LAW (non-negotiable)

Setup (run once, before first use)

THE WORKFLOW

Step A — Corpus (Seam 1: agent gathers venues via web_search)

Step B — Reviews

Step C — Engine map (Seam 2: agent discovers booking engines)

Step C' — Booking (two tiers: HTTP fast-probe → CloakBrowser escalation)

Step D — Model

Step E — Render

STEP DISCIPLINE (SKILL-03)

COVERAGE REPORT (SKILL-04)

Related

Similar Skills

restaurant-ranker

THE READ-ONLY LAW (non-negotiable)

Setup (run once, before first use)

THE WORKFLOW

Step A — Corpus (Seam 1: agent gathers venues via web_search)

Step B — Reviews

Step C — Engine map (Seam 2: agent discovers booking engines)

Step C' — Booking (two tiers: HTTP fast-probe → CloakBrowser escalation)

Step D — Model

Step E — Render

STEP DISCIPLINE (SKILL-03)

COVERAGE REPORT (SKILL-04)

Related

Similar Skills