Skill

Download Protocol

Paper download rules — priority chain, Docker container warning, manual link requirements. Use when downloading or providing paper access links.

Invocation

Slash command

/agent-teams-papersearch:download-protocol

SKILL.md

74 lines · ~755 tokens

Stats

Parent stars: 1

Maintenance: Good

Last commit: Mar 18, 2026

Download Protocol

Critical Warning

Docker-based MCP download tools (`download_arxiv`, `download_biorxiv`, etc.) save files INSIDE the container. The container has NO volume mapping to the host filesystem. **Downloaded files will be LOST.**

Always use curl for direct downloads.

Download Priority Chain

Priority	Source	Method	Reliability
1	arXiv	`curl -L -o <dir>/<name>.pdf "https://arxiv.org/pdf/<id>"`	High
2	bioRxiv/medRxiv	`curl -L` from PDF URL	High
3	MDPI / OA repos	`curl -L -A "Mozilla/5.0"` from direct PDF URL	Medium
4	Sci-Hub (Playwright)	Navigate to sci-hub.ru/DOI → extract PDF URL → curl	Medium
5	Paywalled / login-required	Open all in browser via Playwright (user confirms)	Always works

Sci-Hub mirrors (in order): https://sci-hub.ru, https://sci-hub.st

Sci-Hub PDF extraction: Look for `<object type="application/pdf" data="...">` on the page, build the absolute URL, then download it with curl.

Do NOT use: Docker MCP download tools (files lost), ResearchGate curl (Cloudflare 403), IEEE iframe PDF extraction (fragile).
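As a rough illustration of the priority 1 row, the arXiv download might be wrapped as below. This is a sketch only: the function names `arxiv_pdf_url` and `fetch_arxiv_pdf` are hypothetical and not part of the skill, and the `--fail --silent --show-error` flags are an assumption added so that HTTP errors do not leave an error page saved as a `.pdf`.

```shell
# Hypothetical helpers; names are illustrative and not defined by the skill.

# Build the arXiv PDF URL for an identifier.
arxiv_pdf_url() {
  printf 'https://arxiv.org/pdf/%s\n' "$1"
}

# Download one arXiv paper to <dir>/<name>.pdf on the host filesystem.
# -L follows redirects; --fail makes curl exit non-zero on HTTP errors.
fetch_arxiv_pdf() {
  id=$1
  dir=$2
  name=$3
  mkdir -p "$dir" || return 1
  curl -L --fail --silent --show-error \
    -o "$dir/$name.pdf" "$(arxiv_pdf_url "$id")"
}
```

A call would look like `fetch_arxiv_pdf <id> <dir> <name>`, with the three placeholders filled in as in the table row.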

File Naming Convention

<##>_<Author><Year>_<Short_Title>.pdf

Examples:

01_Perumal2013_SPICE_Level3_IGZO.pdf
02_Ghittorelli2014_Analytical_IGZO_Model.pdf
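The naming pattern can be generated mechanically. The following is a minimal sketch, assuming the short title arrives as space-separated words; `paper_filename` is a hypothetical helper name, not something the skill defines.

```shell
# Hypothetical helper: build <##>_<Author><Year>_<Short_Title>.pdf
# $1 = sequence number (no leading zero), $2 = first author,
# $3 = year, $4 = short title with words separated by spaces
paper_filename() {
  # Turn each run of spaces in the title into a single underscore.
  title=$(printf '%s' "$4" | tr -s ' ' '_')
  # Zero-pad the sequence number to two digits.
  printf '%02d_%s%s_%s.pdf\n' "$1" "$2" "$3" "$title"
}

paper_filename 1 Perumal 2013 "SPICE Level3 IGZO"
# prints 01_Perumal2013_SPICE_Level3_IGZO.pdf
```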

Mandatory Post-Download Verification

After every curl download, verify the file is a real PDF:

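size=$(wc -c < "$file" | tr -d ' ')   # byte count; unlike stat -f%z (BSD/macOS only), this also works on Linux
head_bytes=$(head -c 4 "$file")       # first four bytes of the file
# Real PDF: size > 5000 AND head_bytes == "%PDF"
# Otherwise: delete the file, mark as failed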
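The two conditions in the comments can be folded into one reusable check. This is a sketch under the thresholds stated above (more than 5000 bytes, `%PDF` header); the function name `verify_pdf` is hypothetical, and "mark as failed" is represented here only by a non-zero return status.

```shell
# Hypothetical helper: succeed if "$1" looks like a real PDF;
# otherwise delete it and return a non-zero status.
verify_pdf() {
  file=$1
  [ -f "$file" ] || return 1
  size=$(wc -c < "$file" | tr -d ' ')   # byte count
  head_bytes=$(head -c 4 "$file")       # first four bytes
  if [ "$size" -gt 5000 ] && [ "$head_bytes" = "%PDF" ]; then
    return 0
  fi
  rm -f "$file"   # too small or wrong header: likely an HTML error page
  return 1
}
```

A caller could then write `verify_pdf "$file" || echo "failed: $file"` after each curl download and collect the failures for the browser fallback.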

When Programmatic Download Fails — Browser Fallback

Open ALL failed papers in browser tabs at once using Playwright browser_run_code:

async (page) => {
  const urls = [
    /* DOI / IEEE / publisher URLs */
  ];
  const context = page.context();
  for (const url of urls) {
    await context.newPage().then((p) => p.goto(url, { waitUntil: "domcontentloaded", timeout: 15000 }).catch(() => {}));
  }
  return `Opened ${urls.length} tabs`;
};

Key rules:

ASK USER FOR CONFIRMATION before opening browser tabs — show the list of papers and ask "Open N tabs? (y/n)"
Open ALL failed papers at once — one tab per paper
Do NOT try to automate clicking PDF buttons on publisher sites
After opening, tell user: "Opened N tabs. Please click PDF on each page to download."
User has institutional access (e.g., Fudan → IEEE) so browser downloads will work
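If the confirmation step were scripted rather than asked in chat, it might look like the sketch below. This is purely an assumption about one possible wrapper: `confirm_open_tabs` is a hypothetical name, and the skill itself only requires that the list is shown and the user is asked before any tabs are opened.

```shell
# Hypothetical helper: list the failed papers, ask "Open N tabs? (y/n)",
# and succeed only if the user answers y or Y.
# Arguments: one paper title or URL per argument.
confirm_open_tabs() {
  n=$#
  [ "$n" -gt 0 ] || return 1            # nothing to open
  printf 'Failed downloads:\n' >&2
  for paper in "$@"; do
    printf '  - %s\n' "$paper" >&2
  done
  printf 'Open %d tabs? (y/n) ' "$n" >&2
  read -r answer || return 1            # no input counts as "no"
  case $answer in
    y|Y) return 0 ;;
    *)   return 1 ;;
  esac
}
```

Only when the function succeeds would the Playwright snippet above be run with the same list of URLs.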
