From voice-ai-integration
Integrates Shengwang products: ConvoAI voice agents, RTC audio/video, RTM messaging, Cloud Recording, and token generation. Routes workflows for voice AI, video calls, live streaming, and recording tasks.
How this skill is triggered — by the user, by Claude, or both
Slash command
/voice-ai-integration:voice-ai-integrationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> **⚠️ Execute this BEFORE any routing or code generation.**
references/cloud-recording/README.mdreferences/conversational-ai/README.mdreferences/conversational-ai/advanced.mdreferences/conversational-ai/common-errors.mdreferences/conversational-ai/convoai-restapi/agent-interrupt.mdreferences/conversational-ai/convoai-restapi/agent-speak.mdreferences/conversational-ai/convoai-restapi/agent-update.mdreferences/conversational-ai/convoai-restapi/get-agent-list.mdreferences/conversational-ai/convoai-restapi/get-history.mdreferences/conversational-ai/convoai-restapi/index.mdxreferences/conversational-ai/convoai-restapi/query-agent-status.mdreferences/conversational-ai/convoai-restapi/start-agent.mdreferences/conversational-ai/convoai-restapi/stop-agent.mdreferences/conversational-ai/generation-rules.mdreferences/conversational-ai/pipeline-integration.mdreferences/conversational-ai/providers.mdreferences/conversational-ai/quickstart.mdreferences/conversational-ai/sample-repos.mdreferences/doc-fetching.mdreferences/general/credentials-and-auth.md⚠️ Execute this BEFORE any routing or code generation.
Check if references/docs.txt already exists. If it does, skip this step entirely.
If it does not exist, download it:
bash skills/voice-ai-integration/scripts/fetch-docs.sh
This downloads a static doc index from doc.shengwang.cn — no user data is sent.
If download fails, proceed with local reference docs and fallback URLs.
Progressive disclosure rule: Only read the file that the current step points to. Do not scan the
references/directory or pre-read files that have not been linked from the current step. Each product module's README.md will tell you which files to read next and when. If a file is not referenced by the document you are currently reading, do not open it.
Match the user's request to a product module using the route table. If the match is clear, route directly — do not ask extra questions.
| User intent | Route to |
|---|---|
| Credentials, AppID, REST auth | general |
| Generate Token, token server, AccessToken2 | token-server |
| ConvoAI voice agent work | conversational-ai |
| RTC SDK integration | rtc |
| RTM messaging / signaling | rtm |
| Cloud Recording | cloud-recording |
| Download SDK, sample project, GitHub repo | Route to the relevant product module above |
When the user describes a use case without naming a product, use this to infer the match:
| Product | What it does | Typical user says |
|---|---|---|
| ConvoAI | AI voice agent (ASR→LLM→TTS over RTC) | "AI语音", "voice bot", "对话式AI", "AI agent", "AI 客服" |
| RTC SDK | Real-time audio/video between humans | "视频通话", "直播", "video call", "live streaming" |
| RTM | Real-time messaging / signaling | "聊天", "消息", "chat", "signaling" |
| Cloud Recording | Record RTC sessions server-side | "录制", "recording", "存档", "回看" |
| Token generation | Generate RTC / RTM tokens | "token", "鉴权", "token server" |
| Use case | Products needed |
|---|---|
| AI voice assistant | ConvoAI (primary) + RTC SDK (client) |
| AI voice assistant + chat history | ConvoAI + RTC SDK + RTM |
| 1v1 / group video call | RTC SDK |
| Video call + chat | RTC SDK + RTM |
| Live streaming with recording | RTC SDK + Cloud Recording |
| Record AI conversations | ConvoAI + RTC SDK + Cloud Recording |
| Chat / messaging only | RTM |
If the route table and recognition aid above are not enough to determine the product:
ZH:
已了解的信息
─────────────────────────────
场景: [use case]
主要产品: [primary product]
配套产品: [supporting products / 无]
─────────────────────────────
EN:
What I have so far
─────────────────────────────
Use case: [use case]
Primary: [primary product]
Supporting: [supporting products / none]
─────────────────────────────
Do not stop for a separate confirmation step — continue to the product module automatically.
Each product module follows its own workflow. Do not duplicate implementation logic here.
Common pattern across modules:
references/ firstbash and curl for local doc-fetch helper scriptsgit for sample-repo inspection when the sample-aligned path is chosendoc.shengwang.cn, doc-mcp.shengwang.cn, and gitee.comNetwork behavior:
fetch-docs.sh downloads a static file from doc.shengwang.cn/llms.txt — no user data is sentfetch-doc-content.sh fetches a single doc page by URI from doc-mcp.shengwang.cn — only the doc URI is sent, no user contextgit clone is used only for sample repo inspection from gitee.com — only the repo URL is sentCredential and service-activation requirements vary by product — see each product module and general/credentials-and-auth.md for details. Never hardcode credentials.
git clone --depth 1 <url> with HTTPS repo root URLs onlynpx claudepluginhub shengwang-community/skills --plugin voice-ai-integrationGuides building voice/video apps with Agora SDKs, including AI agents, calls, live streaming, screen sharing, messaging, recording, and CLI operations.
Builds real-time voice AI applications and agents using OpenAI Realtime API, Vapi, Deepgram for transcription, ElevenLabs for synthesis, LiveKit, and WebRTC fundamentals. Optimizes latency and audio quality.
Builds ElevenLabs conversational AI voice agents: configure via CLI/dashboard, add tools/knowledge, integrate React/React Native/Swift/JS SDKs, test/deploy. For voice AI, phone systems, or ElevenLabs errors.