Skill

catsu

Use Catsu for unified, high-performance embedding API calls across 11 providers (OpenAI, VoyageAI, Cohere, Jina, Mistral, Gemini, Together AI, Mixedbread, Nomic, DeepInfra, Cloudflare) through a single consistent interface. Covers model selection and discovery, automatic retry with exponential backoff, cost and token tracking, Matryoshka dimension reduction, input type hints (query vs document), async/await support, and per-request API key overrides. Use when: generating embeddings, comparing embedding providers, building search or RAG systems, or integrating embeddings into Python or Rust applications.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/chonkie-skills:catsu

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Catsu provides a single, consistent interface for generating embeddings across 11 providers and 35+ models. Built-in retry logic, cost tracking, and model discovery eliminate the need for provider-specific SDKs.

Supporting Files

references/model_comparison.md

SKILL.md

340 lines · ~2.5k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitMay 6, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Provider	Models	Dimensions	Max Tokens	Input Type	Custom Dims
OpenAI	text-embedding-3-small, 3-large, ada-002	1536, 3072, 1536	8191	No	Yes
VoyageAI	voyage-3, voyage-code-3, voyage-finance-2, voyage-law-2, voyage-multilingual-2, voyage-multimodal-3	1024	32000	Yes	Yes
Cohere	embed-v4.0, embed-english-v3.0, embed-multilingual-v3.0	1024	128000	Required (v3+)	Yes
Jina	jina-embeddings-v4, v3, jina-code-v2	1024	32768	Yes	Yes
Mistral	mistral-embed, codestral-embed-2505	1024	32768	Yes	Yes
Gemini	gemini-embedding-001	768	2048	Yes	Yes (128-3072)
Together AI	BAAI/bge models	1024	8192	No	No
Mixedbread	mxbai-embed models	1024	512	No	No
Nomic	nomic-embed-text-v1.5	768	8192	Yes	Yes
DeepInfra	BAAI/bge models	1024	8192	No	No
Cloudflare	BGE, Qwen models	768-1024	512-8192	No	No

Use Case	Recommended Model	Why
General purpose, low cost	`openai:text-embedding-3-small`	Best price/performance ratio
Highest quality retrieval	`voyageai:voyage-3`	Top MTEB scores
Code search	`voyageai:voyage-code-3` or `jina:jina-code-v2`	Code-optimized training
Legal / finance domain	`voyageai:voyage-law-2` / `voyage-finance-2`	Domain-specific
Multilingual content	`cohere:embed-multilingual-v3.0` or `voyageai:voyage-multilingual-2`	100+ languages
Long documents (128K)	`cohere:embed-v4.0`	128K token context
Free / self-hosted	`together:BAAI/bge-large-en-v1.5`	Open model, low cost
Multimodal (text + images)	`voyageai:voyage-multimodal-3` or `jina:jina-embeddings-v4`	Mixed content

Provider	Models	Dimensions	Max Tokens	Input Type	Custom Dims
OpenAI	text-embedding-3-small, 3-large, ada-002	1536, 3072, 1536	8191	No	Yes
VoyageAI	voyage-3, voyage-code-3, voyage-finance-2, voyage-law-2, voyage-multilingual-2, voyage-multimodal-3	1024	32000	Yes	Yes
Cohere	embed-v4.0, embed-english-v3.0, embed-multilingual-v3.0	1024	128000	Required (v3+)	Yes
Jina	jina-embeddings-v4, v3, jina-code-v2	1024	32768	Yes	Yes
Mistral	mistral-embed, codestral-embed-2505	1024	32768	Yes	Yes
Gemini	gemini-embedding-001	768	2048	Yes	Yes (128-3072)
Together AI	BAAI/bge models	1024	8192	No	No
Mixedbread	mxbai-embed models	1024	512	No	No
Nomic	nomic-embed-text-v1.5	768	8192	Yes	Yes
DeepInfra	BAAI/bge models	1024	8192	No	No
Cloudflare	BGE, Qwen models	768-1024	512-8192	No	No

Use Case	Recommended Model	Why
General purpose, low cost	`openai:text-embedding-3-small`	Best price/performance ratio
Highest quality retrieval	`voyageai:voyage-3`	Top MTEB scores
Code search	`voyageai:voyage-code-3` or `jina:jina-code-v2`	Code-optimized training
Legal / finance domain	`voyageai:voyage-law-2` / `voyage-finance-2`	Domain-specific
Multilingual content	`cohere:embed-multilingual-v3.0` or `voyageai:voyage-multilingual-2`	100+ languages
Long documents (128K)	`cohere:embed-v4.0`	128K token context
Free / self-hosted	`together:BAAI/bge-large-en-v1.5`	Open model, low cost
Multimodal (text + images)	`voyageai:voyage-multimodal-3` or `jina:jina-embeddings-v4`	Mixed content

catsu

Invocation

Context Preview

Supporting Files

SKILL.md

catsu

Invocation

Context Preview

Supporting Files

SKILL.md

Catsu — Unified Embedding API Client

When to Use This Skill

Installation

Setup — API Keys

Basic Usage

Python

Rust

Model Specification

Input Types — Query vs Document

Custom Dimensions (Matryoshka Embeddings)

Supported Providers & Models

Model Discovery

Model Selection Guide

Retry & Error Handling

Explicit error handling

Advanced Configuration

Per-request API key override

HTTP proxy and custom CA

Context managers for cleanup

NumPy conversion

Async Support

Integration with Chonkie

Similar Skills

Catsu — Unified Embedding API Client

When to Use This Skill

Installation

Setup — API Keys

Basic Usage

Python

Rust

Model Specification

Input Types — Query vs Document

Custom Dimensions (Matryoshka Embeddings)

Supported Providers & Models

Model Discovery

Model Selection Guide

Retry & Error Handling

Explicit error handling

Advanced Configuration

Per-request API key override

HTTP proxy and custom CA

Context managers for cleanup

NumPy conversion

Async Support

Integration with Chonkie

Similar Skills