Search everything...

Stats

Actions

Available In

truesight

Name: truesight
Author: goodeye-labs

By Goodeye-Labs

MCP server and agent skills for the Truesight AI quality platform. Score inputs, build evaluations, analyze errors, and review results through natural language.

npx claudepluginhub goodeye-labs/truesight-mcp-skills

Popularity

Stars

Top 25%

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Skills9

bootstrap-template-evaluation

/bootstrap-template-evaluation

Fastest route to a deployed live evaluation using a pre-built Truesight template. Use when the user wants a quick start without building judgment configs from scratch.

build-review-interface

/build-review-interface

Build a custom web interface for trace annotation and review. Use when users need a bespoke review surface for their workflow.

create-evaluation

/create-evaluation

Scope what quality should be measured, convert it into one or more actionable binary evaluations, deploy those evaluations through Truesight MCP, and generate a companion skill that applies them correctly. Use when a user wants to create new evals, quality checks, guardrails, or pass/fail criteria for AI outputs.

error-analysis

/error-analysis

Systematically identify and categorize failure modes in evaluated traces using Truesight datasets and error-analysis tools. Use when quality issues are unclear, after major pipeline changes, or when incidents indicate drift.

eval-audit

/eval-audit

Audit an existing evaluation workflow and produce severity-ranked findings with concrete next actions. Use when inheriting an eval setup, diagnosing quality regressions, or checking LLM evaluation process maturity.

MCP Servers1

truesight

External

Stats

Version1.1.0

Stars6

Forks1

MaintenanceExcellent

LicenseMIT

Last CommitMar 26, 2026

AddedMar 22, 2026

Actions

View on GitHub View README Plugin Marketplace JSON Homepage

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Safety Signals

Caution

External network access

Connects to servers outside your machine

README

Truesight MCP Skills

Agent skills and Cursor plugin for the Truesight MCP. Step-by-step workflow playbooks for scoring inputs, building live evaluations, error analysis, and the review loop.

Works with Claude Code, Cursor, and any client that supports the agent skills standard.

Install as a Claude plugin

In Claude Code, run:

# Step 1: Register this repository as a marketplace
/plugin marketplace add Goodeye-Labs/truesight-mcp-skills

# Step 2: Install the plugin
/plugin install truesight@goodeye-labs-truesight

This installs the Truesight plugin and its MCP skills in Claude, including truesight-workflows, create-evaluation, and companion workflow skills.

To upgrade:

/plugin update truesight@goodeye-labs-truesight

Install skills manually

If you installed via Claude Marketplace above, you can skip this. Use the manual commands below to install skill files directly (works with Claude Code, Cursor, and other clients).

Project-level (recommended for team workflows)

BASE=https://raw.githubusercontent.com/Goodeye-Labs/truesight-mcp-skills/main/skills
for skill in truesight-workflows evaluate-trace error-analysis generate-synthetic-data review-and-promote-traces bootstrap-template-evaluation create-evaluation eval-audit build-review-interface; do
  curl -fsSL "$BASE/$skill/SKILL.md" -o ".claude/skills/$skill/SKILL.md" --create-dirs
done

Global (available in all projects)

BASE=https://raw.githubusercontent.com/Goodeye-Labs/truesight-mcp-skills/main/skills
for skill in truesight-workflows evaluate-trace error-analysis generate-synthetic-data review-and-promote-traces bootstrap-template-evaluation create-evaluation eval-audit build-review-interface; do
  curl -fsSL "$BASE/$skill/SKILL.md" -o "$HOME/.claude/skills/$skill/SKILL.md" --create-dirs
done

Skills

Skill	What it does
`truesight-workflows`	Strict orchestrator that routes to the correct Truesight MCP skill based on user intent
`generate-synthetic-data`	Create diverse synthetic test inputs using dimension-based variation for evaluation bootstrapping
`error-analysis`	Analyze traces in datasets, label failure modes, consolidate categories, and prioritize fixes
`bootstrap-template-evaluation`	Provision a template dataset and deploy a live evaluation quickly
`create-evaluation`	Scope, build, and deploy new custom live evaluations from scratch
`evaluate-trace`	Evaluate one or more inputs against an existing live evaluation, with optional handoff to review flows
`review-and-promote-traces`	Review flagged traces, submit judgments, and promote judged items back to datasets
`eval-audit`	Audit evaluation workflow maturity and return severity-ranked findings with next-skill actions
`build-review-interface`	Build a custom web annotation interface when Truesight web UI is not the preferred review surface

Usage

Once the MCP is connected and skills are installed, your AI assistant will automatically pick up the right skill based on what you ask:

"I need help choosing the right Truesight workflow": triggers truesight-workflows
"Generate synthetic test data for my RAG pipeline": triggers generate-synthetic-data
"Analyze the errors in my dataset": triggers error-analysis
"Bootstrap a live eval from a template": triggers bootstrap-template-evaluation
"Create an evaluation for response quality": triggers create-evaluation
"Evaluate these traces against my live eval": triggers evaluate-trace
"Review and promote these flagged results": triggers review-and-promote-traces
"Audit my eval setup and tell me what is missing": triggers eval-audit
"Help me build a custom annotation interface for trace review": triggers build-review-interface

Prerequisites

Some skills (like generate-synthetic-data and build-review-interface) work without any Truesight account. For skills that use the Truesight MCP, you need a free Truesight account. When prompted, sign in to authorize access. All tools are available based on your account permissions.

Want more control over permissions? You can also connect using a Platform API Key instead. See Connecting with an API key below.

Connect the MCP

Claude.ai and Claude Desktop

View full README on GitHub

truesight

Popularity

What's Inside

Confidence

README

Truesight MCP Skills

Install as a Claude plugin

Install skills manually

Project-level (recommended for team workflows)

Global (available in all projects)

Skills

Usage

Prerequisites

Connect the MCP

Claude.ai and Claude Desktop

Similar Plugins

agent-eval-harness

self-care

evalview

ai-experiment-logger

evaluation

agentic-usability

Truesight MCP Skills

Install as a Claude plugin

Install skills manually

Project-level (recommended for team workflows)

Global (available in all projects)

Skills

Usage

Prerequisites

Connect the MCP

Claude.ai and Claude Desktop

Popularity

Health & Quality

Similar Plugins

agent-eval-harness

self-care

evalview

ai-experiment-logger

evaluation

agentic-usability