Stats
Actions
Tags
From data-science
Black Duck data product analytics — catalog structure, Spark SQL patterns, common gotchas, query standards
How this skill is triggered — by the user, by Claude, or both
Slash command
/data-science:data-analyticsThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Three catalog layers, each with multiple schemas:
Three catalog layers, each with multiple schemas:
| Catalog | Schemas | Purpose |
|---|---|---|
data_product_marts | core, enterprise, product-specific | Analytics-ready; start here |
data_product_intermediate | core, enterprise, product-specific | Shared calculations used across marts |
data_product_staging | product-specific only | Raw product-native data |
core (marts + intermediate): Cross-product entities — customers, applications, findings, scans, targetsenterprise (marts + intermediate): Cross-product analyticsllm_gateway, polaris, cd, etc.Default starting point: data_product_marts.core
model_group, proxy_base_url, team_alias in llm_gateway tables are VARIANT — cast to STRING before groupingmcp__superduck__data_products_list_data_products() — list available tablesmcp__superduck__data_products_describe_data_product(full_table_name) — schema + known issuescatalog.schema.tabledata_product_marts.core.customersdescribe_data_product()-- Customer overview
SELECT customer_name, COUNT(DISTINCT application_id) AS app_count
FROM data_product_marts.core.customers c
JOIN data_product_marts.core.applications a ON c.customer_id = a.customer_id
GROUP BY customer_name LIMIT 20;
-- Security posture by severity
SELECT severity, COUNT(*) AS finding_count
FROM data_product_marts.core.findings
WHERE finding_status = 'open'
GROUP BY severity ORDER BY finding_count DESC;
-- Scan volume trends
SELECT tool_type, DATE_TRUNC('month', created_at) AS month, COUNT(*) AS scan_count
FROM data_product_marts.core.scans
WHERE created_at >= DATE_SUB(CURRENT_DATE(), 365)
GROUP BY tool_type, month ORDER BY month DESC, scan_count DESC;
-- LLM Gateway weekly summary (VARIANT cast required)
SELECT COUNT(*) AS total_requests,
COUNT(DISTINCT user_id) AS unique_users,
SUM(total_tokens) AS total_tokens,
ROUND(SUM(response_cost), 2) AS cost_usd
FROM data_product_staging.llm_gateway.stg_llm_gateway__llm_gateway_usage
WHERE created_at >= 'YYYY-MM-DD' AND created_at < 'YYYY-MM-DD';
-- LLM Gateway by model (VARIANT cast)
SELECT CAST(model_group AS STRING) AS model, COUNT(*) AS requests
FROM data_product_staging.llm_gateway.stg_llm_gateway__llm_gateway_usage
WHERE created_at >= 'YYYY-MM-DD'
GROUP BY CAST(model_group AS STRING) ORDER BY requests DESC LIMIT 10;
Guides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub bd-vitalstatistix/claude-marketplace --plugin data-science