From oracle-ai-data-platform-workbench-engineer-agent
Answer business questions over the AIDP lakehouse with Spark SQL. Use when the user asks a data question ("how many…", "top N…", "show me…", "trend of…", "revenue by…") or wants to run ad-hoc Spark SQL. Grounds in .aidp/catalog.md + .aidp/semantic.md and reuses validated verified queries before generating SQL, then executes via the bundled aidp_sql.py helper.
How this skill is triggered — by the user, by Claude, or both
Slash command
/oracle-ai-data-platform-workbench-engineer-agent:aidp-analyzing-dataThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Answer business questions by grounding in the catalog/semantic model, reusing verified queries when
aidp-analyzing-data — natural language → Spark SQLAnswer business questions by grounding in the catalog/semantic model, reusing verified queries when
possible, then executing Spark SQL via the bundled scripts/aidp_sql.py helper.
Source is an external / non-lakehouse system (Fusion, EPM, Oracle ADB/ExaCS, Snowflake, S3, …)? This skill is lakehouse-native Spark SQL. To pull from an external source, use the
oracle-ai-data-platform-workbench-spark-connectorsplugin'saidp-<source>skill (install it if absent; run itsaidp-connectors-bootstrapskill once to push the helper package to the cluster), oraidp-federateto join across sources.
.aidp/verified-queries.md; if a verified: true entry closely matches
the question (similar text + table overlap), reuse its SQL (adapt only dates/bind values) and say so..aidp/catalog.md + .aidp/semantic.md: map concepts→tables via Quick
Reference/synonyms, use recorded join keys (don't guess joins), use value dictionaries for WHERE
literals, prefer metric SQL expressions from the semantic model. If the catalog cache is missing, run
aidp-catalog-init first.python "$PLUGIN_DIR/scripts/aidp_sql.py" \
--region <region> --datalake <DATALAKE_OCID> --workspace <ws> --cluster <cluster-key> \
--code "spark.sql('''<SQL>''').show(50, truncate=False)"
Returns JSON {status, execution_count, outputs, spark_job_ids, error}. Each invocation runs the cell;
keep the same <SQL> shape across follow-ups. Smoke-test connectivity with --code "spark.sql('SELECT 1').show()"..aidp/catalog.md and/or register
the working query via aidp-verified-queries (which validates before marking it verified).SHOW COLUMNS / DESCRIBE cell) or ask.catalog.schema.table). Default catalog/schema only when the user implies them.
This includes metadata commands: use SHOW TABLES IN <catalog>.<schema> (e.g. SHOW TABLES IN default.default),
not the unqualified SHOW TABLES IN default — the bare form raises AnalysisException: [SCHEMA_NOT_FOUND]
because default resolves as a catalog, not a schema.error field, fix grounded in the catalog, and
retry — don't guess repeatedly.aidp-cluster-ops); the helper attaches to the
cluster you pass via --cluster.ai_generate) see aidp-ai-sql; for cross-source joins see aidp-federate.npx claudepluginhub ahmedawan-oracle/claude-code-plugins --plugin oracle-ai-data-platform-workbench-engineer-agentGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.