From data-analysis-tools
Use when the user wants to query, analyze, or explore data through the Honeydew semantic layer. Covers structured queries, natural-language questions, and multi-step deep analysis.
How this skill is triggered — by the user, by Claude, or both
Slash command
/data-analysis-tools:queryThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Queries run against the workspace and branch set for the current session. Use `get_session_workspace_and_branch` to check the current context. If no workspace/branch is set, use `list_workspaces`, `list_workspace_branches`, and `set_session_workspace_and_branch` to select one. See the `model-exploration` skill for the full workspace/branch tool reference.
Queries run against the workspace and branch set for the current session. Use get_session_workspace_and_branch to check the current context. If no workspace/branch is set, use list_workspaces, list_workspace_branches, and set_session_workspace_and_branch to select one. See the model-exploration skill for the full workspace/branch tool reference.
Honeydew provides three ways to query data through the semantic layer. Each method suits a different situation — pick the right one based on how well you understand the model and how complex the question is.
| Method | Tool | Best For |
|---|---|---|
| Structured query | get_data_from_fields / get_sql_from_fields | You know the exact fields. Deterministic, full control. |
| Natural language | ask_question_get_data / ask_question_get_sql | Plain English question. Single query, fast answer. |
| Deep analysis | ask_deep_analysis_question | Complex, multi-step, "why" questions. Agentic reasoning. |
get_data_from_fields / get_sql_from_fields)Use when:
list_entities / get_entity)detailed_listings.price by detailed_listings.room_type"Do NOT use when:
How it works:
get_data_from_fields — executes the query and returns data rowsget_sql_from_fields — returns the generated SQL without executing (useful for review, debugging, or handing off to other tools)Both take the same field parameters.
ask_question_get_data / ask_question_get_sql)Use when:
Do NOT use when:
How it works:
ask_question_get_data — translates the question to SQL, executes it, and returns resultsask_question_get_sql — translates the question to SQL and returns the SQL onlyask_deep_analysis_question)Use when:
conversation_id)Do NOT use when:
User asks a data question
│
├─► Do you know the exact field names?
│ │
│ ├─► YES → get_data_from_fields (structured, deterministic)
│ │
│ └─► NO → Can you quickly discover them?
│ │
│ ├─► YES → list_entities / get_entity → then get_data_from_fields
│ │
│ └─► NO → ask_question_get_data (let Honeydew resolve fields)
│
├─► Is it a simple, single-query question in plain English?
│ └─► YES → ask_question_get_data
│
├─► Does it require investigation, "why", trends, or multi-step reasoning?
│ └─► YES → ask_deep_analysis_question
│
└─► Does the user want to see the SQL without running it?
├─► From known fields → get_sql_from_fields
└─► From plain English → ask_question_get_sql
A structured query uses flat field parameters to define what data to retrieve:
attributes — dimensions to group by (columns in the output), e.g. ["entity.attribute_name"]metrics — aggregated measures (SUM, COUNT, AVG, etc.), e.g. ["entity.metric_name"]filters — row-level filters applied before aggregation, e.g. ["entity.field = 'value'"]order_by — sort order for results. Each entry MUST be a quoted string, as if it were a SQL identifier — e.g. ["\"entity.field\" ASC"]. Always wrap the field reference in double quotes inside the string.domain — optional domain name for query contextlimit — max rows to return (default: 100)offset — rows to skip (for pagination)All fields use entity.field_name syntax. Cross-entity fields are supported when relations exist.
Before building a query, discover the available fields:
list_entities — see all entitiesget_entity with entity name — see its attributes, metrics, and relationsget_field with entity and field name — get detailed info about a specific fieldlist_domains — see all available domains (useful before passing domain parameter)search_model with a keyword — find fields across the modelSimple metric query — total count:
Call get_data_from_fields with:
metrics: ["detailed_listings.count"]Dimension breakdown — listings by room type:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["detailed_listings.count"]order_by: ["\"detailed_listings.count\" DESC"]Filtered query — only entire homes:
Call get_data_from_fields with:
attributes: ["detailed_listings.neighbourhood_cleansed"]metrics: ["detailed_listings.count"]filters: ["detailed_listings.room_type = 'Entire home/apt'"]order_by: ["\"detailed_listings.count\" DESC"]Cross-entity query — listings with host info:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type", "dim_host.host_is_superhost"]metrics: ["detailed_listings.count"]order_by: ["\"detailed_listings.count\" DESC"]Using aliases — rename fields or ad-hoc expressions:
You can alias any field or ad-hoc expression using AS "alias_name". This controls the column name in the output.
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["detailed_listings.count AS \"total_listings\"", "AVG(detailed_listings.price) AS \"avg_price\""]order_by: ["\"total_listings\" DESC"]Once aliased, use the alias (not the original expression) in order_by.
Pagination — large result sets:
Call get_data_from_fields with:
attributes: (your fields)metrics: (your metrics)limit: 50 (max rows to return)offset: 100 (skip first 100 rows)Finding duplicate values:
Call get_data_from_fields with:
attributes: ["detailed_listings.host_name"]metrics: ["COUNT(detailed_listings.host_name)"]filters: ["COUNT(detailed_listings.host_name) > 1"]order_by: ["\"COUNT(detailed_listings.host_name)\" DESC"]This groups by the attribute, counts occurrences, and filters to only rows that appear more than once — surfacing duplicates.
SQL preview only:
Call get_sql_from_fields with the same field parameters to see the generated SQL without executing.
Filters use standard comparison expressions: =, >, <, IN (...), ILIKE, SEARCH(...), IS NULL, booleans, date ranges, and AND/OR combinations.
For the complete filter expression reference — including SEARCH, date handling, and type casting — see the filtering skill.
Call with:
question (required): the question in plain Englishmax_rows (required): maximum number of rows to returndomain (required): domain name for query contextquestion: "What are the top 10 neighbourhoods by number of listings?"
max_rows: 10
domain: "my_domain"
Returns the query results directly.
Call with:
question (required): the question in plain Englishdomain (required): domain name for query contextquestion: "What are the top 10 neighbourhoods by number of listings?"
domain: "my_domain"
Returns the generated SQL without executing.
Call with:
question (required): the analysis questiondomain (required): domain name for query contextconversation_id (optional): ID from a previous deep analysis call, for follow-up questionsquestion: "Analyze the relationship between host response time and review scores. Are there significant patterns?"
domain: "my_domain"
Returns:
conversation_id for continuing the conversationAfter a successful ask_deep_analysis_question call, the response includes a ui_url field. Always display this URL to the user so they can view the full analysis in the Honeydew application.
Use conversation_id from the previous response to ask follow-up questions that build on the prior analysis:
question: "Now break this down by room type — does the pattern hold across all types?"
domain: "my_domain"
conversation_id: "<id from previous response>"
For complex tasks, combine methods in sequence:
list_entities / get_entity to understand the modelask_question_get_data to get a quick feel for the dataget_data_from_fields for precise, targeted queriesask_deep_analysis_question for root cause or trend analysisUser: "Help me understand pricing patterns for Airbnb listings."
list_entities → find detailed_listingsget_entity for detailed_listings → find price, room_type, neighbourhood_cleansedask_question_get_data → "What is the average listing price by room type?"get_data_from_fields → price distribution by neighbourhood for Entire homes onlyask_deep_analysis_question → "What factors most influence listing price? Analyze correlations with room type, location, amenities, and reviews."Use the honeydew-docs MCP tools to search the Honeydew documentation when:
Search for topics like: "queries", "perspectives", "dynamic datasets", "parameters", "query API".
To retrieve the distinct (unique) values of a field, include it in attributes and add a count metric in metrics.
Use the entity's built-in count metric (e.g., entity.count) if available, or an ad-hoc count metric using COUNT(entity.field)
on the field whose distinct values you want — never use COUNT(*).
The metric forces aggregation, which groups by the attribute and returns one row per distinct value.
Example — distinct room types:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["COUNT(detailed_listings.room_type)"]order_by: ["\"COUNT(detailed_listings.room_type)\" DESC"]This returns each unique room_type along with its count, ordered by frequency. The count is a useful bonus — it tells you how common each value is — but the key point is that the query returns one row per distinct value.
This pattern is useful for:
list_entities / get_entity before building queries, so you reference real fieldsget_data_from_fields gives you full control and reproducible resultslimit and offset in get_data_from_fields to avoid overwhelming outputget_sql_from_fields or ask_question_get_sql to inspect the generated queryentity.field_name syntax in field parametersnpx claudepluginhub honeydew-ai/honeydew-ai-coding-agents-pluginsQueries and analyzes data through the Honeydew semantic layer, supporting structured field queries and natural language deep analyses.
Runs queries against Omni Analytics' semantic layer via the Omni CLI, interprets results, and chains queries for multi-step analysis. Useful for extracting data from Omni dashboards or workbooks programmatically.
Searches DataHub catalog to discover datasets, find entities by platform/domain, and answer ad-hoc questions about metadata ownership and PII.