Search everything...

Stats

Actions

Available In

dqx

Name: dqx
Author: databrickslabs

By databrickslabs

Profile PySpark DataFrames or Unity Catalog tables with AI to generate data quality rule candidates, define rules via Python classes or YAML, validate against DQEngine, run end-to-end checks splitting valid/quarantined rows, and persist rules to Delta tables, volumes, or Lakebase.

npx claudepluginhub databrickslabs/dqx --plugin dqx

Popularity

Stars

Top 5%

405

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Skills5

dqx-define-checks

/dqx-define-checks

Create DQX quality rules (checks) for a PySpark DataFrame or Delta table. Use when the user asks to "add a DQX check", "define a data quality rule", "validate that column X is not null / unique / in a set", or wants checks expressed in YAML/JSON for storage. Covers DQRowRule, DQDatasetRule, DQForEachColRule, built-in check_funcs, filters, user_metadata, custom SQL/Python checks, and the declarative metadata form.

dqx-apply-checks

/dqx-apply-checks

Validate a PySpark DataFrame or Delta table against a set of DQX quality rules using DQEngine. Use when the user asks to "run data quality checks", "apply DQX rules to a DataFrame/table", "split valid and invalid rows", "quarantine bad records", or "integrate DQX into a streaming pipeline". Covers apply_checks, apply_checks_and_split, the by_metadata variants, and the shape of the result columns.

dqx-end-to-end

/dqx-end-to-end

Run DQX validation end-to-end — read an input table or path, apply checks, and write valid and quarantined rows to output locations — in a single call. Use when the user asks for "apply and save", "quality-check a table and split the output", "DQX on a whole table", "save valid and invalid rows", or wants to drop DQX into a Lakeflow / workflow that runs on a table or path. Covers apply_checks_and_save_in_table, the by_metadata variant, InputConfig / OutputConfig, and incremental streaming mode.

dqx-profile-and-generate

/dqx-profile-and-generate

Profile a DataFrame or table and generate DQX quality rule candidates with summary statistics. Use when the user asks to "profile a table", "generate DQX rules from data", "suggest data quality checks", "bootstrap a checks.yml", or "generate DLT expectations". Covers DQProfiler, DQGenerator, DQDltGenerator, the profiler workflow, sampling / filter options, and AI-assisted variants.

dqx-storage

/dqx-storage

Load and save DQX checks (quality rules) to a file, workspace path, Unity Catalog volume, Delta table, Lakebase, or the DQX installation folder. Use when the user asks to "load DQX checks from YAML", "save checks to a Delta table", "read checks from a volume", "share checks across notebooks", or "use the DQX workspace install's default checks location". Covers every *ChecksStorageConfig and the matching load/save calls.

Stats

Version0.1.0

ReleasedFeb 9, 2026

Stars405

Forks111

MaintenanceGood

LicenseSEE LICENSE IN LICENSE

Last CommitMay 4, 2026

AddedMay 4, 2026

Actions

View on GitHub View README Plugin Marketplace JSON Homepage

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

databrickslabs-dqx405

dqx

Popularity

What's Inside

Confidence

README

DQX by Databricks Labs

📖 Documentation

🛠️ Contribution

💬 Project Support

Similar Plugins

databricks-ai-dev-kit

bauplan

agentspec

databricks-pack

DQX by Databricks Labs

📖 Documentation

🛠️ Contribution

💬 Project Support

Popularity

Health & Quality

Similar Plugins

databricks-ai-dev-kit

bauplan

agentspec

databricks-pack

datahub-skills

databricks-skills