feat: add `elementary_extra_indexes` config var for custom index injection by haritamar · Pull Request #1019 · elementary-data/dbt-data-reliability

haritamar · 2026-06-02T23:15:18Z

Summary

Adds a new elementary_extra_indexes dbt var (default {}) that lets downstream consumers inject additional PostgreSQL indexes into Elementary models without modifying this package.

New macro: elementary.get_indexes_for_model(model_name, base_indexes) — merges the model's built-in indexes with any extras from elementary_extra_indexes, deduplicating by sorted column set. Returns [] on non-postgres targets.

Usage — pass a dict mapping model names to index configs via --vars:

elementary_extra_indexes:
  dbt_run_results:
    - columns: [\"created_at\"]
  elementary_test_results:
    - columns: [\"created_at\"]

All 5 models with postgres indexes now use get_indexes_for_model: dbt_run_results, elementary_test_results, data_monitoring_metrics, dbt_source_freshness_results, test_result_rows.

Motivation: Elementary Cloud needs cloud-specific indexes (e.g. (created_at) for time-range queries) that don't belong in the OSS defaults. This avoids polluting the base package while keeping index definitions co-located with models."

Link to Devin session: https://app.devin.ai/sessions/278ceddbe6d24dbea36a83ecc232f3b6
Requested by: @haritamar

Summary by CodeRabbit

New Features
- Added a centralized index helper and a new default config key for extra indexes.
Refactor
- Updated multiple models to use the centralized index helper for index definitions.
- Centralized index merging and deduplication, keeping Postgres-specific handling intact.
Chores
- Reduced inline DB-specific conditionals in model configurations for consistency.

Add a new get_indexes_for_model macro that merges base indexes with extra indexes passed via the elementary_extra_indexes dbt var. This allows downstream consumers (e.g. Elementary Cloud) to inject additional indexes without modifying the OSS package. The macro deduplicates indexes by sorted column set to prevent duplicate index creation when base and extra indexes overlap. Updated all models with postgres indexes to use get_indexes_for_model: - dbt_run_results - elementary_test_results - data_monitoring_metrics - dbt_source_freshness_results - test_result_rows Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration · 2026-06-02T23:15:21Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment, CI, and merge conflict monitoring

github-actions · 2026-06-02T23:15:30Z

👋 @haritamar
Thank you for raising your pull request.
Please make sure to add tests and document all user-facing changes.
You can do this by editing the docs files in the elementary repository.

coderabbitai · 2026-06-02T23:15:31Z

Warning

Review limit reached

@devin-ai-integration[bot], we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 6 minutes and 58 seconds. Learn how PR review limits work.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans include higher PR review limits than trial, open-source, and free plans. In all cases, reviews become available again over time. During sustained high-volume PR review activity, CodeRabbit may temporarily slow when the next review becomes available.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 5f645199-508e-4c2e-95d2-8079e4825044

📥 Commits

Reviewing files that changed from the base of the PR and between d4a280c and 1510b03.

📒 Files selected for processing (1)

macros/edr/system/system_utils/get_indexes_for_model.sql

📝 Walkthrough

Walkthrough

Adds a default elementary_extra_indexes config key and a new macro get_indexes_for_model; five dbt models are updated to call this macro for their indexes config instead of using inline Postgres-only conditionals.

Changes

Index Configuration Refactor

Layer / File(s)	Summary
Index infrastructure foundation `macros/edr/system/system_utils/get_config_var.sql`, `macros/edr/system/system_utils/get_indexes_for_model.sql`	Registers `elementary_extra_indexes` in defaults and introduces `get_indexes_for_model` which returns `[]` for non-Postgres or a merged, deduplicated list of `base_indexes` + per-model `elementary_extra_indexes` for Postgres.
Model migration to helper macro `models/edr/data_monitoring/data_monitoring/data_monitoring_metrics.sql`, `models/edr/dbt_artifacts/dbt_run_results.sql`, `models/edr/run_results/dbt_source_freshness_results.sql`, `models/edr/run_results/elementary_test_results.sql`, `models/edr/run_results/test_result_rows.sql`	Five models replace inline `target.type == "postgres"` index conditionals with calls to `elementary.get_indexes_for_model("<model_name>", [...])`, passing the same base index column sets.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 A helper springs from burrowed ground,
Five models dance in tidy round.
Indexes merged, duplicates undone,
One little macro — work all done. ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: adding a new `elementary_extra_indexes` config variable for custom index injection, which is the core feature introduced across all modified files.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch devin/1780441951-add-extra-indexes-config

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (3)

macros/edr/system/system_utils/get_indexes_for_model.sql (2)
4-4: 💤 Low value

Use elementary.get_config_var for consistency with the new default config key.

You added elementary_extra_indexes to default__get_default_config(), but here it's read via raw var(...), bypassing the dispatched default config and runtime-config plumbing. Functionally equivalent today (default is {}), but reading it through the config accessor keeps a single source of truth.
♻️ Suggested change
-    {% set extra_indexes_config = var("elementary_extra_indexes", {}) %}
+    {% set extra_indexes_config = elementary.get_config_var("elementary_extra_indexes") %}
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@macros/edr/system/system_utils/get_indexes_for_model.sql` at line 4, Replace
the direct var(...) call with the config accessor to honor the new default
config plumbing: change the assignment of extra_indexes_config (currently using
var("elementary_extra_indexes", {})) to use
elementary.get_config_var("elementary_extra_indexes", {}) so the macro
get_indexes_for_model reads the value through the centralized config accessor
rather than raw var().
10-24: 💤 Low value

Clarify index dedup/override semantics so unique/type aren’t ignored when column sets match.

macros/edr/system/system_utils/get_indexes_for_model.sql (lines 10-24) dedups by idx.columns | sort | join(","), so entries that differ only by unique/type (or by column order) are silently dropped; base_indexes always wins for an identical column set.

macros/commands/create_indexes.sql also only checks columns when deciding whether to create a new index, so a requested unique/type change can be skipped if any index with the same column set already exists.

If an elementary_extra_indexes entry is missing columns, dbt compilation will fail because the macro unconditionally reads idx.columns.

Document the “dedup by column set; base wins” precedence (or update the dedup/existence key to also consider unique/type), and optionally guard/validate extras require columns.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@macros/edr/system/system_utils/get_indexes_for_model.sql` around lines 10 -
24, The current index-merge logic in get_indexes_for_model.sql (looping
base_indexes and extra_indexes, tracking seen_column_sets and appending to
merged_indexes) dedups solely by sorted column set which ignores differing
attributes like unique and type and also assumes idx.columns exists; update the
dedup/existence key to include attribute equality (e.g., include idx.unique and
idx.type alongside the sorted, order-normalized columns) so extra_indexes with
different unique/type are not silently dropped (and ensure column order is
normalized via sort/join), and add a validation/guard for
elementary_extra_indexes entries so you check idx.columns exists before reading
it (or raise a clear error); mirror the same comparison change in
create_indexes.sql where existence is checked so creation respects unique/type
differences.
models/edr/run_results/test_result_rows.sql (1)
1-1: 💤 Low value

Stale comment. The Postgres-only gating now lives in get_indexes_for_model, so this top-of-file note is redundant and could be removed (or trimmed) to avoid implying the model still does inline warehouse gating.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@models/edr/run_results/test_result_rows.sql` at line 1, Remove or trim the
stale top-of-file comment in models/edr/run_results/test_result_rows.sql that
states "indexes are not supported in all warehouses, relevant to postgres only"
since warehouse-specific gating is handled in get_indexes_for_model; update the
comment to reflect current behavior (or delete it) so it no longer implies
inline warehouse gating in this file and instead points readers to
get_indexes_for_model for Postgres-specific logic.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@macros/edr/system/system_utils/get_indexes_for_model.sql`:
- Line 4: Replace the direct var(...) call with the config accessor to honor the
new default config plumbing: change the assignment of extra_indexes_config
(currently using var("elementary_extra_indexes", {})) to use
elementary.get_config_var("elementary_extra_indexes", {}) so the macro
get_indexes_for_model reads the value through the centralized config accessor
rather than raw var().
- Around line 10-24: The current index-merge logic in get_indexes_for_model.sql
(looping base_indexes and extra_indexes, tracking seen_column_sets and appending
to merged_indexes) dedups solely by sorted column set which ignores differing
attributes like unique and type and also assumes idx.columns exists; update the
dedup/existence key to include attribute equality (e.g., include idx.unique and
idx.type alongside the sorted, order-normalized columns) so extra_indexes with
different unique/type are not silently dropped (and ensure column order is
normalized via sort/join), and add a validation/guard for
elementary_extra_indexes entries so you check idx.columns exists before reading
it (or raise a clear error); mirror the same comparison change in
create_indexes.sql where existence is checked so creation respects unique/type
differences.

In `@models/edr/run_results/test_result_rows.sql`:
- Line 1: Remove or trim the stale top-of-file comment in
models/edr/run_results/test_result_rows.sql that states "indexes are not
supported in all warehouses, relevant to postgres only" since warehouse-specific
gating is handled in get_indexes_for_model; update the comment to reflect
current behavior (or delete it) so it no longer implies inline warehouse gating
in this file and instead points readers to get_indexes_for_model for
Postgres-specific logic.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2036b863-d7a3-464c-a3b9-70e641dc3d52

📥 Commits

Reviewing files that changed from the base of the PR and between bf2747f and 11249d5.

📒 Files selected for processing (7)

macros/edr/system/system_utils/get_config_var.sql
macros/edr/system/system_utils/get_indexes_for_model.sql
models/edr/data_monitoring/data_monitoring/data_monitoring_metrics.sql
models/edr/dbt_artifacts/dbt_run_results.sql
models/edr/run_results/dbt_source_freshness_results.sql
models/edr/run_results/elementary_test_results.sql
models/edr/run_results/test_result_rows.sql

Address CodeRabbit review feedback: - Use elementary.get_config_var for consistency with centralized config - Remove stale warehouse-gating comment from test_result_rows.sql Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration · 2026-06-02T23:21:35Z

+{% macro get_indexes_for_model(model_name, base_indexes) %}
+    {%- if target.type != "postgres" -%} {{ return([]) }} {%- endif -%}
+
+    {% set extra_indexes_config = elementary.get_config_var(


Good catch — switched to elementary.get_config_var("elementary_extra_indexes") in d4a280c.

devin-ai-integration · 2026-06-02T23:21:42Z

+    {% set extra_indexes = extra_indexes_config.get(model_name, []) %}
+
+    {% set seen_column_sets = [] %}
+    {% set merged_indexes = [] %}


Intentionally keeping dedup by column set only. The current use case is purely adding simple B-tree indexes (no unique/type variants), and the create_indexes.sql macro also only checks column sets when deciding whether to create. Adding unique/type to the dedup key would be over-engineering for the current need. If that changes in the future, we can extend it then.

MikaKerman · 2026-06-03T13:04:51Z

@@ -0,0 +1,29 @@
+{% macro get_indexes_for_model(model_name, base_indexes) %}
+    {%- if target.type != "postgres" -%} {{ return([]) }} {%- endif -%}


Nit: now that the Postgres gating moved from the individual model files into this macro, it would be good to document why — something like:

{# dbt's indexes config is Postgres-only. Returns [] for all other targets. #}

Good call — added the comment in 1510b03.

MikaKerman · 2026-06-03T13:04:52Z

+    {% set extra_indexes_config = elementary.get_config_var(
+        "elementary_extra_indexes"
+    ) %}
+    {% set extra_indexes = extra_indexes_config.get(model_name, []) %}


Nit: if a user passes an entry missing the columns key (e.g. a typo like {"column": ["created_at"]}), idx.columns will cause a cryptic Jinja compilation error. A small guard would make the error much clearer:

{% for idx in extra_indexes %} {% if idx.columns is not defined %} {{ exceptions.raise_compiler_error( "elementary_extra_indexes entry for '" ~ model_name ~ "' is missing 'columns': " ~ idx ) }} {% endif %} ... {% endfor %}

Not a blocker — the compile error would still point to the right area — but would improve DX for anyone configuring this var.

Good suggestion — added the guard in 1510b03. It now raises a clear compiler error pointing to the model name and the offending entry.

MikaKerman · 2026-06-03T13:04:52Z

+    ) %}
+    {% set extra_indexes = extra_indexes_config.get(model_name, []) %}
+
+    {% set seen_column_sets = [] %}


Nit: the dedup logic only considers the sorted column set, so an extra index with {"columns": ["id"], "unique": true} would be silently dropped if the base already has {"columns": ["id"]}. I see Devin's response that this is intentional for now — makes sense. A one-line comment would help future readers understand the precedence:

{# Dedup by sorted column set only. Base indexes take precedence on conflicts. #}

Added the comment in 1510b03 — {# Dedup by sorted column set only. Base indexes take precedence on conflicts. #}.

MikaKerman

Looks good! A few minor nits posted as inline comments — none are blockers.

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration Bot assigned haritamar Jun 2, 2026

coderabbitai Bot reviewed Jun 2, 2026

View reviewed changes

refactor: use get_config_var and remove stale comment

d4a280c

Address CodeRabbit review feedback: - Use elementary.get_config_var for consistency with centralized config - Remove stale warehouse-gating comment from test_result_rows.sql Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration Bot reviewed Jun 2, 2026

View reviewed changes

haritamar mentioned this pull request Jun 3, 2026

chore: update dbt-data-reliability to include elementary_extra_indexes support elementary-data/elementary#2265

Merged

MikaKerman reviewed Jun 3, 2026

View reviewed changes

MikaKerman approved these changes Jun 3, 2026

View reviewed changes

address review nits: add comments and columns validation guard

1510b03

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

haritamar merged commit 3dc104a into master Jun 3, 2026
28 of 30 checks passed

haritamar deleted the devin/1780441951-add-extra-indexes-config branch June 3, 2026 15:35

		@@ -0,0 +1,29 @@
		{% macro get_indexes_for_model(model_name, base_indexes) %}
		{%- if target.type != "postgres" -%} {{ return([]) }} {%- endif -%}

Conversation

haritamar commented Jun 2, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Summary by CodeRabbit

Uh oh!

devin-ai-integration Bot commented Jun 2, 2026

🤖 Devin AI Engineer

Uh oh!

github-actions Bot commented Jun 2, 2026

Uh oh!

coderabbitai Bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

MikaKerman Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

MikaKerman Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

MikaKerman Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

MikaKerman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

haritamar commented Jun 2, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 2, 2026 •

edited

Loading