Agents need context to answer questions about warehouse data. Agents Schema puts
that context in the warehouse itself, in a standard AGENTS schema, so agents
can query metadata next to the data they are reasoning over. See
Why Agents Schema for more on the idea behind it and
SPEC.md for the schema contract.
This repository provides GitHub workflows that ingest source metadata from
your repository and publish it into AGENTS.
Run one of the workflows below to populate the AGENTS schema from a source
you already have. Once it's populated, anything that already queries your
warehouse can read those tables as ordinary SQL, including Cursor, Claude
Code, notebooks, and internal agents. The fastest path is usually dbt: if your
repo already produces target/manifest.json, the workflow only needs the dbt
project path and your warehouse credentials.
After the first run, your warehouse has queryable metadata tables such as
AGENTS.DBT_MODEL, AGENTS.LOOKML_VIEW, or AGENTS.OSI_DATASET. Agents can
use those tables to understand which models and semantic objects exist, how
they are documented, how they relate to the warehouse, and what context is
available before writing or explaining queries.
There are three supported metadata sources. Pick one to get started quickly.
Each workflow writes to your warehouse using a single GitHub Actions secret:
WAREHOUSE_CREDENTIALS. The source-specific setup guides show the expected
secret shape and the workflow YAML to copy.
Use dbt Setup Guide when your repository contains a dbt project
or an existing target/manifest.json.
Use Looker Setup Guide when your repository contains LookML files.
Use OSI Setup Guide when your repository contains Open Semantic
Interchange *.osi.yaml files.
Use the reusable workflows together when one repository contains multiple metadata sources. See examples/workflows/dbt-looker.yml and examples/workflows/dbt-looker-osi.yml.
Once AGENTS is populated, the
agents-schema-analyst skill lets an AI agent
answer questions about your warehouse — grounded in your real metric definitions from AGENTS.*,
not guesses. You need a snow CLI connection (run snow connection add if you don't have one).
Claude Code
curl -fsSL --create-dirs \
-o ~/.claude/skills/agents-schema-analyst/SKILL.md \
https://raw.githubusercontent.com/fivetran/agents_schema/v0.0.6/examples/skills/agents-schema-analyst/SKILL.mdThen ask: /agents-schema-analyst "What is our total MRR this month?"
Codex
curl -fsSL --create-dirs \
-o ~/.codex/skills/agents-schema-analyst/SKILL.md \
https://raw.githubusercontent.com/fivetran/agents_schema/v0.0.6/examples/skills/agents-schema-analyst/SKILL.mdThen ask: $agents-schema-analyst "What is our total MRR this month?"
Agents operating over a warehouse need context that is not captured in table schemas alone: what a table is for, who maintains it, what transformations produced it, what it costs to query, and how it relates to other tables. Today this information often lives in wikis, Slack threads, dashboards, and tribal knowledge. Agents Schema puts it in the warehouse itself, where agents can find it without leaving the query interface.
Agents Schema is a discovery layer for agents that already query your warehouse. It gives them a standard place to ask: what curated tables exist, which system published the metadata, what dbt model or LookML object backs a dataset, what OSI semantic model describes it, whether a source is stale, and who owns a data product.
The schema is self-documenting. AGENTS.ROOT tells consumers which providers
are present and explains what provider-contributed tables mean. Consumers can
start there for generic discovery, or query well-known extension tables directly
when they already know the shape they need.
Agents Schema is not a replacement for specialized systems, source-native metadata APIs, or development-time tooling. A dbt MCP server helping an agent edit a dbt repository should still use dbt source files and artifacts directly. Agents Schema is the shared, queryable metadata surface for consumers that start from the warehouse and need context about data that already exists there.
It is closest in spirit to information_schema, but extensible across many
providers. Compared with MCP servers, Agents Schema is narrower: it publishes
context inside the warehouse, while MCP servers can expose tools, actions, and
source-specific workflows.
- A workflow in your repository invokes one of this repo's workflows.
- The workflow checks out your repository and reads source metadata such as dbt artifacts, LookML files, or OSI YAML files.
- The workflow runs the
agents-schemaCLI at the pinned release tag. - The CLI writes normalized metadata into the warehouse under the
AGENTSschema. - Agents and downstream tools query
AGENTSfor context close to the data itself.
The GitHub Actions call the CLI with explicit source arguments:
agents-schema dbt --project-dir dbt_project
agents-schema looker --lookml-dir lookml
agents-schema osi --osi-dir osiThe CLI reads warehouse credentials from WAREHOUSE_CREDENTIALS.
Release tags version the whole repository: reusable workflows, actions, CLI source, examples, README, and spec.
Pin exact tags in your workflows:
uses: fivetran/agents_schema/.github/workflows/agents-schema-dbt.yml@v0.0.6To upgrade, change only the tag in the uses: line. The current release tag is
v0.0.6.
The full schema contract is in SPEC.md. Keep schema definitions and compatibility rules there; keep this README focused on installation and source-specific GitHub workflow usage.
