dbt node level alerting
Setup
Metaplane leverages the meta
key of a dbt exposure
named metaplane_alerts to provide flexible and powerful alerting capabilities for your dbt models and tests.
Here's a simple top-level catch-all rule you can add to your schema.yml file:
exposures:
- name: metaplane_alerts # don't change, this is how we find the top level exposure
type: application
owner:
name: metaplane
meta:
metaplane:
alerting:
rules:
- name: All failures
description: catch all node level failures
includeWarnings: true # this defaults to true for all rule
destinations:
- type: slack_channel
name: data-engineering
- type: email
emailAddress: 'alerts@data_team.com'
This rule will send notifications for all model and test failures and warnings in a specific dbt run to both the specified Slack channel and the listed email address. To exclude a rule notifying for warnings, you can set includeWarnings: false
.
dbt cloud users can specify a list of ignoredDbtCloudEnvironments
, which will prevent us from sending alerts for any job on the ignored list. For example, this could be used to ignore alerts for all development/CI jobs.
The alerting rules also allow you to target alerts based on node-level properties. For example, you can create a rule to notify only about failures related to "critical" models:
exposures:
- name: metaplane_alerts
type: application
owner:
name: metaplane
meta:
metaplane:
alerting:
rules:
- name: critical models
description: failures related to critical models
# optional field to disable to rule for specific dbt cloud environments
ignoredDbtCloudEnvironments:
- "Staging"
filters:
- type: TAG
tags:
- critical
- type: WAREHOUSE_LOCATION
schema: GOLD
destinations:
- type: slack_channel
name: high_priority
In this example, the rule will send alerts to the "high_priority" Slack channel whenever a model or test fails and the model is tagged as "critical" or lives in the "GOLD" schema.
Beyond top-level rules, you can also add alerting destinations directly to individual models:
models:
- name: stg_orders
meta:
metaplane:
alerting:
rules:
- name: source_test_sfdc
destinations:
- type: slack_channel
name: data-engineering
Specifying a node-level rule will send an alert to all the specified destinations if that node has any failures. If a model is tagged, any child tests will inherit the alert destinations.
You can have as many of these alerting rules as you want, and they can each be as broad or specific as you need. The available filters include:
- name: critical infra
description: failures related to critical infra
filters:
- type: TAG
tags:
- critical
- p0
- type: WAREHOUSE_LOCATION
database: ANALYTICS # these can be regex
schema: PROD
- type: OWNER
ownerName: todd
And the available alert destination types are:
destinations:
- type: slack_channel
name: data-engineering
- type: email
emailAddress: 'alerts@data_team.com'
Updated 7 days ago