Quickstart for the dbt Cloud Semantic Layer and Snowflake

Updated

Semantic Layer

Snowflake

dbt Cloud

Quickstart

Intermediate

Introduction

The dbt Semantic Layer, powered by MetricFlow, simplifies the setup of key business metrics. It centralizes definitions, avoids duplicate code, and ensures easy access to metrics in downstream tools. MetricFlow helps manage company metrics easier, allowing you to define metrics in your dbt project and query them in dbt Cloud with MetricFlow commands.

📹 Learn about the dbt Semantic Layer with on-demand video courses!

Explore our dbt Semantic Layer on-demand course to learn how to define and query metrics in your dbt project.

Additionally, dive into mini-courses for querying the dbt Semantic Layer in your favorite tools: Tableau, Excel, Hex, and Mode.

This quickstart guide is designed for dbt Cloud users using Snowflake as their data platform. It focuses on building and defining metrics, setting up the dbt Semantic Layer in a dbt Cloud project, and querying metrics in Google Sheets.

If you're on different data platforms, you can also follow this guide and will need to modify the setup for the specific platform. See the users on different platforms section for more information.

Prerequisites

You need a dbt Cloud Trial, Team, or Enterprise account for all deployments. Contact your representative for Single-tenant setup; otherwise, create an account using this guide.
Have the correct dbt Cloud license and permissions based on your plan:
More info on license and permissions
- Enterprise — Developer license with Account Admin permissions. Or "Owner" with a Developer license, assigned Project Creator, Database Admin, or Admin permissions.
- Team — "Owner" access with a Developer license.
- Trial — Automatic "Owner" access under a Team plan trial.
Create a trial Snowflake account:
- Select the Enterprise Snowflake edition with ACCOUNTADMIN access. Consider organizational questions when choosing a cloud provider, and refer to Snowflake's Introduction to Cloud Platforms.
- Select a cloud provider and region. All cloud providers and regions will work so choose whichever you prefer.
Basic understanding of SQL and dbt. For example, you've used dbt before or have completed the dbt Fundamentals course.

For users on different data platforms

If you're using a data platform other than Snowflake, this guide is also applicable to you. You can adapt the setup for your specific platform by following the account setup and data loading instructions detailed in the following tabs for each respective platform.

The rest of this guide applies universally across all supported platforms, ensuring you can fully leverage the dbt Semantic Layer.

BigQuery
Databricks
Microsoft Fabric
Redshift
Starburst Galaxy

Open a new tab and follow these quick steps for account setup and data loading instructions:

Create new Snowflake worksheet and set up environment

Log in to your trial Snowflake account.
In the Snowflake user interface (UI), click + Worksheet in the upper right corner.
Select SQL Worksheet to create a new worksheet.

Set up Snowflake environment

The data used here is stored as CSV files in a public S3 bucket and the following steps will guide you through how to prepare your Snowflake account for that data and upload it.

Create a new virtual warehouse, two new databases (one for raw data, the other for future dbt development), and two new schemas (one for jaffle_shop data, the other for stripe data).

Run the following SQL commands one by one by typing them into the Editor of your new Snowflake SQL worksheet to set up your environment.
Click Run in the upper right corner of the UI for each one:

-- Create a virtual warehouse named 'transforming'
create warehouse transforming;

-- Create two databases: one for raw data and another for analytics
create database raw;
create database analytics;

-- Within the 'raw' database, create two schemas: 'jaffle_shop' and 'stripe'
create schema raw.jaffle_shop;
create schema raw.stripe;

Load data into Snowflake

Now that your environment is set up, you can start loading data into it. You will be working within the raw database, using the jaffle_shop and stripe schemas to organize your tables.

Create customer table. First, delete all contents (empty) in the Editor of the Snowflake worksheet. Then, run this SQL command to create the customer table in the jaffle_shop schema:

create table raw.jaffle_shop.customers
( id integer,
  first_name varchar,
  last_name varchar
);

You should see a ‘Table CUSTOMERS successfully created.’ message.

Load data. After creating the table, delete all contents in the Editor. Run this command to load data from the S3 bucket into the customer table:

copy into raw.jaffle_shop.customers (id, first_name, last_name)
from 's3://dbt-tutorial-public/jaffle_shop_customers.csv'
file_format = (
    type = 'CSV'
    field_delimiter = ','
    skip_header = 1
    );