Customer Data Warehouses are the New Customer Data Platform — Rittman Analytics

Mark Rittman

Mar 12, 2021

Data Engineering

BigQuery

dbt

Fivetran

Modern Data Stack

A couple of weeks ago I posted an article on this blog on Customer 360-Degree Analysis using BigQuery, dbt and new “reverse ETL” tools such as Hightouch and Rudderstack, and this article from Tejas Manohar on the Fivetran blog along with the latest episode of the Drill to Detail Podcast are great introductions to the concept of customer data warehouses as the new customer data platform.

In this article I’m going to talk a bit more about we’ve transformed our customer data warehouse into the core of our new customer data platform, adding segmentation, interest scoring and marketing audiences that we then sync to services such as Facebook Custom Audiences and Intercom.

We do this by creating a set of additional derived customer and contact tables that take data from all parts of our business, both offline and online, and use it to create a set of tables we use to drive our marketing activity.

These additional warehouse tables contain sequenced details of each client’s journey from prospect to opportunity to engaged client, derive sets of behavioral traits we use for marketing personalization and categorise each of the contacts associated with those clients into one of several value segments.

A Data Model for Customer Data Warehouse Data Platforms

The data model diagram below shows these customer data platform tables and how they relate to the customer and contact dimensions in the warehouse:

These derived tables include (links are to code in our RA Warehouse for dbt Github project, note that the specific examples in that package would need to be adapted for your particular data and requirements)

CUSTOMER_EVENTS_XA, a derived table that takes each invoice, timesheet, sales opportunity and contact interaction for a client and turns them into a sequence of events that helps us understand the stage in the customer journey they’re at as well as their profitability at any particular point in time, for example as shown in the Looker visualization below:

CONTACT_WEB_INTERESTS_XA, where we connect the page view activity recorded for each client contact when they visit our website directly or via links in marketing emails we send, tag those pages with category labels and then use when creating marketing audiences or in summary-level charts such as the one below for all contacts in-aggregate

CONTACT_SEGMENTS_XA, where we take the influencer score described in our previous post and use it together with scores for engagement and stage in the buying cycle to place each of our contacts in a value segment as shown in the next Looker visualization and with the aim of our marketing being to move those contacts into segments of higher value over time

CONTACT_AUDIENCES_XA, the most important table for marketing purposes that takes each of these interest scores, segments and other traits and records them alongside each contact email ready for syncing downstream to our marketing services.

Audience Building and Downstream Syncing of Customer Traits

Now we’ve pulled-together all of this valuable customer and contact marketing data, the next step is to sync it to our various downstream marketing and advertising technology services. For this we’ve been using a new service called Hightouch that I introduced in my previous blog, a tool that allows us to define selections of customer and contact data (or “models”) and then sync those models to various marketing and customer service applications.

Creating a data synchronization job is a three-stage process:

Defining a model, a SQL query against the customer data warehouse tables that can either compute traits itself, similar to SQL traits and computed traits in Segment Personas, or can just query a table directly if you’ve already computed those traits as part of your warehouse build.

2. Connecting to the service you want to sync data to. Hightouch connects via those services APIs and in my case I’ve added Intercom, Facebook Custom Audiences and Hubspot, plus Segment and Rudderstack’s Javascript APIs.

When you sync contact and other data to event collection services such as Segment and Rudderstack they become either user identity events or in the case of Segment, segment entrance and exit track events, or you can send transaction-type data to these services as track events, something we’ve been doing to convert Jira issue data to a series of Segment events in our system.

3. Finally you define the sync itself, which can be triggered either manually, on a schedule or after completion of a dbtCloud job run.

Interested? Find out More

If the idea of creating a customer data platform based around your existing data warehouse and being able to sync that information synced to your downstream marketing and customer applications, contact us now to organise a 100% free, no-obligation 30 minute call to discuss your data needs — we’d love to hear from you.

Customer AnalyticsData Warehousing

Mark Rittman

One Person Many Roles: Designing a Unified Person Dimension in Google BigQuery

Jan 26, 2026

Data Engineering

BigQuery

How Rittman Analytics uses AI-Augmented Project Delivery to Provide Value to Users, Faster

Jan 19, 2026

Looker

Google Cloud (GCP)

An Homage to Oracle Warehouse Builder, 25 Years Ahead of Its Time

Dec 8, 2025

Data Engineering

dbt

Looking for a partner on your data analytics journey?

RSS Feed

Subscribe via RSS

Published Year

2026

(5)

2025

(16)

2024

(22)

2023

(13)

2022

(9)

2021

(9)

2020

(11)

2019

(18)

2018

(8)

2017

(5)

2016

(15)

Customer Data Warehouses are the New Customer Data Platform — Rittman Analytics

A Data Model for Customer Data Warehouse Data Platforms

Audience Building and Downstream Syncing of Customer Traits

Interested? Find out More

Recommended Posts

One Person Many Roles: Designing a Unified Person Dimension in Google BigQuery

How Rittman Analytics uses AI-Augmented Project Delivery to Provide Value to Users, Faster

An Homage to Oracle Warehouse Builder, 25 Years Ahead of Its Time

Recent Posts

One Person Many Roles: Designing a Unified Person Dimension in Google BigQuery

Why We’ve Tried to Replace Data Analytics Developers Every Decade Since 1974

How Rittman Analytics uses AI-Augmented Project Delivery to Provide Value to Users, Faster

Rittman Analytics 2025 Wrapped : A Year of Platforms, People and High-Performing Data Teams

You Probably Don’t Need an RFP

Claude Meets Looker: Building Smarter, Connected Analytics with Google’s MCP Toolbox

From Prompts to Skills: Automating Financial and KPI Analysis in Looker with Claude Skills and MCP Toolbox

An Homage to Oracle Warehouse Builder, 25 Years Ahead of Its Time

Google Integrates Spectacles into Looker for Continuous Integration & Regression Testing

How Rittman Analytics Does Financial Analytics using Looker, Fivetran and Google BigQuery

RSS Feed

Published Year

Tag Cloud

Categories