- Gianluca Campo
- Posts
- Attribution Masterclass: My Notes
Attribution Masterclass: My Notes
Table of Contents
The Attribution Masterclass is a series about marketing attribution organized by Timo Dechau and Barbara Galiza. I'm thrilled to share I've been in the first cohort taking the masterclass with regular meetups every Thursday in November 2024.
Here follow my notes with the most important concepts shown during the masterclass. Some of the notions would be better explained using the slides the authors have made: For that, you need to actually enroll in the Masterclass: here's the page where you can sign up.
Let me recommend you to follow Timo and Barbara on Linkedin to know more about the masterclass and in general to get interesting insights and opinions on marketing attribution.
Intro to Attribution
Attribution journeys
Typical attribution models: First Touch, Last Touch, Data-Driven
Warning: These attribution models are all click-based so some viewed ads up in the funnel will be ignored
Debunking Attribution Myths
Multi-Touch Attribution is actually not there anymore, since today it has limitations
There are no tools that can solve all your attribution problems
The best attribution method actually depends on specific needs
Attribution is a model, with strategy and operations layer
There’s not just one method and that’s it so no Single Source of Truth
Why we attribute
To understand customer journeys, key touchpoints to allocate mktg budget
To measure impact and optimize our strategy
Attribution and Business Strategy
Business strategy is the higher-level vision informing the following (e.g. grow revenue from new customers by 10%)
Marketing strategy outlines initiatives and campaigns (e.g. test new channels, involve new influencers and so on)
Attribution strategy (e.g. dimensions for campaigns with new influencers: measure impressions, discount codes, run an uplift test)
Major takeaway: When you plan your mktg strategy you should also plan your attribution stategy
Types of Attribution
Click-Based models
Last-Click, First-Click, Linear, Position-Based
Data-Driven: Comprehensive Analysis, Markov Chain, Fractional Attribution, Optimization Insights
View-Based models
You consider also if the user has viewed an ad (for instance a view-through conversion window can be set in GAds)
You can also track this by adding a pixel anywhere the user could view an ad
MMM (Mktg Mix Modeling)
Economic approach
Channel agnostic
Measuring impact
Zero-Party Data
How you did you hear about us? (HDYHAU), this simple question can make a difference
The earlier you gather Zero-Party Data the better
Customer perspective is what you get in this case
Compared to other attribution types, ROI in this case is not as easy to calculate but other data/datasets about users can help with this
Enhancing Attribution
Rule-Based Approaches, e.g. zero-party data can weight mktg channels or activities
Combination of Models, multiple form of attribution are combined
Click Prediction, data models predicting which campaign sessions have come from organic or direct clicks
Multi-Touch Attribution
UTM parameters
It’s an old topic but still one of the most important ones
We still have issues with UTMs:
Sometimes are missing
They are inconsistent
This is why you need a UTM strategy:
When you own a link remember to tag it!
You need a process
You can do it manually
But even automatically, defining rules on the 3rd party platforms
Techniques for UTM paramaters definition
Random ID in utm_campaign
Don’t use UTM parameters inside your own website
[More on this available in video recordings]
References about UTM parameters
Campaign (UTM) Parameter Naming Conventions revisited: Cryptic vs. Positional vs. Key-Value Notation | by Lukas Oldenburg | Medium by Lukas Oldenburg
How to Improve Paid Media Analysis and Performance with Naming Conventions By Barbara Galiza.
User journeys and user stitching
Simplest example of user journey: landing page > conversion
No issues with this
UTMs are probably there
Hard for cookies to be missed (but see below about this point)
SAAS example: landing page on www.* > user creates an account on app.* > user buys a subscription through Stripe
Issues:
Most marketers take care of the account creation and stop there but this still doesn’t tell what mktg initiative lead to subscriptions
This is based on IPs or cookies but we actually have no real control about them (e.g. Safari changing settings).
The solution to these issues is the use of user_id (GA), hubspot_lead_id (Hubspot), hashed emails or email domain IDs, and so on
Ways to do user stitching is storing all the IDs you have
In a Data Warehouse (DWH)
Or in a leading system, for instance you decide GA is your primary platform and get Hubspot IDs data in there
In the case of guest checkouts you can join client_id and transaction_id. In general it depends if we’re talking about user level attribution or order level attribution
How does server-side tagging fits in this?
Users using different devices are treated as separate users in client-side tracking systems
This is why server-side tagging systems can help vendors - such as Meta with Facebook CAPI - optimize their campaigns
One tricky issue with server-side tagging is how to handle legal consent.
How to analyze Multi-Touch Attribution (w/ Amplitude)
Amplitude gives the chance to connect different data sources (e.g. BigQuery, GAds and so on)
We tried the attribution models comparison with a custom table where we added a First Touch, Last Touch and Data-Driven views of the demo dataset, side by side
Unfortunately this is not available with other tools - such as GA - unless you build it on your own - with BigQuery
Amplitude gives the chance to create a free account and explore a demo dataset with custom charts.
MMM, Incrementality
Intro
Shortcomings of click-based attribution
Digital campaigns w/o clicks, e.g. video campaigns
Dark social
Cross-device attribution, since click attribution can be made only in the same session
Long sales journeys, conversions happening outside of lookback windows
Incrementality
Click incrementality
It’s not just sales or conversions, it can also be paid brand search clicks per se
This is interesting also to understand cannibalization btw Paid and Organic
There’s a study by Google showing how many clicks were incremental and how many were cannibalized to Organic Search
Here’s another case history by Barbara Galiza Measuring incremental impressions, clicks and conversions for Paid Search (Methodology + Google Sheet Template)
Conversion incrementality
There are cases where clicks are not incremental but conversions are
This tries to estimate conversions that wouldn’t have happened without a specific mktg initiative
How incrementality is measured
Holdout tests are performed turning off a campaign to all or certain audiences
Geofencing is a technique similar to the previous one but in this case campaigns are turned off based on geographical areas. GeoLift is an R package made by Meta helping with this GeoLift Walkthrough | GeoLift
Causal inference, is a statistical model measuring correlation btw campaigns and sales.
Design a test
Control targeting, be sure you can select audiences precisely
Experiment control, start and stop the experiment as needed
Conversion event tracking, it’s important to measure frequency of conversions within the test group and overall
Campaign metrics, analyze spend and impressions for both control and test group
MMM
MMM in layman’s terms way
There are a bunch of inputs and outputs
MMM tries to find the correlation btw inputs and outputs
Understanding Bayesian models
Prior knowledge, Bayesian models need historical data, for instance for daily data you need at least 1 year or even more
Data updates, the model gets updated when new data arrives
Probabilistic approach, probabilities are assigned to different outcomes based on input data
MMM and Bayesian models
Simulations: MMM runs simulations
Statistical analysis: Correlation is calculated for each simulation
Channel impact: Finally MMM isolates the impact of each mktg channel
Requirements for MMM
Date-level datasets, data granularity should be daily or weekly depending on mktg activities involved and the type of business
Mktg activity data, all media activities should be included
Target metric, of course this depends especially on the business
Historical data, at least 1 or 2 years of data are needed
Sample dataset generator by Timo Dechau https://replit.com/@TimoDechau/Marketing-Mix-Model-Playground
When to use MMM and limitations
There’s a need for extensive enough datasets
Time and understanding to fine tune MMM is important
It’s good for high-level channel decisions
Barbara recommends 6 figures per month is the minimum recommended spend for those who want to use MMM
How to approach Attribution
In this 4th session I missed some content, however I believe there’s enough to understand the topics covered by Barbara and Timo.
[missed]
Constraints
Discussion loops?
It’s my conversion
[missed]
Which model/tool do we pick?
Core metrics to measure to understand if attribution is working
Mktg spend
CAC (Customer Acquisition Cost)
Cost/revenue ratio
Revenue of new customers
Revenue VIP customers (CLV, Customer Lifetime Value)
Customer payback period
Weekly checks on the core metrics is important
You should log
Then you can experiment
incrementality kicks in here
Rule-based attribution
DWH where you collect information
You take all the insights from different models (last-click, data-driven and so on)
Decide what indications override or calibrate the others (e.g. you override last-click with post-purchase survey indications)
People
You should implement it for yourself at first
Be public about it, explaining inside the company how your tests are going
Share insights
[missed]
Tools
Marketing Analytics platform
Tools to analyze uplift in case you’re doing incrementality
Planning and documentation
Surveys and asking customers can also be crucial
Checklist
Where to start
[missed]
Understand business strategy
Make sure event tracking is OK for what you need
ID resolution an user stitching
Naming conventions an UTM parameters
One testing roadmap for the experiments you need
Post-purchase surveys
Rule-based attribution
MMM vendors (potentially)
Vouchers (Stripe gives the chance to use vouchers OOTB
In the next 4 weeks
Document the financial risk of poor attribution
What to achieve and by when
What does delay decision making?
What tools do you have in place? What are you missing?
What stakeholders should be involved?
Recommendations
Prioritize strategies impacting historical data
Use MVPs where possible, think small what you want is answers so you should focus on that
Know your abilities
Engage stakeholders needing data early, document a lot and let people understand what you’re doing: The more you share the more stakeholders are involved
Reply