Skip to content

snowplow/snowplow

Repository files navigation

Snowplow logo


As of January 8, 2024, Snowplow introduced the Snowplow Limited Use License Agreement, and we are releasing new versions of our core pipeline technology under this license. There will be no security patches made to versions of our software that pre-date January 2024.

If you are currently using the pipeline in production, or in a way that is competitive with Snowplow, these changes affect you. If you wish to use the current version of Snowplow software, please contact us to discuss a plan that works for you.

We value all of our users and remain dedicated to helping our community use Snowplow in the optimal capacity that fits their business goals and needs.

Read more about this change here.


Introduction

Welcome to Snowplow, the leader in customer data infrastructure (CDI) for AI, enabling every organization to transform raw behavioral data into governed, high-fidelity fuel for AI-powered applications—including advanced analytics, real-time personalization engines, and AI agents.

Digital-first companies like Strava, HelloFresh, Auto Trader, Burberry, and DPG Media rely on Snowplow to collect and process event-level data in real time—delivering it securely to their warehouse, lake, or stream—and to integrate deep customer context into their applications.


Why Customer Data Infrastructure (CDI)?

Snowplow lays the foundation for an organization’s advanced analytics, operational, and ML/AI use cases—including customer insights, predicting customer behaviors, hyper-personalizing customer experiences, and detecting fraud in real time.

Key benefits of Snowplow’s CDI:

  • Data depth and quality
  • Centralized data governance
  • Real-time operationalization
  • Privacy and compliance
  • AI- and BI-ready behavioral data

Why Snowplow?

  • “Glass-box” technical architecture capable of processing billions of events per day
  • Over 20 SDKs to collect data from web, mobile, server-side, and other sources
  • A unique approach based on schemas and validation ensures your data is as clean as possible
  • Over 15 enrichments to get the most out of your data
  • Stream data to your data warehouse/lakehouse or SaaS destinations of choice

Our documentation is a great place to learn more.

This repository contains the major Snowplow components as individual submodule repositories.


Community

Check out our Community for support and updates.

If you spot a bug, please raise an issue in the GitHub repository of the component in question.


Copyright and license

Copyright 2012-2025 Snowplow Analytics Ltd.

Snowplow components are licensed differently depending on their purpose. Read about our different licenses here.