How Spotify Streamlined Large-Scale Dataset Migrations with Background Coding Agents

Introduction

Migrating thousands of datasets across a complex data infrastructure is no small feat. At Spotify, ensuring that downstream consumers (the teams and services that rely on these datasets) experience minimal disruption is a top priority. In this article, we explore how Spotify Engineering used three key tools, Honk, Backstage, and Fleet Management, to automate large-scale dataset migrations and ease their pain. By running background coding agents, the team reduced both the manual overhead and the downtime of the migration process.

Source: engineering.atspotify.com

The Challenge: Migrating Thousands of Datasets

Spotify’s data platform serves hundreds of internal teams, each depending on datasets that feed dashboards, machine learning models, and real-time features. Over time, schema changes, storage optimizations, or platform upgrades require datasets to be migrated. Doing this manually for thousands of datasets is error-prone and time-consuming. Each migration must ensure that downstream consumers—the teams that query or stream the data—experience no breakage or data loss.

Pain Points with Downstream Consumers

The pain points of traditional, manual migration at this scale, from slow and error-prone hand edits to the risk of breaking downstream consumers, led Spotify to develop an automated, background-agent-based approach.

Introducing Honk: Background Coding Agents

Honk is Spotify’s system for running background coding agents—automated processes that perform code changes, data transformations, and configuration updates across repositories. For dataset migrations, Honk acts as the engine that drives the changes needed to adapt downstream consumers to the new dataset schema or location.

How Honk Works

Honk agents are triggered by migration events. They:

  1. Scan repositories that consume the dataset being migrated.
  2. Generate the necessary code or configuration changes (e.g., updating column names, changing table references).
  3. Create pull requests with the changes, ready for review by the owning team.
  4. Track progress to ensure all affected repositories are updated.

This approach removes the manual burden from data engineers and gives confidence that transformations are consistent.
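The four steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not Honk's actual implementation: the `Migration` and `ProposedChange` types, the `rewrite` and `run_migration` functions, the in-memory repository mapping, and the table and column names are all hypothetical, and a plain regular-expression rewrite stands in for whatever a real coding agent does to produce a change.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Migration:
    """Hypothetical description of one dataset migration."""
    old_table: str
    new_table: str
    column_renames: dict = field(default_factory=dict)


@dataclass
class ProposedChange:
    """Stand-in for a pull request: the repository, file, and rewritten text."""
    repo: str
    path: str
    new_text: str


def rewrite(text, migration):
    """Step 2: replace the table reference and any renamed columns in one pass."""
    names = {migration.old_table: migration.new_table, **migration.column_renames}
    # Longest names first, and only whole identifiers, so `plays` never hits `plays_v2`.
    alternatives = "|".join(re.escape(n) for n in sorted(names, key=len, reverse=True))
    pattern = re.compile(rf"(?<!\w)({alternatives})(?!\w)")
    return pattern.sub(lambda m: names[m.group(1)], text)


def run_migration(repos, migration):
    """Steps 1, 3 and 4 over an in-memory {repo: {path: text}} mapping."""
    changes = []
    status = {}
    for repo, files in repos.items():
        status[repo] = "not_affected"
        for path, text in files.items():
            # Step 1: scan for files that mention the old table.
            if migration.old_table not in text:
                continue
            new_text = rewrite(text, migration)
            if new_text != text:
                # Step 3: a real system would open a pull request here.
                changes.append(ProposedChange(repo, path, new_text))
                # Step 4: record progress per repository.
                status[repo] = "change_proposed"
    return changes, status


# Hypothetical repositories and migration, for illustration only.
repos = {
    "playlist-dashboard": {"queries/top_tracks.sql": "SELECT track_id, ms FROM analytics.plays"},
    "ads-reporting": {"job.sql": "SELECT * FROM ads.impressions"},
}
migration = Migration("analytics.plays", "analytics.plays_v2", {"ms": "duration_ms"})
changes, status = run_migration(repos, migration)
for change in changes:
    print(change.repo, change.path, "->", change.new_text)
```

Applying all renames in a single regular-expression pass avoids chained replacements, where the output of one rename would accidentally match another. Text-level rewriting is the simplest possible stand-in; real consumers may need changes that only a code-aware agent can make.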

Leveraging Backstage for Self-Service Visibility

While Honk performs the heavy lifting, Backstage, Spotify's developer portal, provides the interface through which each team can manage and monitor the migrations that affect it.

This self-service model empowers teams to stay informed without constant manual coordination. Backstage also serves as the single source of truth for dataset ownership and dependencies, which Honk uses to determine which agents to run.
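To make the ownership-and-dependency lookup concrete, here is a small sketch of how a catalog of that kind could be queried to find every consumer affected by a migration. It assumes an in-memory snapshot rather than the real Backstage API: the `CATALOG` mapping, the entity names, the team names, and the `downstream_consumers` function are hypothetical, and following dependencies transitively is an assumption the article does not confirm.

```python
from collections import deque

# Hypothetical catalog snapshot: entity -> (owning team, entities it depends on).
CATALOG = {
    "dataset:plays": ("data-platform", []),
    "dataset:daily-plays": ("insights", ["dataset:plays"]),
    "service:wrapped-api": ("wrapped", ["dataset:daily-plays"]),
    "dataset:impressions": ("ads", []),
    "service:ads-reporting": ("ads", ["dataset:impressions"]),
}


def downstream_consumers(catalog, dataset):
    """Return {entity: owner} for everything that depends on `dataset`,
    directly or through other entities (breadth-first search)."""
    # Invert the "depends on" edges so we can walk from a dataset to its consumers.
    dependents = {}
    for entity, (_owner, deps) in catalog.items():
        for dep in deps:
            dependents.setdefault(dep, []).append(entity)

    found = {}
    queue = deque([dataset])
    while queue:
        current = queue.popleft()
        for consumer in dependents.get(current, []):
            if consumer not in found:
                found[consumer] = catalog[consumer][0]
                queue.append(consumer)
    return found


affected = downstream_consumers(CATALOG, "dataset:plays")
for entity, owner in affected.items():
    print(f"{entity} (owned by {owner}) needs an agent run")
```

The owner attached to each affected entity is what would let a proposed change be routed to the right team for review.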

Fleet Management for Agent Orchestration

Running hundreds of concurrent agents across thousands of repositories requires robust orchestration. Spotify uses Fleet Management to schedule, scale, and monitor Honk agents.

Fleet Management ensures that migrations proceed efficiently without overwhelming the underlying infrastructure.
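One common way to keep a large batch of agent runs from overwhelming shared infrastructure is to cap how many run at once. The sketch below shows that idea with `asyncio.Semaphore` from the Python standard library. It is an illustration of bounded concurrency in general, not a description of how Fleet Management works: `run_fleet`, `fake_agent`, the repository names, and the limit of three are all hypothetical.

```python
import asyncio


async def run_fleet(repos, agent, max_concurrent=3):
    """Run `agent(repo)` for every repo, never more than `max_concurrent` at once.

    Returns the per-repo outcome and the highest concurrency actually observed.
    """
    limit = asyncio.Semaphore(max_concurrent)
    results = {}
    running = 0
    peak = 0

    async def worker(repo):
        nonlocal running, peak
        async with limit:  # waits here while the cap is reached
            running += 1
            peak = max(peak, running)
            try:
                await agent(repo)
                results[repo] = "succeeded"
            except Exception as exc:
                # A failed run is recorded instead of aborting the whole fleet.
                results[repo] = f"failed: {exc}"
            finally:
                running -= 1

    await asyncio.gather(*(worker(repo) for repo in repos))
    return results, peak


async def fake_agent(repo):
    """Placeholder for real agent work; one repository is made to fail."""
    await asyncio.sleep(0.01)
    if repo == "repo-4":
        raise RuntimeError("merge conflict")


repos = [f"repo-{i}" for i in range(10)]
results, peak = asyncio.run(run_fleet(repos, fake_agent, max_concurrent=3))
print(f"peak concurrency: {peak}")
for repo, outcome in results.items():
    print(repo, outcome)
```

Recording failures per repository, rather than letting one exception cancel the batch, gives the monitoring side something to report and retry.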

Results and Benefits

The combination of Honk, Backstage, and Fleet Management has transformed dataset migrations at Spotify. Across thousands of datasets, the system has saved countless hours of manual work and helped prevent production incidents.

Conclusion

Migrating datasets at scale is a classic infrastructure challenge. By building background coding agents with Honk, integrating them into Backstage for visibility, and orchestrating with Fleet Management, Spotify Engineering turned a painful manual process into an automated, reliable pipeline. This approach not only speeds up migrations but also builds trust with downstream consumers—a win for platform reliability and developer productivity.

Want to learn more? Explore other posts in the Honk series on the Spotify Engineering blog.
