talk-data.com talk-data.com

Josh Beemster

Speaker

Josh Beemster

1

talks

Technical Operations Lead Snowplow

As an SRE, Delivery Lead, and Engineering Coach, I specialize in building scalable, resilient infrastructure for big data pipelines. My current focus is managing data pipelines and infrastructure across multiple clouds using Terraform, Kubernetes, and Helm, enabling cross-cloud and on-premise behavioral data pipelines with minimal code duplication.

I’m passionate about IaC and the HashiStack (Terraform/OpenTofu, Consul, Nomad, Vault) and deeply skilled in AWS and GCP (Azure is next on my list). As a polyglot software developer, I’ve worked with many languages, though GoLang is my current favorite.

Bio from: Big Data LDN 2025

Filtering by: Big Data LDN 2025 ×

Filter by Event / Source

Talks & appearances

Showing 1 of 2 activities

Search activities →

AI-powered development tools are accelerating development speed across the board and analytics event implementation is no exception to this, but without appropriate usage they’re very capable of creating organizational chaos. Same company, same prompt, completely different schemas—data teams can’t analyze what should be identical events across platforms.

The infrastructure assumptions that worked when developers shipped tracking changes in sprint cycles or quarters are breaking when they ship them multiple times per day. Schema inconsistency, cost surprises from experimental traffic, and trust erosion in AI-generated code are becoming the new normal.

Josh will demonstrate how Snowplow’s MCP (Model Context Protocol) server and data-structure toolchains enable teams to harness AI development speed while maintaining data quality and architectural consistency. Using Snowplow’s production approach of AI-powered design paired with deterministic implementation, teams get rapid iteration without the hallucination bugs that plague direct AI code generation.

Key Takeaways:

• How AI development acceleration is fragmenting analytics schemas within organizations

• Architectural patterns that separate AI creativity from production reliability

• Real-world implementation using MCP, Data Products, and deterministic code generation