Polymarket is the world's largest prediction market platform. We enable individuals to express views on real-world events by trading on outcomes across politics, economics, sports, culture, and current affairs. Built as a peer-to-peer marketplace with no centralized "house," Polymarket aggregates diverse opinions into transparent, market-based probabilities that reflect collective expectations about the future.
We're growing fast, both in terms of volume ($21B traded in 2025) and adoption as an alternative news source. Our ambition is to become a ubiquitous beacon of truth in global media and we need your help adding fuel to the fire.
The Data team runs the infrastructure that keeps three live exchanges operational. That means real-time pipelines, clean market data, and reliable delivery to every downstream system that depends on it. The team is small and the surface area is large. Everyone carries meaningful ownership.
This hire is about adding real depth. Right now, the team is stretched across too many systems to go as deep as the work demands. We need someone who can take full ownership of critical pipelines, not just keep them running but actively make them better, faster, and less fragile over time.
If you've built and operated high-throughput data infrastructure in production environments where failures are visible and costly, this is the role. You'll work on systems that directly affect how traders, analysts, and external consumers experience Polymarket.
Build and maintain real-time data pipelines that ingest market events across three active exchanges with high reliability and low latency.
Investigate and resolve root causes of data gaps, delivery delays, and schema breaks before they compound into larger incidents.
Ship incremental improvements to ingestion and delivery infrastructure on a continuous basis, not just when something breaks.
Monitor pipeline health, define meaningful alerts, and reduce the operational overhead of keeping systems running.
Partner with analysts and engineers to understand how data is consumed downstream and improve the quality and shape of what gets delivered.
Document systems clearly enough that another engineer can debug them at 2am without calling you.
5+ years of experience building and operating production data pipelines with real throughput and real consequences when they fail.
Strong proficiency in Python or a similar language for pipeline development, data transformation, and tooling.
Hands-on experience with stream processing systems such as Kafka, Flink, or Kinesis in a production environment.
Track record of debugging hard data reliability issues, schema drift, late-arriving events, ordering anomalies, and getting to root cause without a lot of hand-holding.
Comfort operating in resource-constrained environments where you own the full lifecycle of what you build, from design through ongoing maintenance.
Experience working with cloud data infrastructure, particularly on AWS or GCP, including storage, compute, and orchestration tooling.
(Plus) Experience in financial markets, crypto infrastructure, or other latency-sensitive trading environments.
(Plus) Familiarity with blockchain event indexing or on-chain data ingestion.
(Plus) Experience building or maintaining data contracts and schema registries across multiple producers and consumers.
Competitive salary & equity
Unlimited PTO
Full Health, Vision, & Dental coverage
401k match
Hardware setup: new MacBook Pro, big display, & accessories
Largest prediction market platform. Trade on real-world event outcomes.
View company profileEstimated based on role seniority, stage (Series B) & industry benchmarks.
You'll be redirected to the company's application page
Get roles like this daily
Join our Telegram channels for curated job alerts
Hey! Looking for your next role in Web3, AI, or Robotics? I can help.
Sign up to save jobs and access them across all your devices.