About HoneyHive
At HoneyHive, we are building the new observability and testing stack for AI agents. Our product enables leading AI startups and Fortune 100 enterprises to build and deploy reliable AI agents with integrated observability, testing, and experimentation tools. AI engineers use our platform to debug AI agents, evaluate output quality, troubleshoot LLM failures in production, and a whole lot more.
We’re backed by leading VCs and founders/CEOs. Our founding team includes AI, data, and systems experts who’ve shipped petabyte-scale systems at Microsoft, Amazon, JP Morgan, and more.
About the role
As a Software Engineer at HoneyHive, you'll be at the forefront of shaping how companies build and deploy Generative AI applications. Our end-users are sophisticated AI engineers at high-growth companies – and you'll be building the infrastructure they rely on daily.
We’re a small team, which means you'll have tremendous impact and ownership. You'll work directly with our founders and have the freedom to explore your own ideas.
In this role, you will
Design and implement high-throughput, real-time event processing pipelines that power our core analytics service
Work with distributed databases like ClickHouse to handle large-scale data processing
Own the development and optimization of our microservices architecture running on Kubernetes
Build and maintain deployment systems for seamless VPC integration with client cloud environments
Work with the engineering team to shape our foundational architecture and tech stack choices as we scale
Collaborate closely with the founders to shape our product roadmap, and represent HoneyHive at industry events and hackathons
Our stack
We use React, Express, and Next.js for web development, AWS for all our cloud needs, and ClickHouse for data storage, and we are building SDKs in TypeScript and Python.
About you
We think you'd be a great fit if you have:
5+ years of experience as an infrastructure software engineer or SRE.
Deep expertise in distributed systems and event processing at scale. You've designed and implemented systems that handle millions of events per second and understand the tradeoffs involved. Experience with NATS, Kafka, or similar streaming platforms is a must. Experience with high-throughput data systems and columnar databases like ClickHouse is highly relevant.
Strong systems programming background with experience in multiple languages, ideally C++, Go, or Rust. You should be comfortable jumping between different languages and ecosystems.
Track record of building developer-facing products. You understand what makes APIs and SDKs delightful to use and can champion developer experience.
Experience operating production systems in cloud environments. You're comfortable with modern infrastructure (Kubernetes, Terraform, AWS/GCP/Azure) but focus on principles rather than specific tools.
Natural curiosity and bias for action. You enjoy diving into complex problems and aren't afraid to explore new approaches.
What sets great candidates apart:
Experience with real-time analytics systems and time-series data processing
Familiarity with high-throughput data ingestion systems
Previous experience at early-stage startups, particularly in data or AI/ML
Familiarity with core AI engineering concepts like prompt engineering, RAG, evals, etc.
Why join
You'll be an early employee at a well-funded startup in one of tech's hottest spaces.
You'll build critical infrastructure that engineers at cutting-edge AI companies will rely on daily.
You'll learn the ins and outs of startup building – perfect if you want to build your own company someday.
You'll have direct access to founders and the freedom to explore your own ideas.
Benefits
Competitive salary + meaningful equity
Health, vision, and dental benefits
Unlimited PTO
Assistance in relocating to NYC or SF
MacBook Pro + peripherals