Voice AI Engineer

Take2.ai

Take2.ai

Software Engineering, Data Science
New York, NY, USA · Alrededores de Amatengo, Oaxaca, Mexico · York, NE, USA
Posted on Aug 19, 2025

About Take2 AI

Take2 AI automates phone screens with AI agents.

Take2's AI Interviewers automate the end-to-end screening process - evaluating resumes, conducting phone screens, and scheduling next round interviews with qualified candidates. Our AI Interviewers help reduce overhead costs while boosting speed-to-hire and preventing mis-hires. For candidates, this guarantees they’re never left without a response and delivers a better experience, uncovering skills that go beyond the resume.

We are a team of Stanford GSB alums, backed by SemperVirens & Reach Capital. Our advisory board consists of ex CHROs of F500 companies such as Visa, HP, Disney & Google.

Want a sneak peek of what we are building? Check out this video!

https://youtu.be/UorNA5uuQjU

About The Role

Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills.

Our real-time voice agent pipeline already powers thousands of candidate conversations every month—now we’re scaling to millions. You’ll tackle challenges in low latency, high reliability, and large-scale architecture, shaping how our platform evolves as we grow.

This role is hands-on, highly technical, and perfect for someone who thrives in a startup environment, playing a key role from design through production with significant ownership over technical execution and direction.

In Terms of Experience

Required:

  • 6+ years software engineering experience (backend or full stack)
  • Bachelor’s in Computer Science or Computer Engineering
  • Direct, hands-on experience integrating and optimizing STT, TTS, VAD, and LLMs for real-time voice agents
  • Scaled and optimized voice pipelines for low latency, high availability, and real-time performance
  • Designed and implemented agent orchestration, evaluation frameworks, and related tooling
  • Shipped production AI applications with real-time inference, integrating ML models into live systems
  • Built and operated distributed backend systems with monitoring and observability in place
  • Strong proficiency in Python, JavaScript, Node.js, AWS, Kubernetes, Docker
  • 2+ years at an early to mid-stage startup (Series B or lower)

Preferred:

  • Experience self-hosting AI/ML models for use in voice AI pipelines, including tuning and optimizing them for performance and reliability
  • Experience owning and operating distributed systems in high-throughput, streaming, low-latency environments, ensuring scalability, reliability, and performance under real-time constraints

What You’ll Do

  • Own and scale the end-to-end voice agent pipeline that already powers thousands of monthly conversations—help us grow it to millions
  • Select, integrate, and tune models for optimal latency, quality, and reliability
  • Build orchestration logic, evaluation systems, and supporting backend services
  • Ensure low latency, high availability, and scalability of voice agent infrastructure
  • Collaborate closely with product and engineering to rapidly prototype and deliver features, playing a key role in driving the vision and evolution of our platform and product with significant ownership over technical execution and direction