About the Company:
We’re a technology-first team building the next generation of AI-powered audio and video creation tools. Our proprietary text-to-speech models rank among the top performers on Hugging Face leaderboards, and our research spans voice cloning, audio processing, and video understanding.
At the core of what we build is real-time, low-latency voice technology that delivers expressive, controllable, multilingual speech. We care deeply about production reliability at scale, which means consistent voice quality, fast response times, robust streaming, and tools that are easy to integrate and trust.
Our Async Voice API enables developers to integrate real-time speech generation into applications such as voice agents, AI assistants, gaming platforms, and interactive products.
About the Team:
You’ll be joining a team working at the intersection of AI infrastructure and developer platforms, with a shared mission to make advanced voice technology accessible through reliable, well-designed APIs.
We collaborate closely across backend, infrastructure, and machine learning teams to build systems that power real-time voice applications used by developers worldwide.
About the Role:
We are looking for a Python Backend Engineer to help build and scale the Async Developer Platform and Voice API.
This role focuses on designing and implementing high-performance backend services and APIs that power our real-time voice infrastructure. You will work on systems responsible for streaming audio, handling API traffic at scale, and ensuring reliable low-latency performance for developers integrating our platform.
You will collaborate closely with the ML engineers who build our models while focusing on production infrastructure, API reliability, and developer experience.
Typical responsibilities include:
We’re looking for someone with the following skills and qualifications:
Nice to Have:
Why Async?
At Async, we believe artificial intelligence has the potential to help people solve immense creative challenges, and we want the upside of AI-powered content creation to be widely shared. Join us in shaping the future of audio and video technology.