At Boston Code Camp 40, I presented an in-person session on building real-time voice AI systems, focusing on the shift from traditional speech pipelines to conversational, audio-native architectures. I demonstrated how legacy approaches introduce latency and disrupt natural interaction, and how modern real-time models enable streaming, low-latency, and interruptible conversations that feel more human. The […]
Memphis Agent Camp: Building Real-Time Voice AI: From Pipelines to Conversations
At Memphis Agent Camp, I delivered an online session on the evolution of voice AI from rigid, multi-step pipelines to real-time conversational systems. The talk highlighted how audio-first models enable fluid, human-like interactions by reducing latency and supporting continuous, interruptible dialogue. I walked through practical design patterns for building voice-enabled agents that integrate with enterprise […]
Building Real-Time Voice AI: From Pipelines to Conversations
I recently presented “Building Real-Time Voice AI: From Pipelines to Conversations” at the Nashua Cloud .NET User Group, where we explored how voice AI is evolving beyond rigid pipelines like Speech → Text → LLM → Text → Speech. Traditional approaches introduce latency, break conversational flow, and make interactions feel unnatural. With the rise of […]