🎤 Convonet Voice AI Productivity System

Enterprise-grade voice AI platform with multi-LLM support (Claude, Gemini, OpenAI), LiveKit WebRTC voice, domain-specific agents (Productivity, Mortgage, Healthcare), SuiteCRM integration (contacts, cases, appointments, notes), team collaboration, and intelligent call center integration with transfer context.

Try Voice Assistant View Technical Specification View on GitHub
Flask LangGraph MCP (38 Tools) Claude · Gemini · OpenAI ElevenLabs TTS Deepgram STT/TTS Cartesia TTS Twilio Voice LiveKit WebRTC Redis Sentry FusionPBX Composio SuiteCRM

LiveKit WebRTC Voice

Low-latency browser voice with LiveKit, Deepgram/Cartesia streaming STT, ElevenLabs/Deepgram/Cartesia TTS, streaming TTS, and domain-specific agents (Productivity, Mortgage, Healthcare).

Team Collaboration

Multi-tenant team management with role-based access control, shared todos, and real-time collaboration features.

Domain-Specific Agents

Productivity (todos, calendar, reminders), Mortgage (applications, DTI, documents), Healthcare with SuiteCRM. Sticky context and intelligent AI-to-human transfer via Twilio/FusionPBX.

SuiteCRM Integration

Healthcare agent creates and links Contacts, Cases, Meetings (appointments), and Notes in SuiteCRM. On transfer to the call center (e.g. extension 2001), agents see full context: patient ID, case ID, appointment ID, and call summary.

38 MCP Tools

Todos, calendar, teams, reminders, mortgage tools, healthcare + SuiteCRM (patient lookup, book appointment, log case, save call summary), call transfer. Works with Claude, Gemini, and OpenAI.

Sentry Monitoring

Production-grade error tracking, performance monitoring, and automatic thread reset recovery for reliability.

Agent Monitor

Monitor LLM interactions with voice response timing (T0→buffer→STT→agent→first audio), per-tool elapsed time, and provider/domain filtering.

Tool Execution GUI

Monitor and troubleshoot tool call executions with real-time visualization and detailed analytics.

✨ Voice AI Integration

ElevenLabs, Deepgram, and Cartesia TTS with streaming support for low-latency, natural voice responses

Emotional Voice Responses

AI detects your emotional state and responds with matching voice tone - happy, calm, empathetic, or professional.

Multi-Language Support

Automatic language detection and native-accent responses in 29+ languages including Korean, Japanese, Spanish, and more.

Voice Cloning

Clone your voice in under 1 minute. Personalize the assistant's voice per user or team for a unique experience.

Voice Preferences

Customize voice settings per user: voice selection, language, emotion sensitivity, and speaking style preferences.

Real-Time Streaming

Low-latency voice generation with natural conversation flow. Responses start speaking immediately as they're generated.

Robust Fallback

Automatic fallback to Deepgram TTS if ElevenLabs is unavailable, ensuring reliability and continuous service.

Try Voice Assistant Demo

🤖 Select LLM Provider

Choose your preferred AI language model for the assistant

Loading providers...

🎤 Speech-to-Text (STT)

Select hearing provider

Loading...

🗣️ Text-to-Speech (TTS)

Select voice provider

Loading...

Quick Access

Voice Assistant Agent Monitor Mortgage Dashboard Call Center (SuiteCRM context) System Architecture Diagram Sequence Diagram (52 Steps) Technical Spec Tool Execution

© 2024 Voice AI. All rights reserved.