π€
Streaming Pipeline
Mic β Silero VAD β Whisper Turbo β Qwen β Fish / Kokoro β Speaker. Each stage streams so audio plays while the LLM is still generating.
Whisper STT + vLLM + Fish Audio / Kokoro TTS on Pipecat β sub-200ms TTFA, speak-while-thinking, push-interrupt.
This site follows the DiΓ‘taxis framework:
| Section | Purpose | Start here if you⦠|
|---|---|---|
| Tutorials | Learning-oriented walkthroughs | Are new to protoVoice |
| How-To Guides | Task-oriented procedures | Need to accomplish something specific |
| Reference | Technical descriptions | Need exact details on an API, env var, or frame type |
| Explanation | Understanding-oriented discussion | Want to understand how and why things work |