
blog.ivan.digital
March 5, 2026
5 min read
64/100
Summary
NVIDIA PersonaPlex 7B enables full-duplex speech-to-speech communication on Apple Silicon, allowing simultaneous listening and speaking. The qwen3-asr-swift library processes audio in real-time, streaming generated audio chunks without a multi-step pipeline.
Key Takeaways
Community Sentiment
Positives
Concerns
Pure C, CPU-only inference with Mistral Voxtral Realtime 4B speech to text model
Feb 10, 2026
Rust implementation of Mistral's Voxtral Mini 4B Realtime runs in your browser
Feb 10, 2026

Microsoft VibeVoice: Open-Source Frontier Voice AI
Apr 28, 2026

TurboQuant KV Compression and SSD Expert Streaming for M5 Pro and IOS
Apr 1, 2026

Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration
Feb 27, 2026