I built a way to test Qwen3-TTS and Qwen3-ASR locally on your laptop

February 1, 2026

/u/zinyando has built a way to run Qwen3's TTS and ASR models directly on a laptop, so voice synthesis and speech recognition can be tested locally without relying on cloud services. According to the post, the setup supports Qwen3-TTS models from 0.6B to 1.7B alongside ASR models, and it can clone a voice from a reference audio clip or design a custom voice from a text description. It includes a modern React UI and uses MLX with Metal GPU acceleration on M1/M2/M3 chips. Both Docker and native deployment are available. The author notes that it currently works best in local dev mode.

Supports Qwen3-TTS models (0.6B-1.7B) and ASR models. Docker + native deployment options.

Key features:

🎭 Voice cloning with reference audio
🎨 Custom voice design from text descriptions
⚡ MLX + Metal GPU acceleration for M1/M2/M3
🎨 Modern React UI included
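
Since the MLX + Metal path targets M1/M2/M3 chips, a quick check like the one below can tell you whether your machine qualifies before you try it. This is only a minimal sketch and not part of the project: the helper name `is_apple_silicon` is hypothetical, and it relies solely on Python's standard `platform` module.

```python
import platform
from typing import Optional


def is_apple_silicon(system: Optional[str] = None,
                     machine: Optional[str] = None) -> bool:
    """Return True when the host looks like an Apple Silicon Mac.

    Hypothetical helper for illustration. The arguments exist only so the
    check can be exercised without the matching hardware; by default the
    values come from the running interpreter.
    """
    system = platform.system() if system is None else system
    machine = platform.machine() if machine is None else machine
    # macOS reports "Darwin"; a native arm64 Python on M-series chips
    # reports "arm64". An x86_64 Python running under Rosetta reports
    # "x86_64" and is treated as not qualifying here.
    return system == "Darwin" and machine == "arm64"


if __name__ == "__main__":
    if is_apple_silicon():
        print("Apple Silicon detected: the MLX + Metal path may apply.")
    else:
        print("No native Apple Silicon Python detected: "
              "the MLX + Metal path likely does not apply here.")
```

The check is deliberately conservative: it only confirms the platform, not that MLX itself is installed or that the models will fit in memory.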

If you like local audio models, give it a try. Works best in local dev mode for now.

submitted by /u/zinyando
