-
-
Notifications
You must be signed in to change notification settings - Fork 625
Pull requests: Blaizzy/mlx-audio
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(stt): implement segment batching for Qwen3-ASR (--max-parallel-segments)
#783
opened Jun 7, 2026 by
hhh2210
Loading…
2 of 3 tasks
feat(stt): add --max-line-length to split long srt/vtt cues
#781
opened Jun 6, 2026 by
hhh2210
Loading…
2 of 3 tasks
feat: add Stable Audio 3 text-to-audio model
#746
opened May 26, 2026 by
shreyaskarnik
Contributor
•
Draft
8 tasks done
Whisper: fall back to canonical openai/whisper-* processor when mlx-community repos lack one
#712
opened May 6, 2026 by
contrapuntal
Contributor
•
Draft
feat (stt): Webserver: Diarization, export, Voxtral 3B Mini, larger audio files and enhanced model/language select
#643
opened Apr 10, 2026 by
Gotanius
Loading…
docs: add contributing guide, model porting guide, and code of conduct
#632
opened Apr 4, 2026 by
beshkenadze
Contributor
•
Draft
Optimize ACE-Step: NLC VAE, compiled decode, LoRA support
#498
opened Feb 15, 2026 by
fspecii
Loading…
6 of 9 tasks
Fix VibeVoice-ASR streaming via generate() and server mapping
#483
opened Feb 5, 2026 by
JM27616
Loading…
1 of 3 tasks
Improve Japanese TTS/STT UX and dependency handling
#260
opened Nov 8, 2025 by
TechNavii
Loading…
1 of 3 tasks
ProTip!
Follow long discussions with comments:>50.