Blaizzy / mlx-audio Public

Notifications You must be signed in to change notification settings
Fork 625
Star 7.3k

Code
Issues 72
Pull requests 17
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: Blaizzy/mlx-audio

Labels 11 Milestones 0

New pull request New

17 Open 480 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix SineGen length alignment

#785 opened Jun 7, 2026 by kanihal

Loading…

feat(stt): implement segment batching for Qwen3-ASR (--max-parallel-segments)

#783 opened Jun 7, 2026 by hhh2210

Loading…

2 of 3 tasks

feat(stt): add --max-line-length to split long srt/vtt cues

#781 opened Jun 6, 2026 by hhh2210

Loading…

2 of 3 tasks

feat: add Stable Audio 3 text-to-audio model

#746 opened May 26, 2026 by shreyaskarnik Contributor • Draft

8 tasks done

Add MiMo-V2.5-ASR STT support

#719 opened May 12, 2026 by ailuntx

Loading…

Whisper: fall back to canonical openai/whisper-* processor when mlx-community repos lack one

#712 opened May 6, 2026 by contrapuntal Contributor • Draft

feat (stt): Webserver: Diarization, export, Voxtral 3B Mini, larger audio files and enhanced model/language select

#643 opened Apr 10, 2026 by Gotanius

Loading…

feat: Add Vibevoice7B model support

#640 opened Apr 7, 2026 by Talpik

Loading…

docs: add contributing guide, model porting guide, and code of conduct

#632 opened Apr 4, 2026 by beshkenadze Contributor • Draft

"feat(ui): add voice selector for Kokoro TTS

#602 opened Mar 24, 2026 by mh03r932

Loading…

3 tasks

feat: add mlx support for IndexTTS2

#512 opened Feb 19, 2026 by 0xrushi • Draft

1 of 3 tasks

Add ACE-1.5

#499 opened Feb 15, 2026 by Blaizzy Owner • Draft

Optimize ACE-Step: NLC VAE, compiled decode, LoRA support

#498 opened Feb 15, 2026 by fspecii

Loading…

6 of 9 tasks

Add InstructTTSEval module for TTS evaluation

#490 opened Feb 8, 2026 by Blaizzy Owner • Draft

Fix VibeVoice-ASR streaming via generate() and server mapping

#483 opened Feb 5, 2026 by JM27616

Loading…

1 of 3 tasks

[WIP] Create simple quickstart instructions and command

#476 opened Feb 2, 2026 by mnoukhov Contributor • Draft

Improve Japanese TTS/STT UX and dependency handling

#260 opened Nov 8, 2025 by TechNavii

Loading…

1 of 3 tasks

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!