mbotsu / mlx_speech2text

Audio transcription using MLX Whisper and VAD silence processing

☆15

Alternatives and similar repositories for mlx_speech2text

Users who are interested in mlx_speech2text are comparing it to the repositories listed below.

gradio-app / sambanova-gradio
☆21 Updated 8 months ago
huseinzol05 / transformers-continuous-batching
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆26 Updated 4 months ago
camenduru / FluxMusic-jupyter
☆19 Updated 10 months ago
mark-lord / PromptExpander-Diffusionkit
A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…
☆26 Updated 11 months ago
Milimo-Quantum / milimochat
MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…
☆14 Updated 4 months ago
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆20 Updated 9 months ago
jabberjabberjabber / Chunkify
Create text chunks which end at natural stopping points without using a tokenizer
☆25 Updated 4 months ago
xhedit / quantkit
cli tool to quantize gguf, gptq, awq, hqq and exl2 models
☆73 Updated 7 months ago
zenforic / csm-multi
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆23 Updated 3 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆100 Updated 6 months ago
lucasnewman / e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆20 Updated 9 months ago
JakeFurtaw / Chat-RAG
Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…
☆22 Updated 2 months ago
stringandstickytape / MaxsAiStudio
A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.
☆33 Updated this week
kohya-ss / HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
☆48 Updated 7 months ago
SLIT-AI / FuseChat-3.0
☆17 Updated 3 months ago
dioneapp / dioneapp
Explore, Install, Innovate — in 1 Click.
☆27 Updated last week
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆49 Updated 5 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26 Updated 8 months ago
FishiaTee / yawullm
Yet Another (LLM) Web UI, made with Gemini
☆12 Updated 6 months ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69 Updated last year
cocktailpeanut / hallucinator
☆51 Updated 8 months ago
bdytx5 / open_answer_engine
☆22 Updated 11 months ago
asappresearch / josh-llm-simulation-training
☆31 Updated 4 months ago
curvedinf / novel-writer
Automated LLM novelist
☆47 Updated last year
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50 Updated this week
g-aggarwal / mlx-hub
A python command-line tool to download & manage MLX AI models from Hugging Face.
☆18 Updated 10 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆31 Updated 3 months ago
the-crypt-keeper / tcurtsni
Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?
☆22 Updated last year
menloresearch / ichigo-demo
☆91 Updated 2 months ago
riccardomusmeci / mlx-image
mlx image models for Apple Silicon machines
☆82 Updated 3 months ago