scosman / voicebox
Exploration: using technology to aid people who lack both the ability to speak and fine motor control.
☆12Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for voicebox
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆14Updated 8 months ago
- Github repo for Peifeng's internship project☆12Updated last year
- Rust bindings for CTranslate2☆13Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆11Updated 9 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆22Updated this week
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆25Updated last week
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆12Updated 2 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆12Updated last week
- ☆21Updated 5 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated 3 weeks ago
- LLama implementations benchmarking framework☆12Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆45Updated 3 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated last week
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆46Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆33Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆11Updated last month
- Cog wrapper for collabora/WhisperSpeech☆25Updated 8 months ago
- emoji_finder☆15Updated last month
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆28Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 9 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated last week
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated last week
- Image Generation API Server - Similar to https://text-generator.io but for images☆47Updated 2 months ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated last year
- Generate Stunning Images and Craft Visual Stories for your Brand☆12Updated 3 weeks ago