Leikoe / torch_to_ggml
Convert a saved PyTorch model to GGUF and generate as much of the corresponding ggml C code as possible
☆15 · Updated 2 years ago
Alternatives and similar repositories for torch_to_ggml
Users that are interested in torch_to_ggml are comparing it to the libraries listed below
- Yet Another (LLM) Web UI, made with Gemini ☆12 · Updated 11 months ago
- AirLLM 70B inference with single 4GB GPU ☆14 · Updated 5 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 · Updated 2 years ago
- Experimental sampler to make LLMs more creative ☆31 · Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆64 · Updated 2 years ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ☆29 · Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆22 · Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 · Updated last year
- ☆27 · Updated 2 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX ☆20 · Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆23 · Updated last year
- Audio transcription using mlx whisper and vad silence processing ☆16 · Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆57 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆41 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆23 · Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆32 · Updated 11 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥 ☆57 · Updated 6 months ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t… ☆30 · Updated last year
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones ☆34 · Updated 10 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper. ☆29 · Updated 9 months ago
- ☆24 · Updated 10 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 · Updated 2 years ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice. ☆22 · Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 · Updated 3 weeks ago
- Python package wrapping llama.cpp for on-device LLM inference ☆95 · Updated 2 months ago
- A lightweight Python library for running TTS models with a unified API. ☆21 · Updated 10 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆47 · Updated last month
- An API for VoiceCraft. ☆25 · Updated last year
- run ollama & gguf easily with a single command ☆52 · Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 · Updated last year