Leikoe / torch_to_ggmlLinks
convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible
☆15Updated last year
Alternatives and similar repositories for torch_to_ggml
Users that are interested in torch_to_ggml are comparing it to the libraries listed below
Sorting:
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
 - an auto-sleeping and -waking framework around llama.cpp☆12Updated 8 months ago
 - PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆30Updated last year
 - AirLLM 70B inference with single 4GB GPU☆14Updated 4 months ago
 - Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
 - Experimental sampler to make LLMs more creative☆31Updated 2 years ago
 - A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
 - Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 7 months ago
 - Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 7 months ago
 - A QT GUI for large language models☆39Updated last year
 - "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated last year
 - Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆22Updated last year
 - Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆28Updated last year
 - A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆44Updated last year
 - Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 7 months ago
 - Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
 - A lightweight Python library for running TTS models with a unified API.☆21Updated 8 months ago
 - Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
 - Public reports detailing responses to sets of prompts by Large Language Models.☆31Updated 10 months ago
 - Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆37Updated 2 years ago
 - Yet Another (LLM) Web UI, made with Gemini☆12Updated 10 months ago
 - Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Updated 2 years ago
 - Locally running LLM with internet access☆97Updated 4 months ago
 - Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆43Updated last week
 - BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
 - Python package wrapping llama.cpp for on-device LLM inference☆92Updated 3 weeks ago
 - A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆33Updated 8 months ago
 - Training Models Daily☆16Updated last year
 - B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated last year
 - A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆63Updated 2 years ago