Leikoe / torch_to_ggmlLinks
convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible
☆15Updated last year
Alternatives and similar repositories for torch_to_ggml
Users that are interested in torch_to_ggml are comparing it to the libraries listed below
Sorting:
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 11 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 11 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Updated 9 months ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated last year
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆30Updated last year
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆23Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- AirLLM 70B inference with single 4GB GPU☆14Updated 5 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 8 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated 10 months ago
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆34Updated 9 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- A repository to store helpful information and emerging insights in regard to LLMs☆21Updated 2 years ago
- ☆24Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 8 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆41Updated last year
- A browser interface based on the Gradio library for OpenAI's Whisper model.☆43Updated 2 years ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆54Updated 9 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆76Updated 11 months ago
- LIVA - Local Intelligent Voice Assistant☆61Updated last year
- A QT GUI for large language models☆38Updated last year
- Bookmarklet to pull and run hugging face GGUF models in Ollama☆18Updated last year