Leikoe / torch_to_ggmlLinks

convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible

☆15

Alternatives and similar repositories for torch_to_ggml

Users that are interested in torch_to_ggml are comparing it to the libraries listed below

Sorting:

the-crypt-keeper / ggml-downloader
Simple, Fast, Parallel Huggingface GGML model downloader written in python
☆24Updated 2 years ago
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆52Updated 11 months ago
JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆29Updated last year
FishiaTee / yawullm
Yet Another (LLM) Web UI, made with Gemini
☆12Updated 11 months ago
FarFetchd / sleepyllama
an auto-sleeping and -waking framework around llama.cpp
☆12Updated 9 months ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
beyondExp / B-Llama3-o
B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.
☆26Updated last year
teleprint-me / py.gpt.prompt
PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…
☆30Updated last year
brittlewis12 / autogguf
Easily convert HuggingFace models to GGUF-format for llama.cpp
☆23Updated last year
CharlesMod / quantizeHFmodel
Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.
☆38Updated last year
Codys12 / airllm
AirLLM 70B inference with single 4GB GPU
☆14Updated 5 months ago
lucasnewman / e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆20Updated last year
huseinzol05 / transformers-continuous-batching
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆29Updated 8 months ago
teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆32Updated 10 months ago
RobViren / kokovoicelab
A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones
☆34Updated 9 months ago
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25Updated last year
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆64Updated 2 years ago
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆23Updated last year
j-webtek / LLM-Learning
A repository to store helpful information and emerging insights in regard to LLMs
☆21Updated 2 years ago
rodrigobaron / anthill
☆24Updated 10 months ago
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆21Updated 9 months ago
zenforic / csm-multi
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆25Updated 8 months ago
LAION-AI / Desktop_BUD-E
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆41Updated last year
AndreMarkert / whisper-webui
A browser interface based on the Gradio library for OpenAI's Whisper model.
☆43Updated 2 years ago
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆54Updated 9 months ago
xhedit / quantkit
cli tool to quantize gguf, gptq, awq, hqq and exl2 models
☆76Updated 11 months ago
LuciAkirami / liva
LIVA - Local Intelligent Voice Assistant
☆61Updated last year
shinomakoi / AI-Messenger
A QT GUI for large language models
☆38Updated last year
hololeo / click-n-ollamarun
Bookmarklet to pull and run hugging face GGUF models in Ollama
☆18Updated last year