bodaay / HuggingFaceModelDownloaderLinks

Simple go utility to download HuggingFace Models and Datasets

☆759

Alternatives and similar repositories for HuggingFaceModelDownloader

Users that are interested in HuggingFaceModelDownloader are comparing it to the libraries listed below

Sorting:

turboderp-org / exui
Web UI for ExLlamaV2
☆514Updated 9 months ago
theroyallab / tabbyAPI
The official API server for Exllama. OAI compatible, lightweight, and fast.
☆1,090Updated this week
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆265Updated 8 months ago
aphrodite-engine / aphrodite-engine
Large-scale LLM inference engine
☆1,591Updated this week
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆165Updated last year
Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …
☆606Updated 9 months ago
mamei16 / LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web
☆268Updated this week
oobabooga / text-generation-webui-extensions
☆668Updated 3 weeks ago
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆216Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆214Updated 3 weeks ago
AI-Commandos / LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆315Updated last year
lmg-anon / mikupad
LLM Frontend in a single html file
☆663Updated last week
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆571Updated last week
AndrewVeee / nucleo-ai
An AI assistant beyond the chat box.
☆328Updated last year
ParisNeo / ollama_proxy_server
A proxy server for multiple ollama instances with Key security
☆527Updated last week
turboderp / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,905Updated 2 years ago
jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆97Updated last year
QuixiAI / dolphin-system-messages
Dolphin System Messages
☆363Updated 9 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆188Updated last year
zenoverflow / omnichain
Efficient visual programming for AI language models
☆362Updated 6 months ago
Orion-zhen / abliteration
Make abliterated models with transformers, easy and fast
☆92Updated 7 months ago
foldl / chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)
☆746Updated this week
runpod-workers / worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
☆380Updated this week
Atinoda / text-generation-webui-docker
Docker variants of oobabooga's text-generation-webui, including pre-built images.
☆440Updated 2 weeks ago
marella / ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
☆1,875Updated last year
the-crypt-keeper / can-ai-code
Self-evaluating interview for AI coders
☆597Updated 4 months ago
gpustack / gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
☆216Updated 3 months ago
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆162Updated 6 months ago
brucepro / Memoir
Memoir+ a persona memory extension for Text Gen Web UI.
☆218Updated 3 weeks ago
AlexBuz / llama-zip
LLM-powered lossless compression tool
☆290Updated last year