epolewski / EricLLM

A fast batching API to serve LLM models

☆185

Alternatives and similar repositories for EricLLM

Users who are interested in EricLLM are comparing it to the libraries listed below.

itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆215 Updated 10 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆184 Updated last year
galatolofederico / microchain
Function calling-based LLM agents
☆287 Updated 10 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆81 Updated 2 months ago
nath1295 / LLMFlex
A Python package for developing AI applications with local LLMs.
☆151 Updated 7 months ago
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222 Updated last year
remichu-ai / gallama
☆132 Updated 3 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239 Updated last year
chigkim / Ollama-MMLU-Pro
☆95 Updated 7 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175 Updated last year
the-crypt-keeper / LLooM
Experimental LLM Inference UX to aid in creative writing
☆119 Updated 7 months ago
jeffrey-fong / Invoker
The one who calls upon functions - Function-Calling Language Model
☆36 Updated last year
noco-ai / spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
☆162 Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67 Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago
rombodawg / Easy_training
☆49 Updated 5 months ago
latent-variable / Real_time_fallacy_detection
Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral
☆115 Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes…
☆146 Updated last year
abgulati / kosmos-2_5-containerized
Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…
☆62 Updated last year
matteoserva / GraphLLM
☆207 Updated 2 weeks ago
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆158 Updated last year
teknium1 / ShareGPT-Builder
☆115 Updated 7 months ago
AI-Commandos / LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆310 Updated last year
Fus3n / TwoAI
A simple experiment on letting two local LLMs have a conversation about anything!
☆110 Updated last year
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆141 Updated 2 months ago
ortegaalfredo / neurochat
Native GUI for several AI services plus llama.cpp local AIs.
☆116 Updated last year
Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs), allowing users to chat with LLM …
☆579 Updated 5 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆81 Updated last year
intentee / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…
☆70 Updated 11 months ago
avarayr / suaveui
Open source LLM UI, compatible with all local LLM providers.
☆175 Updated 10 months ago