Preemo-Inc / text-generation-inferenceLinks

☆199

Alternatives and similar repositories for text-generation-inference

Users that are interested in text-generation-inference are comparing it to the libraries listed below

Sorting:

hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
arcee-ai / DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
☆325Updated 8 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆238Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142Updated 3 weeks ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 9 months ago
sabetAI / BLoRA
batched loras
☆344Updated last year
abacaj / train-with-fsdp
☆93Updated last year
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
jondurbin / bagel
A bagel, with everything.
☆323Updated last year
huggingface / data-is-better-together
Let's build better datasets, together!
☆260Updated 7 months ago
AnswerDotAI / fastdata
☆154Updated 8 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
QuixiAI / spectrum
☆128Updated 3 months ago
LLukas22 / llm-rs-python
Unofficial python bindings for the rust llm library. 🐍❤️🦀
☆75Updated last year
redotvideo / pluto
Synthetic Data for LLM Fine-Tuning
☆120Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆187Updated last year
bigcode-project / jupytercoder
☆141Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆206Updated 11 months ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
closedai-project / closedai
Drop in replacement for OpenAI, but with Open models.
☆152Updated 2 years ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175Updated last year