abacaj / transformers

Understanding large language models

☆117

Alternatives and similar repositories for transformers

Users who are interested in transformers are comparing it to the libraries listed below.

Narsil / fast_gpt2
☆156 Updated 2 years ago
abacaj / train-with-fsdp
☆93 Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
AblateIt / finetune-study
Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full finetunes.
☆82 Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94 Updated last year
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆100 Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
teknium1 / transformers-gptq-quant
☆47 Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119 Updated 2 years ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆101 Updated last year
VikParuchuri / libgen_to_txt
Convert all of libgen to high quality markdown
☆253 Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70 Updated last year
BerriAI / instructprompt
☆107 Updated 2 years ago
567-labs / fastllm
A collection of LLM services you can self host via docker or modal labs to support your applications development
☆192 Updated last year
CarperAI / treasure_trove
☆22 Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232 Updated 9 months ago
keerthanpg / DadJokeGenerator
☆46 Updated 2 years ago
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101 Updated last year
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆197 Updated last year
RobertRiachi / nanoPALM
☆143 Updated 2 years ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆111 Updated last year
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156 Updated 2 years ago
cohere-ai / BinaryVectorDB
Efficient vector database for hundreds of millions of embeddings.
☆207 Updated last year
reactorsh / ambrosia
clean up your LLM datasets
☆115 Updated 2 years ago
pacman100 / peft-codegen-25
☆23 Updated 2 years ago
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆102 Updated last year
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55 Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45 Updated last year
closedai-project / closedai
Drop-in replacement for OpenAI, but with open models.
☆152 Updated 2 years ago