geronimi73 / 3090_shorts
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever
β37Updated 2 weeks ago
Alternatives and similar repositories for 3090_shorts:
Users that are interested in 3090_shorts are comparing it to the libraries listed below
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β67Updated 3 months ago
- β48Updated 3 months ago
- β87Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 7 months ago
- β62Updated 6 months ago
- Official implementation for 'Extending LLMsβ Context Window with 100 Samples'β76Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Trainingβ58Updated last week
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- Code for NeurIPS LLM Efficiency Challengeβ55Updated 10 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ40Updated 10 months ago
- Codebase accompanying the Summary of a Haystack paper.β74Updated 4 months ago
- β52Updated 8 months ago
- A pipeline for LLM knowledge distillationβ89Updated 3 weeks ago
- β24Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated 11 months ago
- ReBase: Training Task Experts through Retrieval Based Distillationβ28Updated last week
- β31Updated 7 months ago
- β37Updated last year
- β74Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.β115Updated last year
- This is the official repository for Inheritune.β109Updated this week
- β47Updated 5 months ago
- Universal text classifier for generative modelsβ22Updated 6 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRAβ101Updated 6 months ago
- PyTorch implementation for MRLβ18Updated 11 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTOβ¦β53Updated this week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β33Updated 11 months ago