geronimi73 / 3090_shorts
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever
☆38 · Updated last week
Alternatives and similar repositories for 3090_shorts:
Users who are interested in 3090_shorts are comparing it to the libraries listed below:
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆73 · Updated 5 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆26 · Updated last month
- ☆32 · Updated 9 months ago
- Official implementation for 'Extending LLMs' Context Window with 100 Samples' ☆75 · Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆59 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 · Updated last year
- ☆48 · Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 5 months ago
- ☆62 · Updated 8 months ago
- ☆87 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 · Updated last year
- Set of scripts to finetune LLMs ☆37 · Updated 11 months ago
- ☆24 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆75 · Updated 6 months ago
- ☆38 · Updated last month
- Mixing Language Models with Self-Verification and Meta-Verification ☆102 · Updated 3 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆117 · Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Updated last year
- Code for NeurIPS LLM Efficiency Challenge ☆57 · Updated 11 months ago
- ☆74 · Updated last year
- Repository containing awesome resources regarding Hugging Face tooling. ☆46 · Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆41 · Updated 11 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆44 · Updated 10 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 · Updated 3 months ago
- Simple GRPO scripts and configurations. ☆58 · Updated last month
- Evaluating LLMs with CommonGen-Lite ☆89 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 · Updated 11 months ago