predibase / lora_bakeoffLinks
☆20Updated 10 months ago
Alternatives and similar repositories for lora_bakeoff
Users that are interested in lora_bakeoff are comparing it to the libraries listed below
Sorting:
- Storing long contexts in tiny caches with self-study☆108Updated this week
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆61Updated 9 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 9 months ago
- ☆70Updated last week
- Just a bunch of benchmark logs for different LLMs☆119Updated 11 months ago
- Open Implementations of LLM Analyses☆105Updated 9 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆130Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 5 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆214Updated this week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆81Updated 2 months ago
- Evaluating LLMs with fewer examples☆160Updated last year
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- ☆68Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 10 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 11 months ago
- ☆45Updated last year
- PyTorch implementation of models from the Zamba2 series.☆184Updated 6 months ago
- experiments with inference on llama☆104Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- ☆87Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆75Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆173Updated 4 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Cray-LM unified training and inference stack.☆22Updated 5 months ago
- ☆36Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆106Updated 7 months ago
- ☆199Updated 7 months ago
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆33Updated 5 months ago