predibase / lora_bakeoffLinks
☆20Updated last year
Alternatives and similar repositories for lora_bakeoff
Users that are interested in lora_bakeoff are comparing it to the libraries listed below
Sorting:
- Evaluating LLMs with fewer examples☆169Updated last year
- ☆47Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- PyTorch implementation of models from the Zamba2 series.☆186Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆59Updated 2 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Updated last year
- ☆71Updated last year
- Supercharge huggingface transformers with model parallelism.☆78Updated 6 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- ☆56Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- ☆52Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Updated 4 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 3 months ago
- ☆31Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- experiments with inference on llama☆103Updated last year
- ☆93Updated last month
- ☆137Updated last year
- ☆153Updated 5 months ago
- Memoria is a human-inspired memory architecture for neural networks.☆84Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- ☆48Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago