predibase / lora_bakeoff
☆16Updated 2 weeks ago
Related projects: ⓘ
- ☆38Updated 8 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- ☆68Updated last month
- ☆75Updated 3 weeks ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆60Updated last year
- ☆55Updated 9 months ago
- Drift detection module for machine learning pipelines.☆20Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- Understanding how features learned by neural networks evolve throughout training☆30Updated this week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- ☆26Updated this week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs☆69Updated 7 months ago
- 🤝 Trade any tensors over the network☆30Updated 11 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆40Updated 9 months ago
- ☆29Updated 2 weeks ago
- NLP with Rust for Python 🦀🐍☆57Updated 3 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆82Updated 3 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆86Updated 3 months ago
- ☆91Updated last month
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆94Updated 2 weeks ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆27Updated 2 months ago
- Tools for merging pretrained large language models.☆19Updated 3 months ago
- ☆25Updated this week
- ☆48Updated 11 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆67Updated 2 months ago
- ☆24Updated last year
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆73Updated last month
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆55Updated 10 months ago