jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆93 Updated this week
Related projects
Alternatives and complementary repositories for cde
- An introduction to LLM Sampling ☆61 Updated this week
- ☆92 Updated last month
- ☆100 Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆76 Updated 7 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 Updated last month
- Experiments for efforts to train a new and improved t5 ☆76 Updated 6 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆83 Updated last week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- ☆49 Updated 7 months ago
- look how they massacred my boy ☆54 Updated 3 weeks ago
- ☆48 Updated last year
- smolLM with Entropix sampler on pytorch ☆137 Updated last week
- Manage scalable open LLM inference endpoints in Slurm clusters ☆237 Updated 4 months ago
- ☆91 Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆46 Updated last week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆104 Updated last month
- Code for Zero-Shot Tokenizer Transfer ☆115 Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆105 Updated 2 weeks ago
- ☆44 Updated 2 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆171 Updated 3 weeks ago
- ☆106 Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍 ☆59 Updated 5 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆86 Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM ☆62 Updated last month
- Set of scripts to finetune LLMs ☆36 Updated 7 months ago
- ☆74 Updated 2 weeks ago
- ☆116 Updated 2 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆92 Updated last week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆98 Updated 9 months ago
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead ☆69 Updated this week