cohere-ai / magikarp
☆128Updated this week
Related projects ⓘ
Alternatives and complementary repositories for magikarp
- ☆38Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆198Updated 3 weeks ago
- ☆112Updated last month
- Manage scalable open LLM inference endpoints in Slurm clusters☆238Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer☆117Updated last month
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Code repository for the c-BTM paper☆105Updated last year
- ☆103Updated last month
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆125Updated last month
- Self-Alignment with Principle-Following Reward Models☆147Updated 8 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆130Updated this week
- Evaluating LLMs with fewer examples☆135Updated 7 months ago
- ☆71Updated 6 months ago
- ☆49Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- ☆101Updated 3 months ago
- A simple unified framework for evaluating LLMs☆145Updated 2 weeks ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆95Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆224Updated 2 weeks ago
- ☆46Updated 2 weeks ago
- ☆90Updated 4 months ago
- Supercharge huggingface transformers with model parallelism.☆75Updated last month
- Experiments for efforts to train a new and improved t5☆76Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆93Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆119Updated this week
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆115Updated 2 weeks ago
- ☆24Updated 3 weeks ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- Replicating O1 inference-time scaling laws☆49Updated last month