jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆195Updated last month
Alternatives and similar repositories for cde
Users who are interested in cde are comparing it to the libraries listed below.
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆205Updated 2 weeks ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆260Updated 11 months ago
- ☆124Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- ☆118Updated 9 months ago
- An extension of the nanoGPT repository for training small MOE models.☆152Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆181Updated 9 months ago
- PyTorch building blocks for the OLMo ecosystem☆238Updated this week
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".☆170Updated 3 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆225Updated 7 months ago
- An introduction to LLM Sampling☆78Updated 6 months ago
- minimal GRPO implementation from scratch☆90Updated 3 months ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆203Updated last month
- ☆134Updated 2 months ago
- Let's build better datasets, together!☆259Updated 6 months ago
- A pipeline for LLM knowledge distillation☆104Updated 2 months ago
- Code for Zero-Shot Tokenizer Transfer☆131Updated 5 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in PyTorch☆175Updated this week
- Prune transformer layers☆69Updated last year
- Late Interaction Models Training & Retrieval☆444Updated 2 weeks ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆108Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated 11 months ago
- A simplified implementation for experimenting with RLVR on GSM8K. This repository provides a starting point for exploring reasoning.☆101Updated 4 months ago
- ☆180Updated 2 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆137Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 4 months ago
- awesome synthetic (text) datasets☆282Updated 7 months ago
- ☆132Updated 10 months ago
- Reproducible, flexible LLM evaluations☆213Updated last month