jxmorris12 / cde

code for training & evaluating Contextual Document Embedding models

☆194

Alternatives and similar repositories for cde

Users who are interested in cde are comparing it to the libraries listed below.

huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆265 Updated last year
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
☆206 Updated last month
cognitivecomputations / spectrum
☆128 Updated 3 months ago
google-deepmind / mishax
☆134 Updated 3 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173 Updated 5 months ago
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆203 Updated 2 months ago
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133 Updated 6 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated 7 months ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆258 Updated this week
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆286 Updated last week
cognitivecomputations / grokadamw
☆134 Updated 10 months ago
huggingface / fineweb-2
☆160 Updated 2 weeks ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆229 Updated 8 months ago
writer / writing-in-the-margins
☆118 Updated 10 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆198 Updated 11 months ago
SinatrasC / entropix-smollm
SmolLM with Entropix sampler on PyTorch
☆150 Updated 8 months ago
allenai / infinigram-api
☆69 Updated last month
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88 Updated 9 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆80 Updated 2 months ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆260 Updated 6 months ago
huggingface / picotron_tutorial
☆198 Updated 5 months ago
melisa-writer / short-transformers
Prune transformer layers
☆69 Updated last year
cohere-ai / magikarp
Code for the paper "Fishing for Magikarp"
☆157 Updated 2 months ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆223 Updated 7 months ago
NVIDIA / logits-processor-zoo
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆315 Updated last week
MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆245 Updated 9 months ago
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆138 Updated 8 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆96 Updated 4 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆211 Updated this week
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆84 Updated 4 months ago