huggingface / fineweb-2

☆208

Alternatives and similar repositories for fineweb-2

Users who are interested in fineweb-2 compare it with the repositories listed below.

allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆319Updated this week
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆276Updated last year
allenai / olmes
Reproducible, flexible LLM evaluations
☆264Updated 3 weeks ago
facebookresearch / ReasonIR
Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".
☆206Updated 4 months ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆242Updated last year
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
☆218Updated 2 weeks ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆259Updated this week
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆252Updated last year
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆148Updated 10 months ago
huggingface / cosmopedia
☆552Updated last year
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆169Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 9 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆246Updated last year
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆219Updated 5 months ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆264Updated 11 months ago
LLM360 / k2-train
☆52Updated last year
snowflakedb / ArcticTraining
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆245Updated this week
QuixiAI / spectrum
☆138Updated 2 months ago
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 8 months ago
jshuadvd / LongRoPE
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
☆152Updated last year
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆200Updated 6 months ago
jakespringer / echo-embeddings
☆156Updated last year
microsoft / LongRoPE
LongRoPE is a novel method that extends the context window of pre-trained LLMs to an impressive 2048k tokens.
☆271Updated 3 weeks ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆356Updated 11 months ago
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆215Updated 2 months ago
microsoft / lost_in_conversation
Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)
☆181Updated 4 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆297Updated last week
DaoD / INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
☆205Updated 11 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆299Updated 2 weeks ago
Mohammadjafari80 / GSM8K-RLVR
A simplified implementation for experimenting with RLVR on GSM8K. This repository provides a starting point for exploring reasoning.
☆145Updated 9 months ago