Tufalabs / BeyondNextTokenPredictionLinks

☆19

Alternatives and similar repositories for BeyondNextTokenPrediction

Users that are interested in BeyondNextTokenPrediction are comparing it to the libraries listed below

Sorting:

dinobby / MAgICoRE
☆24Updated 8 months ago
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆25Updated 3 months ago
princeton-nlp / ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆21Updated 3 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆47Updated 9 months ago
thomasgauthier / LLM-self-play
Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆29Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆25Updated 4 months ago
arcee-ai / DAM
☆49Updated 6 months ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆29Updated last week
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆44Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57Updated 9 months ago
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 3 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34Updated 8 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆41Updated 3 months ago
Leezekun / MacRAG
☆14Updated 3 weeks ago
JHU-CLSP / RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆33Updated 8 months ago
MiuLab / PairDistill
Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.
☆22Updated 6 months ago
princeton-pli / MeCo
Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆39Updated 3 weeks ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆47Updated last year
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆35Updated 2 months ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆31Updated 2 months ago
Zyphra / Zyda_processing
☆34Updated 11 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆44Updated 3 months ago
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26Updated 5 months ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78Updated last year
du-nlp-lab / MLR-Copilot
☆65Updated 2 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆32Updated 2 months ago
gangiswag / infogent
☆21Updated 3 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆73Updated 3 weeks ago