huggingface / wikirace-llms

☆25

Alternatives and similar repositories for wikirace-llms

Users who are interested in wikirace-llms are comparing it to the repositories listed below.

s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59 Updated last month
Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆60 Updated 6 months ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆34 Updated 7 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆52 Updated 9 months ago
arcee-ai / DAM
☆55 Updated last year
lightonai / pylate-rs
PyLate efficient inference engine
☆68 Updated 2 months ago
brendanhogan / picoDeepResearch
☆68 Updated 6 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21 Updated 6 months ago
BBischof / yapping
Verbosity control for AI agents
☆64 Updated last year
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆19 Updated last year
xjdr-alt / muzero_sketch
☆40 Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 Updated last year
meetdavidwan / clamr
CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval
☆22 Updated 5 months ago
willccbb / localchat
☆14 Updated 7 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99 Updated 4 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated 11 months ago
brendanhogan / completion_tree_view
☆15 Updated 7 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 9 months ago
joey00072 / Attention-as-graph
An alternative way of calculating self-attention
☆18 Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60 Updated last year
vivien000 / regex-constrained-decoding
Fast, High-Fidelity LLM Decoding with Regex Constraints
☆21 Updated last year
sunnweiwei / PPP-Agent
☆88 Updated 3 weeks ago
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50 Updated last year
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 Updated 10 months ago
teknium1 / transformers-gptq-quant
☆45 Updated 2 years ago
thomasnormal / fewshot
☆29 Updated last month
JD-P / RetroInstruct
Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search.
☆32 Updated last month
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55 Updated 7 months ago
taylorai / onnx_embedding_models
Utilities for loading and running text embeddings with ONNX
☆44 Updated 3 months ago