EQ-bench / eqbench3Links
☆28Updated 3 months ago
Alternatives and similar repositories for eqbench3
Users that are interested in eqbench3 are comparing it to the libraries listed below
Sorting:
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- ☆62Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- ☆39Updated last year
- ☆62Updated 10 months ago
- ☆40Updated 11 months ago
- ☆61Updated 11 months ago
- accompanying material for sleep-time compute paper☆117Updated 6 months ago
- ☆49Updated 9 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- entropix style sampling + GUI☆27Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- [ACL 2025] Agentic Knowledgeable Self-awareness☆89Updated 5 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆29Updated 7 months ago
- Multi-Granularity LLM Debugger [ICSE2026]☆91Updated 4 months ago
- Pivotal Token Search☆131Updated 4 months ago
- ☆51Updated last year
- ☆125Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 11 months ago
- ☆55Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆98Updated 5 months ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 8 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆35Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year