EQ-bench / eqbench3Links
☆37Updated 3 months ago
Alternatives and similar repositories for eqbench3
Users that are interested in eqbench3 are comparing it to the libraries listed below
Sorting:
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆46Updated last year
- ☆39Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 7 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- ☆55Updated last year
- ☆128Updated 7 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 5 months ago
- ☆40Updated 11 months ago
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- ☆46Updated 5 months ago
- ☆35Updated last year
- ☆62Updated last year
- ☆51Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 6 months ago
- A repository for research on medium sized language models.☆78Updated last year
- ☆48Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 5 months ago
- ☆41Updated 5 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆30Updated 8 months ago
- Open Implementations of LLM Analyses☆107Updated last year
- ☆24Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- ☆17Updated 8 months ago
- ☆45Updated last year
- ☆92Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month