PrimeIntellect-ai / genesys
☆123Updated last month
Alternatives and similar repositories for genesys:
Users that are interested in genesys are comparing it to the libraries listed below
- Train your own SOTA deductive reasoning model☆91Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆65Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆172Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- prime-rl is a codebase for decentralized RL training at scale☆89Updated this week
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆150Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆257Updated this week
- ☆114Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- Verdict is a library for scaling judge-time compute.☆202Updated last week
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 4 months ago
- ☆37Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- Scaling Data for SWE-agents☆101Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆94Updated last week
- ☆50Updated 5 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- Complex Function Calling Benchmark.☆99Updated 3 months ago
- ☆65Updated 2 months ago
- accompanying material for sleep-time compute paper☆77Updated last week
- ☆117Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 3 months ago
- ☆55Updated this week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆44Updated 2 weeks ago
- Exploring Applications of GRPO☆189Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆186Updated 3 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆67Updated 5 months ago