casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆172Updated 3 months ago
Alternatives and similar repositories for OpenCoconut:
Users that are interested in OpenCoconut are comparing it to the libraries listed below
- ☆114Updated 2 months ago
- Train your own SOTA deductive reasoning model☆91Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 3 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- ☆170Updated 2 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆189Updated 5 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆150Updated this week
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆165Updated 4 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆221Updated 6 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆172Updated last month
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆17Updated this week
- This is the official repository for Inheritune.☆111Updated 2 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated last month
- A simple unified framework for evaluating LLMs☆209Updated 3 weeks ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆171Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆186Updated 3 weeks ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- Replicating O1 inference-time scaling laws☆84Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆65Updated last month
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆56Updated 3 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Evaluating LLMs with fewer examples☆151Updated last year
- ☆120Updated 7 months ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 7 months ago
- The official evaluation suite and dynamic data release for MixEval.☆238Updated 5 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- ☆109Updated 4 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆336Updated 2 weeks ago