casper-hansen / OpenCoconutLinks

OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.

☆173

Alternatives and similar repositories for OpenCoconut

Users that are interested in OpenCoconut are comparing it to the libraries listed below

Sorting:

SalesforceAIResearch / LaTRO
☆117Updated 5 months ago
PrimeIntellect-ai / genesys
☆130Updated 4 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆101Updated 4 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68Updated 3 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 3 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆174Updated 4 months ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆225Updated 2 weeks ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆71Updated 4 months ago
jerber / lang-jepa
☆117Updated 7 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
allenai / infinigram-api
☆70Updated 2 weeks ago
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆45Updated 2 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
facebookresearch / ExploreToM
Code for ExploreTom
☆85Updated last month
vicksEmmanuel / latent-gemma
☆26Updated 6 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆217Updated this week
writer / writing-in-the-margins
☆118Updated 11 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆195Updated 2 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated 9 months ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆229Updated 9 months ago
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆112Updated 5 months ago
QuixiAI / grokadamw
☆134Updated 11 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
da03 / Internalize_CoT_Step_by_Step
☆187Updated 3 months ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆277Updated last week
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆113Updated 2 weeks ago