casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆136Updated this week
Alternatives and similar repositories for OpenCoconut:
Users who are interested in OpenCoconut are comparing it to the libraries listed below.
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆118Updated 2 months ago
- ☆83Updated 2 months ago
- smolLM with Entropix sampler on pytorch☆147Updated 2 months ago
- ☆94Updated 2 weeks ago
- ☆121Updated 4 months ago
- look how they massacred my boy☆63Updated 2 months ago
- An automated tool for discovering insights from research paper corpora☆135Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆150Updated 2 months ago
- ☆46Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- ☆133Updated 3 months ago
- ☆62Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆141Updated 3 weeks ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆52Updated 4 months ago
- Code for TrackTheMind☆63Updated 3 weeks ago
- A simple unified framework for evaluating LLMs☆161Updated 2 weeks ago
- Collection of autoregressive model implementations☆76Updated this week
- ☆104Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆81Updated 3 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆116Updated 5 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆98Updated this week
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes.☆82Updated last year
- ☆113Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆212Updated 7 months ago
- This is the official repository for Inheritune.☆107Updated 3 months ago
- ☆115Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆89Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆162Updated this week
- ☆68Updated 4 months ago