cosmoquester / memoriaLinks
Memoria is a human-inspired memory architecture for neural networks.
☆84Updated last year
Alternatives and similar repositories for memoria
Users that are interested in memoria are comparing it to the libraries listed below
Sorting:
- A repository for research on medium sized language models.☆77Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated last year
- ☆123Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆153Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- ☆87Updated 2 years ago
- Simple GRPO scripts and configurations.☆59Updated last year
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆49Updated 2 years ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆175Updated last year
- ☆62Updated 2 years ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆58Updated last year
- GoldFinch and other hybrid transformer components☆45Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 8 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆115Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- ☆130Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- ☆56Updated last year
- ☆105Updated last year
- ☆48Updated last year
- ☆41Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models☆122Updated last year
- Evaluating LLMs with CommonGen-Lite☆94Updated last year
- ☆47Updated 2 years ago
- Official repo for Learning to Reason for Long-Form Story Generation☆74Updated 9 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆66Updated 11 months ago
- Track the progress of LLM context utilisation☆55Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- A repository for transformer critique learning and generation☆89Updated 2 years ago