Continual-Intelligence / SEALLinks
Self-Adapting Language Models
☆697Updated 3 weeks ago
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below
Sorting:
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆295Updated 3 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,123Updated 5 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆798Updated last month
- Atom of Thoughts for Markov LLM Test-Time Scaling☆577Updated 3 weeks ago
- Pretraining code for a large-scale depth-recurrent language model☆793Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆314Updated 8 months ago
- ☆262Updated 2 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆298Updated this week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,501Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆223Updated last week
- CodeScientist: An automated scientific discovery system for code-based experiments☆273Updated 2 weeks ago
- ☆162Updated 2 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆529Updated 2 weeks ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆225Updated 4 months ago
- Tina: Tiny Reasoning Models via LoRA☆266Updated last month
- Releases from OpenAI Preparedness☆792Updated last month
- Build your own visual reasoning model☆395Updated this week
- ☆156Updated 2 months ago
- Dream 7B, a large diffusion language model☆816Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆341Updated 7 months ago
- procedural reasoning datasets☆938Updated this week
- ☆210Updated 4 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆112Updated this week
- Testing baseline LLMs performance across various models☆278Updated 3 weeks ago
- Code and data for the Chain-of-Draft (CoD) paper☆310Updated 4 months ago
- ☆206Updated 3 weeks ago
- ☆585Updated 2 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆319Updated this week
- An agent benchmark with tasks in a simulated software company.☆488Updated this week
- Code for ExploreTom☆84Updated 2 weeks ago