wangyu-ustc / MemoryLLMLinks
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
☆182Updated last week
Alternatives and similar repositories for MemoryLLM
Users that are interested in MemoryLLM are comparing it to the libraries listed below
Sorting:
- The official implementation of Self-Play Preference Optimization (SPPO)☆569Updated 5 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆225Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆145Updated 9 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆110Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 3 months ago
- ☆82Updated 6 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆228Updated 2 months ago
- ☆373Updated this week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆144Updated last week
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆79Updated last month
- ☆71Updated 4 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆109Updated 5 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆103Updated 2 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆229Updated 8 months ago
- ☆319Updated 10 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆264Updated 4 months ago
- ☆180Updated last month
- This is the official repository for Inheritune.☆112Updated 5 months ago
- ☆90Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆139Updated 8 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆44Updated 7 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆70Updated 3 weeks ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆209Updated this week
- Test-time preferenece optimization (ICML 2025).☆147Updated 2 months ago
- ☆136Updated last month
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆99Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆253Updated last week
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated last year