Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Models" [ICLR 2025]
☆63Oct 3, 2025Updated 4 months ago
Alternatives and similar repositories for episodic-memory-benchmark
Users that are interested in episodic-memory-benchmark are comparing it to the libraries listed below
Sorting:
- ☆16Feb 22, 2025Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆11Jul 21, 2024Updated last year
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- ☆13Dec 15, 2025Updated 2 months ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆21Jul 4, 2025Updated 7 months ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆16Feb 13, 2026Updated 2 weeks ago
- https://hf.co/hexgrad/Kokoro-82M☆14Jan 14, 2026Updated last month
- ☆17Dec 19, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Code and data for paper "(How) do Language Models Track State?"☆20Mar 31, 2025Updated 11 months ago
- Official code for the paper "Attention as a Hypernetwork"☆48Jun 22, 2024Updated last year
- ☆23Jan 31, 2025Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆27Mar 1, 2025Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆30Jul 4, 2025Updated 7 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Feb 20, 2026Updated last week
- A repository for research on medium sized language models.☆78May 23, 2024Updated last year
- ☆20May 30, 2024Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 3 weeks ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆21Jul 14, 2024Updated last year
- ☆19Jul 24, 2025Updated 7 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆186May 25, 2025Updated 9 months ago
- ☆25Dec 13, 2024Updated last year
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- ☆19Dec 4, 2025Updated 2 months ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆21Feb 16, 2025Updated last year
- ☆23Jan 27, 2025Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Oct 31, 2024Updated last year
- PageRank for LLMs☆51Sep 10, 2025Updated 5 months ago
- ☆24Apr 3, 2025Updated 10 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 9 months ago