Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
☆261May 24, 2025Updated 9 months ago
Alternatives and similar repositories for dynamic-cheatsheet
Users who are interested in dynamic-cheatsheet are comparing it to the libraries listed below.
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025☆23Nov 13, 2025Updated 3 months ago
- Z.AI API Playground - Complete examples for GLM-4.7, Vision, Image/Video Generation, Audio, and more. Powered by Z.AI-GLM-4.7-Coding Plan☆49Feb 12, 2026Updated 3 weeks ago
- AI-Driven Research Systems (ADRS)☆128Dec 17, 2025Updated 2 months ago
- GitHub repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- ☆23Sep 19, 2024Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47May 11, 2025Updated 9 months ago
- Agent-OM: Leveraging LLM Agents for Ontology Matching☆19Jan 24, 2026Updated last month
- Official Repository of "Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory"☆46Feb 18, 2026Updated 2 weeks ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 5 months ago
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated last month
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆27Feb 14, 2026Updated 3 weeks ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- (ICLR 2026) Unveiling Super Experts in Mixture-of-Experts Large Language Models☆37Sep 25, 2025Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆171Updated this week
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆25Dec 1, 2025Updated 3 months ago
- This repository demonstrates various examples using YOLO☆13Feb 9, 2024Updated 2 years ago
- Belief in the Machine: Investigating Epistemological Blind Spots of Language Models☆32Apr 19, 2025Updated 10 months ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆23Feb 7, 2026Updated last month
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Apr 28, 2023Updated 2 years ago
- ☆16Apr 29, 2025Updated 10 months ago
- A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs☆18Jan 12, 2024Updated 2 years ago
- Storing long contexts in tiny caches with self-study☆244Dec 5, 2025Updated 3 months ago
- The open-source code of QAgent☆53Oct 14, 2025Updated 4 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 3 months ago
- Autonomous AI backend for deep research AI applications.☆30Feb 27, 2026Updated last week
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- MoCo: A One-Stop Shop for Model Collaboration Research☆48Feb 24, 2026Updated 2 weeks ago
- ☆23Jan 17, 2025Updated last year
- 👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- Clawdbot skills for agentic coding workflows - ACFS stack, cloud CLIs, and dev tools☆53Mar 3, 2026Updated last week
- ☆23Jan 27, 2025Updated last year
- Mine-tuning is a methodology for synchronizing human and AI attention.☆19Jun 16, 2024Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints☆20Jun 20, 2023Updated 2 years ago
- Self-hosted AI assistant with multi-channel support, scheduled tasks, and extensible skills☆68Updated this week
- ☆82May 28, 2025Updated 9 months ago
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆40Feb 27, 2026Updated last week
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year