Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32May 28, 2025Updated 11 months ago
Alternatives and similar repositories for lime
Users that are interested in lime are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "Steering LLM Reasoning Through Bias-Only Adaptation" and "Small Vectors, Big Effects: A Mechanistic Study of …☆48Oct 7, 2025Updated 6 months ago
- Примеры пропозалов для подачи заявки в Open.TLab☆27Dec 15, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆16Jul 16, 2024Updated last year
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆33Dec 24, 2025Updated 4 months ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Jul 10, 2022Updated 3 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 5 months ago
- ☆15Sep 4, 2025Updated 8 months ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆21Oct 21, 2025Updated 6 months ago
- ☆16Sep 22, 2024Updated last year
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆641Feb 10, 2024Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Mar 2, 2025Updated last year
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated 2 years ago
- ☆20Mar 19, 2025Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- ☆15Mar 20, 2025Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆34Mar 7, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Source code for SWIFT, an efficient reward model.☆21Jan 13, 2026Updated 3 months ago
- ☆23Mar 7, 2025Updated last year
- dynamic planning, hybrid models, hierarchical active inference, tool use☆15Jun 13, 2025Updated 10 months ago
- ☆54May 20, 2024Updated last year
- [ECCV'24 Oral] Momentum Auxiliary Network for Supervised Local Learning☆14Aug 15, 2024Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆55Jan 28, 2024Updated 2 years ago
- Multi-agent simulator in Jax for research and teaching in AI & ALife☆31Apr 11, 2026Updated 3 weeks ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆36Nov 22, 2024Updated last year
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆21Jan 8, 2024Updated 2 years ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 9 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆25Oct 27, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- 魔镜魔镜,无所不知的魔镜[-_-](并不是)☆13Jun 10, 2021Updated 4 years ago
- ☆31Sep 23, 2024Updated last year