Code for ICML 2024 paper
☆35Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for larimar
Users that are interested in larimar are comparing it to the libraries listed below
Sorting:
- Building language models to predict more than one token ahead to enable further ahead predictions.☆12May 22, 2025Updated 9 months ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆57Nov 20, 2024Updated last year
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆52Aug 6, 2025Updated 6 months ago
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023]☆23Jan 10, 2025Updated last year
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…☆28Jul 15, 2025Updated 7 months ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆33Sep 30, 2025Updated 5 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated last year
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆38Jan 13, 2025Updated last year
- Faikin Remote (code and PCB)☆18Feb 22, 2026Updated last week
- Self-Questioning Language Models☆57Jan 5, 2026Updated last month
- Official code for the paper "Attention as a Hypernetwork"☆51Updated this week
- ☆41Nov 30, 2023Updated 2 years ago
- ☆12Jul 4, 2024Updated last year
- ☆15Mar 15, 2022Updated 3 years ago
- Implementation of LaViC (KDD 2025)☆13Jun 1, 2025Updated 9 months ago
- asyncio-friendly python API for Sensibo (https://sensibo.com). Requires Python 3.4+☆11Feb 11, 2026Updated 2 weeks ago
- ☆15Jun 28, 2023Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- Based on the tree of thoughts paper☆48Sep 7, 2023Updated 2 years ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆53Dec 17, 2024Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 4 months ago
- ☆13Sep 8, 2024Updated last year
- The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**☆14Apr 15, 2024Updated last year
- A small cli tool that downloads sheet music from MuseScore without the hassle☆15Oct 27, 2022Updated 3 years ago
- Easily build and deploy your AI Discord Bot with Mendable☆14Jul 14, 2024Updated last year
- ☆17Feb 3, 2026Updated last month
- A library for handling Structural Causal Models and performing interventional and counterfactual inference on them.☆13Jul 3, 2020Updated 5 years ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆44Aug 6, 2024Updated last year
- Deploy on Kubernetes with Helm/Helmfile and ArgoCD☆13Jan 12, 2026Updated last month
- ☆12Jul 7, 2024Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- A CircleCI Orb to simplify deployments to Kubernetes using Helm.☆12Jun 16, 2025Updated 8 months ago