Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60 May 28, 2024 Updated last year
Alternatives and similar repositories for SirLLM
Users who are interested in SirLLM are comparing it to the libraries listed below:
- ☆38 Oct 10, 2024 Updated last year
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks ☆54 Sep 28, 2025 Updated 4 months ago
- MegaRAG: Multimodal Graph-based RAG ☆33 Sep 16, 2025 Updated 4 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆118 Jun 15, 2024 Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆48 Jan 8, 2026 Updated last month
- ☆96 Dec 6, 2024 Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models" ☆102 Jul 9, 2024 Updated last year
- ☆28 May 24, 2025 Updated 8 months ago
- ☆46 Jun 11, 2025 Updated 8 months ago
- 🌟 Official code of our AAAI26 paper 🔍 WebFilter ☆35 Nov 9, 2025 Updated 3 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 Feb 9, 2025 Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE ☆71 Jul 11, 2023 Updated 2 years ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆21 Oct 10, 2024 Updated last year
- ☆20 Nov 3, 2024 Updated last year
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment ☆35 Jul 1, 2024 Updated last year
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs ☆34 Sep 3, 2024 Updated last year
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem… ☆397 Apr 20, 2024 Updated last year
- ☆18 Sep 5, 2024 Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S… ☆38 Jul 19, 2024 Updated last year
- FocusLLM: Scaling LLM’s Context by Parallel Decoding ☆44 Dec 8, 2024 Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆27 Oct 3, 2025 Updated 4 months ago
- An Experiment on Dynamic NTK Scaling RoPE ☆64 Nov 26, 2023 Updated 2 years ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆62 Apr 18, 2024 Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 Dec 30, 2025 Updated last month
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models ☆89 Apr 4, 2024 Updated last year
- ☆22 Dec 11, 2025 Updated 2 months ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference ☆56 Nov 20, 2024 Updated last year
- Official repo of paper LM2 ☆47 Feb 13, 2025 Updated last year
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral) ☆28 Jun 14, 2025 Updated 8 months ago
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti… ☆67 Apr 15, 2024 Updated last year
- ☆21 Apr 17, 2025 Updated 9 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati… ☆99 Jun 23, 2024 Updated last year
- ☆14 Jan 24, 2025 Updated last year
- ☆13 May 21, 2023 Updated 2 years ago
- A game engine made in Java using libgdx (Currently in alpha state, and probably will remain that way) ☆16 Jan 4, 2012 Updated 14 years ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models ☆15 Dec 8, 2025 Updated 2 months ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens ☆25 Nov 6, 2023 Updated 2 years ago
- ☆302 Jul 10, 2025 Updated 7 months ago
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs ☆44 Aug 14, 2024 Updated last year