Zoeyyao27 / SirLLMLinks
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆58Updated last year
Alternatives and similar repositories for SirLLM
Users that are interested in SirLLM are comparing it to the libraries listed below
Sorting:
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆44Updated 4 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- ☆37Updated 8 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆36Updated 8 months ago
- The first dense retrieval model that can be prompted like an LM☆73Updated last month
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 6 months ago
- This is the official repository for Inheritune.☆111Updated 4 months ago
- FuseAI Project☆87Updated 5 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆54Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆115Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- ☆24Updated 9 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 8 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆127Updated 10 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆14Updated 9 months ago
- A repository for research on medium sized language models.☆76Updated last year
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆45Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆89Updated 11 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆52Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆37Updated 11 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆112Updated last month
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆33Updated 3 months ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆48Updated 11 months ago
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆79Updated last week
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆45Updated 6 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month