Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆55Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for SirLLM
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆30Updated last week
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆37Updated 3 weeks ago
- ☆36Updated 3 weeks ago
- ☆44Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM☆62Updated last month
- ☆49Updated last week
- This is the official repository for Inheritune.☆105Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- FuseAI Project☆76Updated 2 months ago
- ☆62Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated 6 months ago
- HelloBench: evaluating long text generation capabilities of LLMs☆29Updated 3 weeks ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 2 months ago
- ☆40Updated last month
- A repository for research on medium sized language models.☆74Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆76Updated 7 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆129Updated last month
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆72Updated 3 weeks ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- ☆35Updated last year
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 6 months ago
- ☆25Updated 2 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated 11 months ago
- Expert Specialized Fine-Tuning☆143Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆34Updated 2 weeks ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆110Updated 4 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆80Updated 7 months ago
- ☆48Updated 3 weeks ago