This is InfiniRetri, a tool that enhances the ability of Transformer-based LLMs (Large Language Models) to handle long contexts.
☆120Mar 27, 2025Updated last year
Alternatives and similar repositories for InfiniRetri2
Users that are interested in InfiniRetri2 are comparing it to the libraries listed below.
- ☆52Feb 17, 2025Updated last year
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 7 months ago
- ☆14Oct 24, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 3 months ago
- ☆18Aug 18, 2022Updated 3 years ago
- [ACL 2025] An official PyTorch implementation of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆40May 28, 2025Updated 11 months ago
- [NeurIPS 2025] Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang L…☆71Mar 3, 2026Updated 2 months ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated 11 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆170Oct 20, 2025Updated 7 months ago
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- A collection of tricks and tools to speed up transformer models☆202May 6, 2026Updated 2 weeks ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆19May 7, 2022Updated 4 years ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 5 months ago
- ☆60Jul 9, 2024Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆38Feb 22, 2025Updated last year
- ☆12Mar 8, 2024Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- ☆24Jul 26, 2025Updated 9 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆38Apr 7, 2026Updated last month
- WebResearcher: An Iterative Deep-Research Agent☆48Feb 13, 2026Updated 3 months ago
- ☆129Mar 31, 2024Updated 2 years ago
- In this paper, the image is spliced based on super pixel.☆12Mar 15, 2019Updated 7 years ago
- ☆84Nov 10, 2025Updated 6 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- xKV: Cross-Layer SVD for KV-Cache Compression☆48Nov 30, 2025Updated 5 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 10 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Jun 15, 2024Updated last year
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Multifocus Image Fusion using DTCWT, Curvelet Transform and Nonsubsampled Contourlet Transform (NSCT)☆10Mar 7, 2023Updated 3 years ago
- This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labe…☆12Feb 28, 2023Updated 3 years ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year