This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.
☆120Mar 27, 2025Updated last year
Alternatives and similar repositories for InfiniRetri2
Users that are interested in InfiniRetri2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆52Feb 17, 2025Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 6 months ago
- Ongoing research project for code&math LLMs☆31Jul 4, 2025Updated 10 months ago
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 2 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆25May 27, 2025Updated 11 months ago
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- A collection of tricks and tools to speed up transformer models☆199Apr 28, 2026Updated last week
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆35Aug 14, 2024Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 4 months ago
- ☆59Jul 9, 2024Updated last year
- Graph neural network for predicting energy of known and hypothetical crystal structures☆10Jan 26, 2022Updated 4 years ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆38Feb 22, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 7 months ago
- ☆12Mar 8, 2024Updated 2 years ago
- Automatic prompt optimization framework for multi-step agent tasks.☆37Nov 12, 2024Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- ☆24Jul 26, 2025Updated 9 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆36Apr 7, 2026Updated 3 weeks ago
- ☆42Oct 16, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Apr 21, 2025Updated last year
- Hill Space is All You Need☆17Jul 11, 2025Updated 9 months ago
- ☆84Nov 10, 2025Updated 5 months ago
- xKV: Cross-Layer SVD for KV-Cache Compression☆49Nov 30, 2025Updated 5 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Jun 15, 2024Updated last year
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆29Mar 18, 2026Updated last month
- ☆23Nov 27, 2025Updated 5 months ago
- ☆12Mar 25, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Sep 2, 2025Updated 8 months ago
- This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labe…☆12Feb 28, 2023Updated 3 years ago
- This is the repository for the PR paper [Multi-Level Graph Learning Network for Hyperspectral Image Classification].☆11Feb 28, 2023Updated 3 years ago
- ☆41Apr 30, 2025Updated last year
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆662Apr 1, 2026Updated last month