This is InfiniRetri, a tool that enhances the ability of Transformer-based LLMs (Large Language Models) to handle long contexts.
☆122Mar 27, 2025Updated last year
Alternatives and similar repositories for InfiniRetri2
Users who are interested in InfiniRetri2 are comparing it to the libraries listed below.
- ☆52Feb 17, 2025Updated last year
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆36Oct 13, 2025Updated 8 months ago
- ☆14Oct 24, 2024Updated last year
- Ongoing research project for code&math LLMs☆31Jul 4, 2025Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆26Oct 10, 2025Updated 8 months ago
- [NeurIPS 2025] Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang L…☆72Mar 3, 2026Updated 4 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆173Oct 20, 2025Updated 8 months ago
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆19May 7, 2022Updated 4 years ago
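- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chit-chat; model retrieval: this part builds a corpus and, when the user issues a new query (classified as a business dialogue by intent recognition), matches the best response for that query, using HSWN for recall (coarse ranking), then computing sentence similarity and using Lig…☆12Feb 18, 2021Updated 5 years ago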
- ☆61Jul 9, 2024Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆39Feb 22, 2025Updated last year
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆21Sep 24, 2025Updated 9 months ago
- ☆12Mar 8, 2024Updated 2 years ago
- Automatic prompt optimization framework for multi-step agent tasks.☆37Nov 12, 2024Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Haskell Protocol Buffers☆12Jan 22, 2020Updated 6 years ago
- ☆16Nov 5, 2024Updated last year
- ☆11Oct 8, 2022Updated 3 years ago
- ☆26Jul 26, 2025Updated 11 months ago
- ☆47Oct 16, 2025Updated 8 months ago
- WebResearcher: An Iterative Deep-Research Agent☆49Feb 13, 2026Updated 4 months ago
- ☆130Mar 31, 2024Updated 2 years ago
- ☆84Jun 2, 2026Updated last month
- xKV: Cross-Layer SVD for KV-Cache Compression☆52Updated this week
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct fine-tuning (transformers)/LORA (peft)/inference, with Chinese support (chinese, zh)☆34May 17, 2024Updated 2 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jun 2, 2026Updated last month
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Jun 15, 2024Updated 2 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆30Mar 18, 2026Updated 3 months ago
- ☆18Dec 8, 2024Updated last year
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- ☆25Nov 27, 2025Updated 7 months ago
- This repository has been moved to https://gitlab.com/twittner/cql-io☆13Feb 20, 2016Updated 10 years ago
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆16Mar 21, 2022Updated 4 years ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆663Apr 1, 2026Updated 3 months ago
- The core code of "Assisted learning for land use classification: The important role of semantic correlation between heterogeneous images"☆13Dec 16, 2023Updated 2 years ago