PyTorch implementation of https://arxiv.org/html/2404.07143v1
☆21 Apr 13, 2024 Updated 2 years ago
Alternatives and similar repositories for infini-attention
Users who are interested in infini-attention are comparing it to the libraries listed below.
- Efficient Infinite Context Transformers with Infini-attention PyTorch Implementation + QwenMoE Implementation + Training Script + 1M cont… ☆91 May 9, 2024 Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆58 Mar 30, 2026 Updated 2 weeks ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and … ☆11 Jun 18, 2024 Updated last year
- The repository of VG-Refiner paper ☆17 Dec 9, 2025 Updated 4 months ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction ☆12 May 1, 2024 Updated last year
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid … ☆23 Mar 30, 2026 Updated 2 weeks ago
- [TMI'22] Personalized Retrogress-Resilient Federated Learning Towards Imbalanced Medical Data ☆15 Jul 20, 2022 Updated 3 years ago
- Implementation of NeurIPS 2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieval ☆11 Nov 12, 2024 Updated last year
- ☆16 Dec 4, 2025 Updated 4 months ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by NeurIPS 2024) ☆14 Jan 7, 2025 Updated last year
- 16-811 Project. ☆10 Jan 12, 2018 Updated 8 years ago
- ☆17 Feb 26, 2024 Updated 2 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆37 Oct 3, 2025 Updated 6 months ago
- CMU Linguistic Annotation Backend ☆15 Sep 22, 2025 Updated 6 months ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version ☆11 Apr 18, 2021 Updated 4 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- ☆19 Oct 14, 2024 Updated last year
- Literature reviews of (Unsupervised/self-supervised) pretraining on medical datasets ☆18 Jan 16, 2024 Updated 2 years ago
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration ☆19 Feb 14, 2025 Updated last year
- Your virtual companion/waifu powered by ChatGPT and other state-of-the-art AI models ☆11 Sep 11, 2023 Updated 2 years ago
- Table of Chinese character structures ☆18 Jul 16, 2025 Updated 9 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos ☆26 Feb 1, 2024 Updated 2 years ago
- cc98 web crawler ☆15 Sep 1, 2013 Updated 12 years ago
- Simulating Realistic Human Scanpaths in Dynamic Real-World Scenes ☆15 Mar 3, 2026 Updated last month
- ☆35 Mar 6, 2026 Updated last month
- A list of papers about point cloud based place recognition, also known as loop closure detection in SLAM (processing) ☆10 Jan 30, 2024 Updated 2 years ago
- KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding ☆66 Apr 5, 2026 Updated last week
- ☆34 Nov 13, 2025 Updated 5 months ago
- Prognostication of chronic disorders of consciousness using resting state fMRI and clinical characteristics ☆11 Feb 12, 2026 Updated 2 months ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs ☆19 Mar 20, 2025 Updated last year
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆299 May 4, 2024 Updated last year
- [TCSVT'26] LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation ☆21 Feb 22, 2026 Updated last month
- ☆43 Aug 5, 2025 Updated 8 months ago
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation ☆25 Aug 17, 2025 Updated 7 months ago
- MFF Matlab file importer and exporter ☆15 Aug 1, 2024 Updated last year
- ☆27 May 13, 2025 Updated 11 months ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ☆34 Oct 13, 2025 Updated 6 months ago
- [Arxiv'25] SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images ☆48 Oct 18, 2025 Updated 5 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows ☆19 Nov 4, 2025 Updated 5 months ago