Pytorch implementation of https://arxiv.org/html/2404.07143v1
☆21Apr 13, 2024Updated last year
Alternatives and similar repositories for infini-attention
Users that are interested in infini-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont…☆88May 9, 2024Updated last year
- optimize neuro-centric parameters instead of weights to solve RL tasks☆14Oct 2, 2023Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 3 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Mar 9, 2026Updated 2 weeks ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector☆11Jun 24, 2023Updated 2 years ago
- ☆42Updated this week
- Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences".☆33May 23, 2023Updated 2 years ago
- 脑机接口资源汇总☆14Aug 11, 2022Updated 3 years ago
- Matlab-based spike sorter for NEV files☆11Mar 27, 2024Updated 2 years ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- ☆16Dec 4, 2025Updated 3 months ago
- 16-811 Project.☆10Jan 12, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- ☆15Nov 23, 2020Updated 5 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 4 years ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- Brain2Word:Decoding Brain Activity for Language Generation☆11Oct 20, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Literature reviews of (Unsupervised/self-supervised) pretraining on medical datasets☆18Jan 16, 2024Updated 2 years ago
- High-performance control stack for Embodied AI powered by the OpenClaw ecosystem. Designed for high-dynamic platforms including Humanoids…☆28Feb 16, 2026Updated last month
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration☆19Feb 14, 2025Updated last year
- 汉字构造表☆18Jul 16, 2025Updated 8 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- cc98爬虫☆15Sep 1, 2013Updated 12 years ago
- ☆16Jun 30, 2025Updated 8 months ago
- Simulating Realistic Human Scanpaths in Dynamic Real-World Scenes☆15Mar 3, 2026Updated 3 weeks ago
- ☆33Mar 6, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A list of papers about point cloud based place recognition, also known as loop closure detection in SLAM (processing)☆10Jan 30, 2024Updated 2 years ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 9 months ago
- Codebase for Inference-Time Policy Adapters☆25Nov 3, 2023Updated 2 years ago
- [TCSVT‘26] LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation☆20Feb 22, 2026Updated last month
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆299May 4, 2024Updated last year
- ☆43Aug 5, 2025Updated 7 months ago
- ☆27May 13, 2025Updated 10 months ago