An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'
☆55Aug 19, 2024Updated last year
Alternatives and similar repositories for infini-attention
Users that are interested in infini-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Mar 30, 2026Updated 2 weeks ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆299May 4, 2024Updated last year
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont…☆91May 9, 2024Updated last year
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆376Apr 23, 2024Updated last year
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Community Open Source Implementation of GPT4o in PyTorch☆26Updated this week
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆67Apr 24, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- ☆19Jan 5, 2023Updated 3 years ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Jun 11, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Updated this week
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- Another implementation of Hinton's capsule networks in tensorflow.☆19Feb 19, 2018Updated 8 years ago
- EvaByte: Efficient Byte-level Language Models at Scale☆117Apr 22, 2025Updated 11 months ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Flux reconstruction fluid flow solver for 1D PDEs written in Julia. Linear advection, Burgers, viscous Burgers, and Euler equations.☆13Apr 28, 2022Updated 3 years ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆153Jul 20, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆28Feb 10, 2026Updated 2 months ago
- Sequence-based prediction of peptide-TCR interactions using paired chain data☆13Feb 2, 2026Updated 2 months ago
- ☆24Dec 16, 2024Updated last year
- Rust derive macros for automating the boring stuff.☆14Aug 3, 2025Updated 8 months ago
- ☆12Jan 19, 2024Updated 2 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆31Jan 26, 2026Updated 2 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆24Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated this week
- Single-Image Crowd Counting via Multi-Column Convolutional Neural Network☆16Sep 29, 2018Updated 7 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- Bleeding edge low level Rust binding for GGML☆17Jun 26, 2024Updated last year
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆165Apr 13, 2025Updated last year
- ☆15Oct 31, 2023Updated 2 years ago
- Gemma 2B with 10M context length using Infini-attention.☆935May 12, 2024Updated last year