An unofficial PyTorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'
☆55 Aug 19, 2024 Updated last year
Alternatives and similar repositories for infini-attention
Users that are interested in infini-attention are comparing it to the libraries listed below.
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆58 Mar 22, 2026 Updated last week
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆299 May 4, 2024 Updated last year
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont… ☆88 May 9, 2024 Updated last year
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I… ☆375 Apr 23, 2024 Updated last year
- Compute WER and SER for speech recognition evaluation ☆27 Mar 18, 2026 Updated last week
- Video Diffusion State Space Models ☆19 Mar 27, 2024 Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 Mar 11, 2024 Updated 2 years ago
- Community Open Source Implementation of GPT4o in PyTorch ☆26 Updated this week
- code for promptCSE, emnlp 2022 ☆11 Apr 10, 2023 Updated 2 years ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers. ☆67 Apr 24, 2024 Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 Aug 30, 2024 Updated last year
- ☆19 Jan 5, 2023 Updated 3 years ago
- replacement of AdamW and Lion optimizer for LLMs ☆13 May 28, 2023 Updated 2 years ago
- Community Repo for Nowledge Labs Products ☆61 Updated this week
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid … ☆23 Mar 22, 2026 Updated last week
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 Apr 10, 2025 Updated 11 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention ☆16 Aug 23, 2023 Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 Nov 7, 2025 Updated 4 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆26 Dec 20, 2024 Updated last year
- ☆24 Apr 25, 2023 Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆22 Feb 26, 2026 Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale ☆117 Apr 22, 2025 Updated 11 months ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners" ☆20 Jun 29, 2024 Updated last year
- Korean psychological counseling dataset ☆81 Jun 20, 2023 Updated 2 years ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ☆153 Jul 20, 2024 Updated last year
- [ICCV 2025] Dynamic-VLM ☆28 Dec 16, 2024 Updated last year
- AI-Powered YouTube Live Streamer ☆14 Jan 22, 2024 Updated 2 years ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables" ☆10 Apr 7, 2020 Updated 5 years ago
- Sequence-based prediction of peptide-TCR interactions using paired chain data ☆13 Feb 2, 2026 Updated last month
- Text-to-text alignment algorithm for speech recognition error analysis. ☆28 Jan 29, 2026 Updated 2 months ago
- ☆24 Dec 16, 2024 Updated last year
- ☆12 Jan 6, 2023 Updated 3 years ago
- Recursive Self-Aggregation evals on ARC-AGI ☆29 Jan 26, 2026 Updated 2 months ago
- Modified Beam Search with periodical restart ☆12 Sep 12, 2024 Updated last year
- Automatically remove watermarks from illustrations using AI (Stable Diffusion). ☆21 Dec 17, 2024 Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States" ☆25 Mar 16, 2026 Updated last week
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 Updated this week
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023) ☆22 Oct 8, 2023 Updated 2 years ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024) ☆163 Apr 13, 2025 Updated 11 months ago