A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
☆531Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for perceiver-io
Users that are interested in perceiver-io are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,206Aug 22, 2023Updated 2 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆95Apr 10, 2023Updated 3 years ago
- ☆258May 19, 2026Updated 3 weeks ago
- music generation with perceiver-ar model☆26Jul 20, 2022Updated 3 years ago
- Unofficial implementation of Perceiver IO☆129Jun 14, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Perceiver (transformer variant) implemented in JAX and Flax☆13Mar 29, 2021Updated 5 years ago
- This repository contains implementations and illustrative code to accompany DeepMind publications☆14,994Jun 4, 2026Updated last week
- Vector (and Scalar) Quantization, in Pytorch☆3,962Jun 5, 2026Updated last week
- ☆19Jul 31, 2025Updated 10 months ago
- Named tensors with first-class dimensions for PyTorch☆331Jun 14, 2023Updated 2 years ago
- SOTA Google's Perceiver-AR Music Transformer Implementation and Model☆103May 9, 2023Updated 3 years ago
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,888Jun 4, 2026Updated last week
- PyTorch examples powered by Lightning☆11Dec 28, 2022Updated 3 years ago
- Masked Visual Pre-training for Robotics☆247Apr 1, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows☆1,176Updated this week
- Test pytorch code with minimal computational overhead☆26Jun 8, 2023Updated 3 years ago
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.☆32Nov 4, 2024Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆10,490May 21, 2026Updated 3 weeks ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Jan 7, 2026Updated 5 months ago
- Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.☆1,481May 2, 2025Updated last year
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆791Feb 9, 2023Updated 3 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆39Sep 6, 2021Updated 4 years ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆1,001Jan 17, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,752Jan 20, 2026Updated 4 months ago
- An open source implementation of CLIP.☆13,889Jun 6, 2026Updated last week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,507May 31, 2026Updated last week
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆70Mar 11, 2022Updated 4 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,723Updated this week
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆89Oct 2, 2021Updated 4 years ago
- Adam with minor modifications which give significant improvement☆19Aug 20, 2021Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,464May 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,267Oct 18, 2022Updated 3 years ago
- Structured state space sequence models☆2,905Jul 17, 2024Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,236Jun 2, 2026Updated last week
- Elucidating the Design Space of Diffusion-Based Generative Models (EDM)☆1,964Mar 16, 2024Updated 2 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆84May 3, 2023Updated 3 years ago
- maximal update parametrization (µP)☆1,722Jul 17, 2024Updated last year
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Dec 8, 2023Updated 2 years ago