SwayamInSync / MIRALinks
MIRA - Multimodal Image Reconstruction with Attention is a transformer (Encoder-Decoder) based architecture for Text / Image to 3D reconstruction
☆13Updated last year
Alternatives and similar repositories for MIRA
Users that are interested in MIRA are comparing it to the libraries listed below
Sorting:
- Research Paper Implementations☆10Updated last year
- Implementation of language model papers along with several examples [NOT ALL WRITTEN FROM SCRATCH].☆12Updated 11 months ago
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.☆123Updated 3 months ago
- Implementation of the paper "Denoising Diffusion Probabilistic Models" in PyTorch☆64Updated 2 years ago
- ☆11Updated 5 years ago
- 100 days of building GPU kernels!☆500Updated 5 months ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024☆144Updated last year
- 100 Days of GPU Challenge☆23Updated 3 weeks ago
- ☆161Updated last month
- Notebooks for fine tuning pali gemma☆117Updated 5 months ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆210Updated last year
- ☆44Updated 4 months ago
- Implementation of Diffusion Transformer Model in Pytorch☆70Updated 4 months ago
- Text to Image Latent Diffusion using a Transformer core☆208Updated last year
- This repo implements a Stable Diffusion model in PyTorch with all the essential components.☆225Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆401Updated 9 months ago
- ☆172Updated last year
- GPU Kernels☆194Updated 5 months ago
- From-scratch diffusion model implemented in PyTorch.☆98Updated last year
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models☆77Updated 2 months ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆472Updated 9 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆279Updated last year
- The CVF Open Access Downloader is a Python application designed to automate the bulk downloading of open-access papers from Computer Visi…☆10Updated last year
- My take on Flow Matching☆77Updated 8 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆296Updated 3 weeks ago
- This repo implements Diffusion Transformers(DiT) in PyTorch and provides training and inference code on CelebHQ dataset☆47Updated 8 months ago
- Implementation of different diffusion models for probabilistic image generation☆33Updated last year
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆104Updated 6 months ago
- 🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"☆294Updated last year
- PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar☆81Updated last year