SwayamInSync / MIRALinks
MIRA - Multimodal Image Reconstruction with Attention is a transformer (Encoder-Decoder) based architecture for Text / Image to 3D reconstruction
☆13Updated last year
Alternatives and similar repositories for MIRA
Users that are interested in MIRA are comparing it to the libraries listed below
Sorting:
- Implementation of language model papers along with several examples [NOT ALL WRITTEN FROM SCRATCH].☆12Updated 8 months ago
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.☆88Updated last week
- A question bank for interview questions for data related roles☆10Updated last year
- Making of cuda kernel☆16Updated last week
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆33Updated this week
- Notebooks to demonstrate TimmWrapper☆16Updated 4 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 7 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated last month
- 100 Days of GPU Challenge☆20Updated this week
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆39Updated last month
- High order Moment Models☆38Updated last week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago
- Exploration into the Firefly algorithm in Pytorch☆39Updated 3 months ago
- ☆14Updated 9 months ago
- Generating Labeled Image Datasets using Stable Diffusion Models☆25Updated last year
- ☆33Updated 7 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆33Updated last year
- Implementation of MobileViT in TensorFlow and Keras☆11Updated 2 years ago
- ☆47Updated 3 months ago
- Official implementation of ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis☆34Updated this week
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆70Updated last month
- SkyScenes: A Synthetic Dataset for Aerial Scene Understanding☆19Updated 8 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆31Updated 8 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆101Updated 2 months ago
- ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]☆23Updated last month
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆13Updated 3 months ago
- [ECCV 2024] SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction☆12Updated 8 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆133Updated last week
- RS-IMLE☆39Updated 5 months ago