SwayamInSync / MIRALinks
MIRA - Multimodal Image Reconstruction with Attention is a transformer (Encoder-Decoder) based architecture for Text / Image to 3D reconstruction
☆13Updated last year
Alternatives and similar repositories for MIRA
Users that are interested in MIRA are comparing it to the libraries listed below
Sorting:
- A question bank for interview questions for data related roles☆10Updated last year
- 100 Days of GPU Challenge☆20Updated this week
- Implementation of language model papers along with several examples [NOT ALL WRITTEN FROM SCRATCH].☆12Updated 8 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 5 months ago
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.☆106Updated 3 weeks ago
- Exploration into the Firefly algorithm in Pytorch☆40Updated 4 months ago
- Making of cuda kernel☆16Updated last month
- Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias, from Adobe Research☆101Updated 4 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆102Updated last year
- Implementation of the proposed Spline-Based Transformer from Disney Research☆97Updated 7 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 8 months ago
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv…☆40Updated 7 months ago
- Explorations into improving ViTArc with Slot Attention☆42Updated 8 months ago
- Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆64Updated 2 months ago
- Contains materials for my talk "You don't know TensorFlow".☆9Updated 2 years ago
- Implementation of a transformer for reinforcement learning using `x-transformers`☆60Updated last week
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆88Updated 3 weeks ago
- vision language models finetuning notebooks & use cases (paligemma - florence .....)☆27Updated 2 weeks ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆49Updated 4 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 5 months ago
- High order Moment Models☆38Updated 3 weeks ago
- Implementation of Diffusion Transformer Model in Pytorch☆61Updated last month
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆31Updated 8 months ago
- Toy genetic algorithm in Pytorch☆53Updated last month
- ☆64Updated 2 months ago
- ☆47Updated 4 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆78Updated last month
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆64Updated 3 weeks ago