SwayamInSync / MIRA
MIRA (Multimodal Image Reconstruction with Attention) is a transformer-based encoder-decoder architecture for text-to-3D and image-to-3D reconstruction.
☆13 Updated 11 months ago
Alternatives and similar repositories for MIRA:
Users who are interested in MIRA are comparing it to the repositories listed below.
- Implementation of language model papers along with several examples [NOT ALL WRITTEN FROM SCRATCH]. ☆12 Updated 5 months ago
- A question bank of interview questions for data-related roles ☆10 Updated 11 months ago
- Notebooks to demonstrate TimmWrapper ☆15 Updated last month
- This project is an implementation of fine-tuning an SDXL model using DreamBooth and LoRA on custom data of interior rooms to generate des… ☆10 Updated last year
- My implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML ☆26 Updated last year
- Generative model for 3D objects. ☆15 Updated last year
- PyTorch implementation of the paper "Learning to (Learn at Test Time): RNNs with Expressive Hidden States" ☆24 Updated 3 weeks ago
- Explorations into improving ViTArc with Slot Attention ☆37 Updated 4 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge" ☆31 Updated 2 months ago
- Official implementation of DINO-Foresight: Looking into the Future with DINO ☆47 Updated last week
- Exploration into the Firefly algorithm in PyTorch ☆35 Updated 3 weeks ago
- Notebook and scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 4 months ago
- Implementation of a multimodal diffusion transformer in PyTorch ☆100 Updated 8 months ago
- This repository compiles a list of papers related to Video LLM. ☆19 Updated 8 months ago
- PyTorch implementation of "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION" ☆64 Updated last month
- Visual RAG using less than 300 lines of code. ☆26 Updated last year
- High order Moment Models ☆38 Updated last week
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆67 Updated last year
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202… ☆15 Updated last year
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv… ☆35 Updated 4 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction ☆27 Updated 10 months ago
- Image Search Engine with HuggingFace Sentence Transformer ☆12 Updated last year
- ☆10 Updated 6 months ago
- Implementation of the proposed MaskBit from Bytedance AI ☆75 Updated 3 months ago
- The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion ☆36 Updated last week
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks ☆47 Updated 5 months ago
- ☆32 Updated 4 months ago