NSTiwari / Stable-DiffusionXL-using-DreamBooth-and-LoRA
This project is an implementation of fine-tuning an SDXL model using DreamBooth and LoRA on custom data of interior rooms to generate designs for your home.
☆10Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Stable-DiffusionXL-using-DreamBooth-and-LoRA
- Notebooks for fine tuning pali gemma☆41Updated 3 months ago
- ☆13Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆9Updated 3 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆33Updated 2 weeks ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆65Updated 5 months ago
- ☆29Updated 11 months ago
- Eye exploration☆22Updated last month
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆65Updated 11 months ago
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆11Updated last year
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆24Updated last month
- NExT-GPT: Any-to-Any Multimodal Large Language Model☆19Updated last week
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- This repo contains self made projects and learnables from various resources on using local LLMs and RAG☆14Updated 6 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- SAM-CLIP module for use with Autodistill.☆12Updated 11 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆11Updated 7 months ago
- ☆12Updated 7 months ago
- ☆67Updated last month
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 8 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆29Updated 2 years ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated last year
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- Text-to-face implementation using AttnGan architecture.☆16Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated last week
- Image Search Engine with HuggingFace Sentence Transformer☆11Updated last year
- PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter.☆42Updated last year
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆9Updated 9 months ago
- ☆16Updated 8 months ago