NSTiwari / Stable-DiffusionXL-using-DreamBooth-and-LoRA
This project fine-tunes an SDXL model with DreamBooth and LoRA on a custom dataset of interior room images to generate designs for your home.
☆10 Updated last year
Alternatives and similar repositories for Stable-DiffusionXL-using-DreamBooth-and-LoRA:
Users who are interested in Stable-DiffusionXL-using-DreamBooth-and-LoRA are comparing it to the repositories listed below
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 4 months ago
- EdgeSAM model for use with Autodistill. ☆26 Updated 8 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆66 Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆62 Updated 6 months ago
- This repo contains self-made projects and lessons learned from various resources on using local LLMs and RAG ☆14 Updated 10 months ago
- ☆30 Updated last year
- ☆13 Updated last year
- Notebooks to demonstrate TimmWrapper ☆15 Updated last month
- This repository contains examples of using PaliGemma for tasks such as object detection, segmentation, image captioning, etc. ☆20 Updated 2 weeks ago
- This project is under development. ☆23 Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 Updated 9 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆45 Updated 11 months ago
- LoRA fine-tuned Stable Diffusion Deployment ☆31 Updated 2 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant" ☆18 Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 Updated 7 months ago
- Simple CogVLM client script ☆14 Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆67 Updated 9 months ago
- ☆24 Updated last year
- Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆10 Updated 5 months ago
- ☆29 Updated last year
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers. ☆29 Updated 2 years ago
- Quantization of LLMs and benchmarking. ☆10 Updated 11 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆35 Updated last year
- The open-source implementation of the base model behind GPT-4 from OpenAI [Language + Multi-Modal] ☆11 Updated last year
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning. ☆12 Updated last year
- Notebooks for fine-tuning PaliGemma ☆96 Updated 2 months ago
- IBM Quantum Challenge Fall 2023 ☆10 Updated last year
- ☆31 Updated last year
- Using Meta's open-source Llama 2 LLM for document question answering with local CPU inference ☆15 Updated last year
- PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter. ☆44 Updated last year