daanelson / MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
☆14Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for MiniGPT-4
- A repository containing datasets and tools to train a watermark classifier.☆60Updated 2 years ago
- A Gradio component that can be used to annotate images with bounding boxes.☆31Updated 2 weeks ago
- Example of YOLOv8 object detection on browser. It is powered by ONNX and TFJS and served through JavaScript without any frameworks. It de…☆30Updated 4 months ago
- Easily compute clip embeddings from video frames☆136Updated last year
- Cross-platform, customizable ML solutions for live and streaming media.☆22Updated 3 years ago
- Efficiently read embedding in streaming from any filesystem☆94Updated 6 months ago
- an optimized, production-ready implementation of active speaker detection☆52Updated 5 months ago
- A few Colab Notebooks which showcase a hacky way to run Cog Containers in Google Colag☆9Updated last year
- Attempt at cog wrapper for IP_Adapter-face for SDXL☆14Updated last year
- Stable Diffusion 3.0 beta Generation GUI for image generation process and automatic save images.☆14Updated 6 months ago
- Make-A-Video Latent Diffusion Model☆18Updated 11 months ago
- ☆84Updated last year
- Discord AI Generation Bot to collect an aesthetic rating dataset☆61Updated last year
- ☆10Updated 5 months ago
- openai guided diffusion tweaks☆52Updated 2 years ago
- checkpoints for glide finetuned on laion and other datasets. wip.☆50Updated 2 years ago
- ☆28Updated 2 years ago
- Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.☆86Updated last year
- ☆49Updated 2 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆203Updated last year
- Cog wrapper for Vchitect/SEINE☆37Updated 11 months ago
- Cog wrapper for sdxl-lightning 4step Unet☆31Updated last month
- CLIP Guided Diffusion☆53Updated 7 months ago
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆27Updated last month
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆201Updated last year
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆59Updated 2 years ago
- cheap views of intermediate Stable Diffusion results☆45Updated 2 years ago
- ☆111Updated 3 years ago
- ☆42Updated last week