daanelson / MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
☆15Updated last year
Alternatives and similar repositories for MiniGPT-4
Users who are interested in MiniGPT-4 compare it to the libraries listed below.
- Convert an audio file to a waveform video☆11Updated last year
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆186Updated last year
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆304Updated last year
- A simple library that speeds up CLIP inference by up to 3x (K80 GPU)☆221Updated 2 years ago
- SDXL Multi-controlnet with loras☆26Updated last year
- extending stable diffusion prompts with suitable style cues using text generation☆178Updated 2 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆152Updated 3 years ago
- ☆15Updated 2 years ago
- ☆25Updated 3 years ago
- Inpainting using RunwayML's stable-diffusion-inpainting checkpoint☆19Updated 2 years ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆156Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- Attempt at cog wrapper for IP_Adapter-face for SDXL☆14Updated 9 months ago
- convert pixels to SVG square-based shapes☆54Updated 2 months ago
- Stable Diffusion XL training and inference as a cog model☆227Updated 9 months ago
- ☆112Updated 4 years ago
- Pure Python Port of Potrace☆131Updated last year
- ☆15Updated 2 years ago
- Kandinsky 2 — multilingual text2image latent diffusion model☆74Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆156Updated last year
- ☆100Updated 2 years ago
- A fork of DreamBooth with Stable Diffusion V2 Enabled.☆92Updated 2 years ago
- An open toolbox for NSFW & toxicity detection☆56Updated 2 years ago
- LoRA training model packaged with Cog☆114Updated 2 years ago
- Remove text from AI-generated images☆295Updated 10 months ago
- Create GIFs and Videos using Stable Diffusion☆224Updated last year
- ☆11Updated last year
- Building the laion5B paper☆36Updated 3 years ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆74Updated 2 years ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗☆76Updated 10 months ago