daanelson / MiniGPT-4Links
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
☆15Updated last year
Alternatives and similar repositories for MiniGPT-4
Users that are interested in MiniGPT-4 are comparing it to the libraries listed below
Sorting:
- SDXL Multi-controlnet with loras☆26Updated last year
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆190Updated last year
- ☆112Updated 4 years ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆157Updated 2 years ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆226Updated 2 years ago
- Python package to generate image embeddings with CLIP without PyTorch/TensorFlow☆157Updated 3 years ago
- Inpainting using RunwayML's stable-diffusion-inpainting checkpoint☆19Updated 2 years ago
- Kandinsky 2 — multilingual text2image latent diffusion model☆87Updated last year
- extending stable diffusion prompts with suitable style cues using text generation☆178Updated 2 years ago
- 4bit bitsandbytes quants of the best 7B vlms☆33Updated last year
- ☆25Updated 3 years ago
- Convert an audio file to a waveform video☆11Updated 2 years ago
- This project is under development.☆23Updated 2 years ago
- ☆15Updated 2 years ago
- ☆87Updated 2 years ago
- ☆14Updated 2 years ago
- A colab notebook that combines Stable Diffusion + DALL-E Mini (Craiyon)☆125Updated 3 years ago
- Efficiently read embedding in streaming from any filesystem☆103Updated 4 months ago
- Remove text from AI-generated images☆300Updated last year
- Attempt at cog wrapper for IP_Adapter-face for SDXL☆15Updated last year
- convert pixels to SVG square-based shapes☆56Updated 5 months ago
- Implementations of zero-shot capabilities with Open AI's CLIP and computer vision models☆34Updated last year
- ☆15Updated 2 years ago
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆38Updated 2 years ago
- ☆51Updated 3 years ago
- Kandinsky 2 — multilingual text2image latent diffusion model☆74Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆158Updated 2 years ago
- LoRA Explorer model to explore Flux.1[Schnell] with LoRAs☆31Updated last year
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆77Updated 2 years ago
- Pure Python Port of Potrace☆147Updated last year