Train vector-quantized CLIP models using PyTorch Lightning
☆20Jul 14, 2024Updated last year
Alternatives and similar repositories for vq-clip
Users that are interested in vq-clip are comparing it to the libraries listed below.
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Upscale, enhance, and reimagine your renders with a single prompt using Stable Diffusion and FLUX.☆14Aug 26, 2024Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 3 months ago
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated last year
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 6 months ago
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 10 months ago
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"☆10Aug 22, 2018Updated 7 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- Fast and controllable text-to-image model.☆41Jun 16, 2023Updated 2 years ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated 2 years ago
- ☆15Apr 8, 2022Updated 4 years ago
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆19Sep 5, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆48May 30, 2022Updated 3 years ago
- Realtime Face detection demo using YOLO v2 and OpenCV DNN module☆17Mar 10, 2018Updated 8 years ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 5 months ago
- [ICLR 23] Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning☆40Jul 29, 2023Updated 2 years ago
- ESPER☆24Mar 29, 2024Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 8 months ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- 10ms, sd-turbo, 512x512, batch size 1, txt2img on consumer hardware☆19Dec 8, 2023Updated 2 years ago
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆29Aug 5, 2025Updated 8 months ago
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)☆15May 16, 2024Updated last year
- ☆18May 17, 2024Updated last year
- ☆13Mar 8, 2024Updated 2 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆34Mar 10, 2026Updated last month
- ☆16Mar 12, 2024Updated 2 years ago
- ☆23Jun 18, 2024Updated last year
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆14Jun 20, 2022Updated 3 years ago
- ☆14Jul 5, 2024Updated last year
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Mar 10, 2024Updated 2 years ago
- Official repository of the paper "GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval"☆29Apr 3, 2025Updated last year
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆106Nov 22, 2025Updated 4 months ago
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆31Mar 1, 2026Updated last month
- Simple google map☆10Feb 28, 2016Updated 10 years ago
- FL-Tuning☆12Jul 11, 2022Updated 3 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆24Jan 4, 2023Updated 3 years ago