☆65Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for General-GPT
Users that are interested in General-GPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Un-*** 50 billions multimodality dataset☆24Sep 14, 2022Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 5 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133May 8, 2023Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆102Dec 14, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆37May 7, 2023Updated 3 years ago
- C++, OpenMP and CUDA implementation of Mean Shift clustering algorithm☆14Apr 27, 2020Updated 6 years ago
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- clip retrieval benchmark☆17May 4, 2022Updated 4 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Feb 19, 2023Updated 3 years ago
- An open source implementation of CLIP.☆33Nov 7, 2022Updated 3 years ago
- A tool for benchmarking image generation models.☆33Jan 13, 2023Updated 3 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆23Jun 13, 2023Updated 2 years ago
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification☆107Feb 2, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reimplementation of VFDT and EFDT. Our codebase is simple, concise, easy to reproduce with strong performances.☆10Feb 1, 2021Updated 5 years ago
- ☆30Nov 25, 2021Updated 4 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Mar 21, 2023Updated 3 years ago
- This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…☆20Jul 5, 2022Updated 3 years ago
- FRED: The Florence RGB-Event Drone Dataset☆73Apr 20, 2026Updated last month
- ☆27Mar 7, 2025Updated last year
- PyCon mini 東海 2024 のトーク「Google Colaboratoryで試すVLM」で紹介したサンプル集☆12Nov 15, 2024Updated last year
- CLIP-like model evaluation☆813Mar 19, 2026Updated 2 months ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51May 10, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Jun 14, 2021Updated 4 years ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆954Mar 19, 2025Updated last year
- ☆15Dec 28, 2022Updated 3 years ago
- Easily compute clip embeddings from video frames☆149Oct 31, 2023Updated 2 years ago
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆227Mar 25, 2026Updated 2 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆35Aug 12, 2024Updated last year
- [ICCVW 2023] - Mapping Memes to Words for Multimodal Hateful Meme Classification☆27Apr 17, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆38Jul 24, 2023Updated 2 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,773Mar 28, 2026Updated 2 months ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,424Oct 19, 2025Updated 7 months ago
- 친절한 실전 딥러닝 수업☆12Sep 22, 2020Updated 5 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆161Apr 3, 2024Updated 2 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Oct 9, 2022Updated 3 years ago
- PyTorch code for MUST☆108May 1, 2025Updated last year