Big-Interleaved-Dataset
☆59 · Jan 21, 2023 · Updated 3 years ago
Alternatives and similar repositories for Big-Interleaved-Dataset
Users that are interested in Big-Interleaved-Dataset are comparing it to the libraries listed below.
- Implementation of QA Networks ☆10 · Jul 14, 2016 · Updated 9 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆33 · Mar 21, 2023 · Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 · Jan 10, 2022 · Updated 4 years ago
- Recurrent Neural Network library for Torch7's nn ☆19 · Jan 26, 2017 · Updated 9 years ago
- Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab… ☆17 · Nov 7, 2018 · Updated 7 years ago
- The list of some conference papers. ☆11 · Apr 19, 2019 · Updated 7 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d… ☆214 · Aug 28, 2024 · Updated last year
- COYO-700M: Large-scale Image-Text Pair Dataset ☆1,255 · Nov 30, 2022 · Updated 3 years ago
- ViT trained on COYO-Labeled-300M dataset ☆33 · Nov 24, 2022 · Updated 3 years ago
- A Datasette instance for searching WebVid-10M ☆15 · Sep 30, 2022 · Updated 3 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge." ☆25 · Jan 21, 2019 · Updated 7 years ago
- Un-*** 50 billions multimodality dataset ☆24 · Sep 14, 2022 · Updated 3 years ago
- Xfce Desktop container designed for direct access to the GPU with EGL using VirtualGL for GPUs. Does not require /tmp/.X11-unix host sock… ☆10 · Jul 25, 2022 · Updated 3 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch ☆18 · Dec 22, 2022 · Updated 3 years ago
- ☆23 · Dec 16, 2022 · Updated 3 years ago
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities ☆43 · Jun 7, 2025 · Updated 11 months ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures" ☆30 · Dec 30, 2021 · Updated 4 years ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models ☆45 · Jun 14, 2024 · Updated last year
- Post-processing for fair classification ☆16 · Jun 30, 2025 · Updated 10 months ago
- Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models. ☆18 · Oct 14, 2023 · Updated 2 years ago
- ☆35 · Jul 5, 2023 · Updated 2 years ago
- Your fruity companion for transformers ☆14 · May 25, 2022 · Updated 4 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data" ☆17 · Sep 25, 2024 · Updated last year
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i… ☆14 · Jan 2, 2022 · Updated 4 years ago
- DMax: Aggressive Parallel Decoding for dLLMs ☆121 · May 15, 2026 · Updated last week
- A practice for million-scale multi-domain universal object detection ☆28 · Jun 13, 2024 · Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model ☆13 · Feb 15, 2024 · Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them ☆222 · May 26, 2024 · Updated 2 years ago
- Implementation of AlphaZero in PyTorch. ☆10 · Apr 19, 2019 · Updated 7 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆194 · Jun 21, 2025 · Updated 11 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集) ☆192 · Nov 17, 2023 · Updated 2 years ago
- Jump to better conclusions: SCAN both left and right ☆11 · Jan 24, 2019 · Updated 7 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 · Mar 31, 2022 · Updated 4 years ago
- ☆37 · May 7, 2023 · Updated 3 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed). ☆19 · Nov 13, 2024 · Updated last year
- Wikipedia navigation environment for OpenAI Gym ☆41 · Apr 2, 2023 · Updated 3 years ago
- Based on StackExchange.Redis that operates Tair For Redis Modules. ☆11 · Feb 28, 2025 · Updated last year
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆34 · Sep 16, 2023 · Updated 2 years ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text. ☆953 · Mar 19, 2025 · Updated last year