Big-Interleaved-Dataset
☆59Jan 21, 2023Updated 3 years ago
Alternatives and similar repositories for Big-Interleaved-Dataset
Users that are interested in Big-Interleaved-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of QA Networks☆10Jul 14, 2016Updated 9 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Mar 21, 2023Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- [ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…☆12Mar 29, 2024Updated 2 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Nov 7, 2022Updated 3 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆215Aug 28, 2024Updated last year
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,256Nov 30, 2022Updated 3 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Jan 21, 2019Updated 7 years ago
- Xfce Desktop container designed for direct access to the GPU with EGL using VirtualGL for GPUs. Does not require /tmp/.X11-unix host sock…☆10Jul 25, 2022Updated 3 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆18Dec 22, 2022Updated 3 years ago
- Recurrent Convolutional Memory Network (in progress)☆29Apr 16, 2016Updated 10 years ago
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆44Jun 7, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated 2 years ago
- Post-processing for fair classification☆16Jun 30, 2025Updated 11 months ago
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated last year
- Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.☆18Oct 14, 2023Updated 2 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Sep 25, 2024Updated last year
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- DMax: Aggressive Parallel Decoding for dLLMs☆126May 25, 2026Updated 3 weeks ago
- A practice for million-scale multi-domain universal object detection☆28Jun 13, 2024Updated 2 years ago
- ☆30Nov 15, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- ☆15Mar 6, 2025Updated last year
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆222May 26, 2024Updated 2 years ago
- GridSound wants to be a free browser-based HTML5 DAW (Digital Audio Workstation) following the new Web Audio API. You can test the applic…☆12Dec 9, 2018Updated 7 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆195Jun 21, 2025Updated 11 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆192Nov 17, 2023Updated 2 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 4 years ago
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆16Apr 23, 2024Updated 2 years ago
- ☆37May 7, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆19Nov 13, 2024Updated last year
- The easiest way to update static sites hosted on GitHub Pages with a visual editor☆11Mar 28, 2018Updated 8 years ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated 2 years ago
- ☆75May 10, 2024Updated 2 years ago
- # ParlAI Agent examples with PyTorch, Chainer and TensorFlow☆46Jan 19, 2018Updated 8 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆953Mar 19, 2025Updated last year