Big-Interleaved-Dataset
☆59Jan 21, 2023Updated 3 years ago
Alternatives and similar repositories for Big-Interleaved-Dataset
Users that are interested in Big-Interleaved-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Recurrent Neural Network library for Torch7's nn☆19Jan 26, 2017Updated 9 years ago
- The list of some conference papers.☆11Apr 19, 2019Updated 7 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆321Dec 9, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…☆11Mar 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Nov 7, 2022Updated 3 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆213Aug 28, 2024Updated last year
- ViT trained on COYO-Labeled-300M dataset☆33Nov 24, 2022Updated 3 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Jan 21, 2019Updated 7 years ago
- Xfce Desktop container designed for direct access to the GPU with EGL using VirtualGL for GPUs. Does not require /tmp/.X11-unix host sock…☆10Jul 25, 2022Updated 3 years ago
- 采用知识图谱和上下文检索显著提高信息检索的精度☆10Oct 30, 2024Updated last year
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆18Dec 22, 2022Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆30Dec 30, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- Post-processing for fair classification☆16Jun 30, 2025Updated 10 months ago
- SVIT: Scaling up Visual Instruction Tuning☆168Jun 20, 2024Updated last year
- ☆35Jul 5, 2023Updated 2 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Sep 25, 2024Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆222May 26, 2024Updated last year
- Implementation of AlphaZero in PyTorch.☆10Apr 19, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆193Jun 21, 2025Updated 10 months ago
- ViT models pretrained with up to ~5k hours of human-like video data☆14Aug 10, 2023Updated 2 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆192Nov 17, 2023Updated 2 years ago
- MLIR backend for Nx☆14May 24, 2024Updated last year
- Jump to better conclusions: SCAN both left and right☆11Jan 24, 2019Updated 7 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 4 years ago
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆16Apr 23, 2024Updated 2 years ago
- ☆12Nov 3, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆19Nov 13, 2024Updated last year
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- ☆75May 10, 2024Updated last year
- Wikipedia navigation environment for OpenAI Gym☆41Apr 2, 2023Updated 3 years ago
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆953Mar 19, 2025Updated last year
- ☆28Oct 18, 2022Updated 3 years ago