huggingface / OBELICS
Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.
☆189Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for OBELICS
- M4 experiment logbook☆56Updated last year
- E5-V: Universal Embeddings with Multimodal Large Language Models☆173Updated 4 months ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆164Updated last year
- Multimodal language model benchmark, featuring challenging examples☆149Updated 3 months ago
- VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning☆87Updated 2 months ago
- This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks"☆69Updated last week
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆112Updated last year
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆169Updated 3 weeks ago
- Scaling Data-Constrained Language Models