simonw / webvid-datasetteLinks
A Datasette instance for searching WebVid-10M
☆15Updated 3 years ago
Alternatives and similar repositories for webvid-datasette
Users that are interested in webvid-datasette are comparing it to the libraries listed below
Sorting:
- LL3M: Large Language and Multi-Modal Model in Jax☆74Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- ☆50Updated 2 years ago
- Big-Interleaved-Dataset☆58Updated 3 years ago
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆149Updated last year
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆66Updated last month
- ☆23Updated 2 years ago
- Official code for infimm-hd☆16Updated last year
- Matryoshka Multimodal Models☆122Updated last year
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".☆69Updated 9 months ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆113Updated 11 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated 2 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Updated last year
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆100Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆90Updated 2 weeks ago
- Train vector quantized CLIP models using pytorch lightning☆20Updated last year
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆31Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Updated last year
- Un-*** 50 billions multimodality dataset☆23Updated 3 years ago
- Official implementation for the paper "Can Large Reasoning Models Self-Train?"☆71Updated 4 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Updated last year
- Official github repo of G-LLaVA☆148Updated 11 months ago
- Multimodal RewardBench☆61Updated 11 months ago
- ☆80Updated last year
- ☆65Updated 2 years ago
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]☆77Updated 7 months ago
- ☆64Updated this week
- ☆88Updated 2 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆211Updated last year