M4 experiment logbook
☆58Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for m4-logs
Users that are interested in m4-logs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multimodal language model benchmark, featuring challenging examples☆187Dec 18, 2024Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆214Feb 27, 2024Updated 2 years ago
- sketching algorithms implemented in chapel and python☆10Jun 8, 2017Updated 8 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python interface and preprocessing pipeline for the BBBC021 dataset of cellular images☆14Sep 19, 2021Updated 4 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆214Aug 28, 2024Updated last year
- Histopathology Feature Extractors (2024)☆14Jun 14, 2024Updated last year
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆15Feb 12, 2024Updated 2 years ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆59Feb 23, 2026Updated 3 months ago
- ☆10Sep 25, 2019Updated 6 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆56Mar 9, 2025Updated last year
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆162Mar 30, 2026Updated 2 months ago
- ☆29Sep 2, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆14Mar 6, 2025Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆422May 5, 2025Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- ☆13Aug 13, 2025Updated 9 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- Microsoft Automatic Mixed Precision Library☆636Dec 1, 2025Updated 5 months ago
- Aligning LMMs with Factually Augmented RLHF☆396Nov 1, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 3 years ago
- ☆35Aug 30, 2021Updated 4 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- ☆55Apr 1, 2024Updated 2 years ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- a set of tools for computer vision processing☆18Jul 9, 2016Updated 9 years ago
- ☆27Jul 6, 2024Updated last year
- ☆31Nov 17, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An open-source framework for training large multimodal models.☆4,102Aug 31, 2024Updated last year
- Emergent Communication Pretraining for Few-Shot Machine Translation☆13Dec 3, 2020Updated 5 years ago
- Unofficial PyTorch implementation of DALL-E 2 by OpenAI☆10Apr 6, 2022Updated 4 years ago
- ☆17Feb 20, 2023Updated 3 years ago
- ☆16Aug 19, 2024Updated last year
- ☆88Aug 18, 2024Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago