M4 experiment logbook
☆58 Aug 21, 2023 Updated 2 years ago
Alternatives and similar repositories for m4-logs
Users that are interested in m4-logs are comparing it to the libraries listed below
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆46 Jul 17, 2024 Updated last year
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing" ☆13 Jun 1, 2022 Updated 3 years ago
- sketching algorithms implemented in chapel and python ☆10 Jun 8, 2017 Updated 8 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP. ☆15 May 3, 2021 Updated 4 years ago
- ☆10 Dec 12, 2023 Updated 2 years ago
- ☆33 Mar 1, 2023 Updated 3 years ago
- [IJCV 2026] Official Code for "PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds" ☆70 Feb 12, 2026 Updated 3 weeks ago
- ☆18 Apr 19, 2024 Updated last year
- ☆34 Aug 30, 2021 Updated 4 years ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆15 Feb 12, 2024 Updated 2 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale ☆213 Feb 27, 2024 Updated 2 years ago
- get direct answers in google using LLMs ☆18 Apr 12, 2023 Updated 2 years ago
- Blog post ☆17 Feb 16, 2024 Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 Apr 4, 2024 Updated last year
- Multimodal language model benchmark, featuring challenging examples ☆185 Dec 18, 2024 Updated last year
- A bilingual dataset for image captioning ☆19 Oct 28, 2020 Updated 5 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆22 Dec 4, 2024 Updated last year
- ☆25 Jun 22, 2023 Updated 2 years ago
- ☆47 Aug 19, 2021 Updated 4 years ago
- Index of URLs to pdf files all over the internet and scripts ☆25 May 2, 2023 Updated 2 years ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning ☆134 Jun 20, 2023 Updated 2 years ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d… ☆211 Aug 28, 2024 Updated last year
- Microsoft Automatic Mixed Precision Library ☆634 Dec 1, 2025 Updated 3 months ago
- ☆55 Apr 1, 2024 Updated last year
- ☆27 Mar 21, 2024 Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition ☆26 Jul 25, 2023 Updated 2 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks ☆60 Nov 24, 2022 Updated 3 years ago
- ☆27 Jul 6, 2024 Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models ☆51 Jan 25, 2024 Updated 2 years ago
- Code for T-MARS data filtering ☆35 Aug 23, 2023 Updated 2 years ago
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral) ☆65 Dec 9, 2023 Updated 2 years ago
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe ☆148 Feb 23, 2026 Updated 2 weeks ago
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters" ☆32 Feb 6, 2026 Updated last month
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop) ☆29 Jan 1, 2024 Updated 2 years ago
- This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for E… ☆548 Feb 12, 2026 Updated 3 weeks ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine" ☆73 Mar 6, 2024 Updated 2 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024) ☆34 Oct 16, 2024 Updated last year
- ☆37 May 28, 2023 Updated 2 years ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text ☆413 May 5, 2025 Updated 10 months ago