mlfoundations / MINT-1TLinks
MINT-1T: A one trillion token multimodal interleaved dataset.
☆817Updated 11 months ago
Alternatives and similar repositories for MINT-1T
Users that are interested in MINT-1T are comparing it to the libraries listed below
Sorting:
- [ICML 2024] CLLMs: Consistency Large Language Models☆395Updated 7 months ago
- Code release for "LLMs can see and hear without any training"☆445Updated last month
- ☆614Updated last year
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation☆771Updated 2 weeks ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆883Updated 2 months ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆748Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,385Updated last year
- OLMoE: Open Mixture-of-Experts Language Models☆792Updated 3 months ago
- This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for E…☆452Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆999Updated last week
- Reference implementation of Megalodon 7B model☆520Updated last month
- HPT - Open Multimodal LLMs from HyperGAI☆316Updated last year
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,549Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆654Updated last year
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)☆723Updated 8 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆734Updated 9 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,305Updated 2 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆643Updated 3 weeks ago
- LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer☆382Updated 2 months ago
- PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"☆639Updated last year
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆932Updated 3 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆983Updated 11 months ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,466Updated 3 months ago
- LLM Analytics☆668Updated 8 months ago
- LIMO: Less is More for Reasoning☆965Updated 2 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆641Updated 3 weeks ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…☆1,059Updated last week
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads☆468Updated 4 months ago
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"☆859Updated last month
- Codebase for Aria - an Open Multimodal Native MoE☆1,056Updated 5 months ago