huggingface / m4-logs
M4 experiment logbook
☆56 · Updated last year
Related projects
Alternatives and complementary repositories for m4-logs
- LL3M: Large Language and Multi-Modal Model in Jax ☆64 · Updated 6 months ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d… ☆186 · Updated 2 months ago
- Multimodal language model benchmark, featuring challenging examples ☆148 · Updated 2 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆151 · Updated 7 months ago
- ☆64 · Updated last year
- Language models scale reliably with over-training and on downstream tasks ☆94 · Updated 7 months ago
- ☆62 · Updated last month
- E5-V: Universal Embeddings with Multimodal Large Language Models ☆167 · Updated 3 months ago
- Python Library to evaluate VLM models' robustness across diverse benchmarks ☆168 · Updated last week
- VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning ☆85 · Updated last month
- Language Quantized AutoEncoders ☆94 · Updated last year
- Big-Interleaved-Dataset ☆57 · Updated last year
- ☆50 · Updated last month
- Index of URLs to pdf files all over the internet and scripts ☆21 · Updated last year
- Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image … ☆51 · Updated 3 weeks ago
- ☆72 · Updated 4 months ago
- ☆45 · Updated last year
- Matryoshka Multimodal Models ☆81 · Updated last month
- A huge dataset for Document Visual Question Answering ☆13 · Updated 3 months ago
- ☆64 · Updated 4 months ago
- Implementation of the DeepMind Flamingo vision-language model, based on Hugging Face language models and ready for training ☆164 · Updated last year
- ☆71 · Updated 6 months ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md ☆23 · Updated last year
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… ☆107 · Updated 4 months ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google DeepMind, in PyTorch ☆88 · Updated 10 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆91 · Updated last month
- ☆83 · Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆115 · Updated 2 weeks ago
- ☆86 · Updated 9 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆62 · Updated 2 months ago