thubZ09 / All-Things-MultimodalLinks
Hub for researchers exploring VLMs and Multimodal Learning:)
☆37Updated this week
Alternatives and similar repositories for All-Things-Multimodal
Users that are interested in All-Things-Multimodal are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆46Updated this week
- ☆36Updated 2 weeks ago
- rl from zero pretrain, can it be done? we'll see.☆24Updated last week
- Set of scripts to finetune LLMs☆37Updated last year
- Build Agentic workflows with function calling using open LLMs☆26Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- ☆23Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆67Updated 2 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- A competition to get you started on the NeurIPS AI Hackercup☆28Updated 8 months ago
- Simple repository for training small reasoning models☆31Updated 4 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- ☆49Updated 7 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆45Updated last month
- ☆59Updated 2 weeks ago
- working implimention of deepseek MLA☆41Updated 4 months ago
- Collection of autoregressive model implementation☆85Updated last month
- I learn about and explain quantization☆26Updated last year
- Lego for GRPO☆28Updated last week
- alternative way to calculating self attention☆18Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- ☆38Updated 10 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆61Updated 3 weeks ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated last year
- Collection of resources for RL and Reasoning☆25Updated 4 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- Coding an LLM and its building blocks from scratch.☆38Updated 2 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year