thubZ09 / all-things-multimodalLinks

Hub for researchers exploring VLMs and Multimodal Learning:)

☆46

Alternatives and similar repositories for all-things-multimodal

Users that are interested in all-things-multimodal are comparing it to the libraries listed below

Sorting:

huggingface / ai-deadlines
⏰ AI conference deadline countdowns
☆280Updated last week
cornstarch-org / Cornstarch
☆103Updated last week
isamu-isozaki / huggingface-reading-group
This repository's goal is to precompile all past presentations of the Huggingface reading group
☆48Updated last year
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆82Updated 2 months ago
facebookresearch / ExploreToM
Code for ExploreTom
☆86Updated 2 months ago
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆146Updated 3 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 5 months ago
hkproj / multi-latent-attention
☆44Updated 3 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆116Updated last month
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 9 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆84Updated last year
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆49Updated 4 months ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆239Updated last year
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆96Updated last month
jacobmarks / awesome-neurips-2023
Conference schedule, top papers, and analysis of the data for NeurIPS 2023!
☆120Updated last year
silvaxxx1 / MyLLM
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆127Updated this week
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆96Updated 6 months ago
wandb / aihackercup
A competition to get you started on the NeurIPS AI Hackercup
☆29Updated 11 months ago
alexiglad / EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆492Updated last week
arpita8 / Awesome-Mixture-of-Experts-Papers
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
☆133Updated last year
ShadeAlsha / ICon
ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"
☆111Updated 2 months ago
wolfecameron / lora_instruction_tune
☆40Updated last year
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆38Updated last year
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆362Updated this week
LitLLM / litllms-for-literature-review-tmlr
Code for LitLLMs, LLMs for Literature Review: Are we there yet? (TMLR 2025)
☆38Updated 4 months ago
menloresearch / visual-thinker
☆175Updated last month
HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖
☆75Updated 9 months ago
kmohan321 / Research_Papers
☆46Updated 5 months ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆77Updated 5 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆172Updated 8 months ago