thubZ09 / multimodal-researchLinks

Hub for researchers exploring VLMs and Multimodal Learning:)

☆59

Alternatives and similar repositories for multimodal-research

Users that are interested in multimodal-research are comparing it to the libraries listed below

Sorting:

huggingface / ai-deadlines
⏰ AI conference deadline countdowns
☆293Updated this week
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆92Updated 5 months ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆252Updated last year
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 8 months ago
alexiglad / EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆565Updated last month
facebookresearch / ExploreToM
Code for ExploreTom
☆89Updated 5 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆70Updated last week
silvaxxx1 / MyLLM
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆141Updated last month
groundlight / r1_vlm
Build your own visual reasoning model
☆415Updated 3 weeks ago
hkproj / multi-latent-attention
☆45Updated 6 months ago
cornstarch-org / Cornstarch
☆113Updated 3 months ago
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆57Updated 6 months ago
kmohan321 / Research_Papers
☆46Updated 8 months ago
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆149Updated 2 months ago
syf0122 / brain_state_pred
This is the repository for brain state prediction using fMRI data and transformer.
☆81Updated last year
isamu-isozaki / huggingface-reading-group
This repository's goal is to precompile all past presentations of the Huggingface reading group
☆48Updated last year
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆354Updated 5 months ago
arpita8 / Awesome-Mixture-of-Experts-Papers
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
☆139Updated last year
janhq / visual-thinker
☆183Updated 3 weeks ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 9 months ago
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆111Updated 8 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆165Updated 7 months ago
attentionmech / tensorlens
aesthetic tensor visualiser
☆27Updated 7 months ago
SakanaAI / AI-Scientist-ICLR2025-Workshop-Experiment
☆276Updated 8 months ago
ShadeAlsha / ICon
ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"
☆118Updated 5 months ago
NVlabs / RLP
RLP: Reinforcement as a Pretraining Objective
☆213Updated 2 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆84Updated last year
VsonicV / es-fine-tuning-paper
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
☆277Updated 3 weeks ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆123Updated 4 months ago
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆167Updated 3 months ago