thubZ09 / multimodal-research
Hub for researchers exploring VLMs and Multimodal Learning:)
☆59 Updated this week
Alternatives and similar repositories for multimodal-research
Users who are interested in multimodal-research are comparing it to the libraries listed below.
- ⏰ AI conference deadline countdowns ☆293 Updated this week
- Fine tune Gemma 3 on an object detection task ☆92 Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorch ☆252 Updated last year
- Notebooks for fine tuning pali gemma ☆117 Updated 8 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning ☆565 Updated last month
- Code for ExploreTom ☆89 Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆70 Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month
- Build your own visual reasoning model ☆415 Updated 3 weeks ago
- ☆45 Updated 6 months ago
- ☆113 Updated 3 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind ☆57 Updated 6 months ago
- ☆46 Updated 8 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆149 Updated 2 months ago
- This is the repository for brain state prediction using fMRI data and transformer. ☆81 Updated last year
- This repository's goal is to precompile all past presentations of the Huggingface reading group ☆48 Updated last year
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆354 Updated 5 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. ☆139 Updated last year
- ☆183 Updated 3 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆108 Updated 9 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆111 Updated 8 months ago
- Open source interpretability artefacts for R1. ☆165 Updated 7 months ago
- aesthetic tensor visualiser ☆27 Updated 7 months ago
- ☆276 Updated 8 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆118 Updated 5 months ago
- RLP: Reinforcement as a Pretraining Objective ☆213 Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆277 Updated 3 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆123 Updated 4 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆167 Updated 3 months ago