thubZ09 / multimodal-researchLinks
Hub for researchers exploring VLMs and Multimodal Learning:)
☆51Updated this week
Alternatives and similar repositories for multimodal-research
Users that are interested in multimodal-research are comparing it to the libraries listed below
Sorting:
- This repository's goal is to precompile all past presentations of the Huggingface reading group☆48Updated last year
- ⏰ AI conference deadline countdowns☆285Updated 2 weeks ago
- Fine tune Gemma 3 on an object detection task☆87Updated 3 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆61Updated this week
- From scratch implementation of a vision language model in pure PyTorch☆246Updated last year
- Code for ExploreTom☆86Updated 4 months ago
- ☆45Updated 5 months ago
- An introduction to LLM Sampling☆79Updated 10 months ago
- Build your own visual reasoning model☆413Updated 3 weeks ago
- ☆46Updated 7 months ago
- ☆178Updated 2 months ago
- Notebooks for fine tuning pali gemma☆117Updated 6 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆109Updated 3 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated 2 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆134Updated 3 weeks ago
- This is the repository for brain state prediction using fMRI data and transformer.☆81Updated last year
- minimal GRPO implementation from scratch☆98Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆79Updated 7 months ago
- ☆110Updated last month
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆163Updated 2 months ago
- rl from zero pretrain, can it be done? yes.☆279Updated last month
- RLP: Reinforcement as a Pretraining Objective☆195Updated 3 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 7 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 7 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 9 months ago
- Training-Ready RL Environments + Evals☆158Updated this week
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 5 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year