thubZ09 / multimodal-researchLinks
Hub for researchers exploring VLMs and Multimodal Learning:)
☆48Updated last week
Alternatives and similar repositories for multimodal-research
Users that are interested in multimodal-research are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆85Updated 2 months ago
- This repository's goal is to precompile all past presentations of the Huggingface reading group☆48Updated last year
- ⏰ AI conference deadline countdowns☆283Updated 2 weeks ago
- Notebooks for fine tuning pali gemma☆117Updated 5 months ago
- ☆28Updated last year
- RLP: Reinforcement as a Pretraining Objective☆155Updated last week
- ☆45Updated 4 months ago
- ☆106Updated last month
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last week
- From scratch implementation of a vision language model in pure PyTorch☆243Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆134Updated this week
- ☆142Updated last month
- Training-Ready RL Environments + Evals☆121Updated this week
- An introduction to LLM Sampling☆79Updated 9 months ago
- ☆38Updated last year
- ☆177Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆111Updated 3 months ago
- Code for ExploreTom☆86Updated 3 months ago
- A competition to get you started on the NeurIPS AI Hackercup☆29Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆55Updated 7 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 5 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆54Updated 5 months ago
- ☆32Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆116Updated 2 months ago
- This is the repository for brain state prediction using fMRI data and transformer.☆81Updated last year
- aesthetic tensor visualiser☆27Updated 5 months ago
- ☆88Updated last week
- ☆135Updated 6 months ago