thubZ09 / vision-language-model-research
Hub for researchers exploring VLMs and Multimodal Learning:)
☆61 Updated last week
Alternatives and similar repositories for vision-language-model-research
Users who are interested in vision-language-model-research are comparing it to the repositories listed below.
- ⏰ AI conference deadline countdowns ☆320 Updated 2 weeks ago
- This repository's goal is to precompile all past presentations of the Huggingface reading group ☆48 Updated last year
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective ☆226 Updated this week
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆120 Updated 2 years ago
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli ☆295 Updated 4 months ago
- Fine tune Gemma 3 on an object detection task ☆96 Updated 6 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆89 Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆150 Updated 3 months ago
- ☆114 Updated 4 months ago
- This is the repository for brain state prediction using fMRI data and transformer. ☆81 Updated last year
- From scratch implementation of a vision language model in pure PyTorch ☆253 Updated last year
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University ☆294 Updated 2 weeks ago
- Code for ExploreTom ☆90 Updated 7 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. ☆140 Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month
- Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau ☆174 Updated last month
- ☆46 Updated 9 months ago
- ☆45 Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆63 Updated 9 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆127 Updated 3 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆121 Updated 2 years ago
- Build your own visual reasoning model ☆417 Updated 2 weeks ago
- minimal GRPO implementation from scratch ☆102 Updated 10 months ago
- Open source interpretability artefacts for R1. ☆169 Updated 9 months ago
- An introduction to LLM Sampling ☆79 Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆126 Updated 7 months ago
- Notebooks for fine tuning pali gemma ☆117 Updated 9 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆85 Updated last year
- ☆29 Updated last year
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning ☆584 Updated 2 months ago