thubZ09 / multimodal-researchLinks
Hub for researchers exploring VLMs and Multimodal Learning:)
☆58Updated last week
Alternatives and similar repositories for multimodal-research
Users that are interested in multimodal-research are comparing it to the libraries listed below
Sorting:
- This repository's goal is to precompile all past presentations of the Huggingface reading group☆48Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆250Updated last year
- ☆45Updated 6 months ago
- Fine tune Gemma 3 on an object detection task☆89Updated 4 months ago
- ⏰ AI conference deadline countdowns☆288Updated last week
- A collection of lightweight interpretability scripts to understand how LLMs think☆66Updated last week
- Notebooks for fine tuning pali gemma☆117Updated 7 months ago
- ☆46Updated 7 months ago
- This is the repository for brain state prediction using fMRI data and transformer.☆81Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- ☆86Updated last year
- Training-Ready RL Environments + Evals☆177Updated last week
- ☆182Updated 3 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆60Updated last year
- Train LLM on Hugging Face infra☆67Updated 2 weeks ago
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli☆290Updated 2 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆390Updated 2 weeks ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆561Updated 2 weeks ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Code for ExploreTom☆87Updated 5 months ago
- Open source interpretability artefacts for R1.☆163Updated 7 months ago
- An introduction to LLM Sampling☆79Updated 11 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 8 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 10 months ago
- aesthetic tensor visualiser☆27Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last month
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆139Updated last year