thubZ09 / All-Things-MultimodalLinks

Hub for researchers exploring VLMs and Multimodal Learning:)

☆39

Alternatives and similar repositories for All-Things-Multimodal

Users that are interested in All-Things-Multimodal are comparing it to the libraries listed below

Sorting:

ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆57Updated this week
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆78Updated 6 months ago
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆41Updated last month
xjdr-alt / muzero_sketch
☆38Updated 11 months ago
isamu-isozaki / huggingface-reading-group
This repository's goal is to precompile all past presentations of the Huggingface reading group
☆48Updated 9 months ago
attentionmech / tensorlens
aesthetic tensor visualiser
☆24Updated 2 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆53Updated 4 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆45Updated 2 months ago
hkproj / multi-latent-attention
☆39Updated last month
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
open-thought / reasoning-gym-eval
Collection of LLM completions for reasoning-gym task datasets
☆24Updated last month
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
kmohan321 / Research_Papers
☆46Updated 2 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
brendanhogan / picoDeepResearch
☆63Updated last month
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 4 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 3 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 3 months ago
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated 7 months ago
YuvrajSingh-mist / SmolLlama
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…
☆15Updated 3 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 2 months ago
AnswerDotAI / minai
A miniture AI training framework for PyTorch
☆42Updated 4 months ago
okarthikb / state-space-models
☆27Updated 11 months ago
pacman100 / peft-codegen-25
☆23Updated last year
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆81Updated last year
UmerHA / quanting-notes
I learn about and explain quantization
☆26Updated last year
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆46Updated 3 months ago
Agora-Lab-AI / OmegaViT
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆14Updated this week
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆56Updated this week
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆37Updated last year