mlfoundations/Gelato

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mlfoundations/Gelato)

mlfoundations / Gelato

🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents

☆46

Alternatives and similar repositories for Gelato

Users that are interested in Gelato are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LAION-AI / scaling-laws-for-comparison
View on GitHub
☆22May 12, 2026Updated 2 months ago
jinhangzhan / RL_Heals_SFT
View on GitHub
☆21Mar 22, 2026Updated 4 months ago
Liyan06 / ChartMuseum
View on GitHub
[NeurIPS 2025] ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
☆24Apr 20, 2026Updated 3 months ago
cognitiveailab / BYTESIZED32
View on GitHub
Byte-sized text games for code generation tasks on virtual environments
☆20Jul 8, 2024Updated 2 years ago
Snektron / gpumode-amd-fp8-mm
View on GitHub
My submission for the GPUMODE/AMD fp8 mm challenge
☆29Jun 4, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
robinarmingaud / glidre
View on GitHub
Zero and Few-shot document level relation extraction / ⚠️ Development moved to: https://github.com/cea-list-lasti/glidre
☆18Mar 13, 2026Updated 4 months ago
Oluwakemi2000 / agentic-cybersecurity-architecture
View on GitHub
A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int…
☆22Jul 4, 2025Updated last year
qingy1337 / xplore-terminallm
View on GitHub
Allows two LLMs to communicate and run code in the terminal
☆28Dec 8, 2024Updated last year
OpenMOSS / Lorsa
View on GitHub
☆30Nov 9, 2025Updated 8 months ago
chetanxpatil / livnium
View on GitHub
Geometric AI research: a proven cube-math core, reusable vector-collapse dynamics, and reproducible experiments in embeddings, NLI, gener…
☆15Updated this week
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
SemplificaAI / gliner2-rs
View on GitHub
GLiNER2 Rust support
☆19May 23, 2026Updated 2 months ago
yuhui-zh15 / C3
View on GitHub
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆36Oct 16, 2024Updated last year
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
YanFangCS / GenLIP
View on GitHub
Official repo for "Let ViT Speak: Generative Language-Image Pre-training"
☆133Jun 10, 2026Updated last month
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
Keely-Ai / F2D2
View on GitHub
Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models
☆22Mar 5, 2026Updated 4 months ago
robbiemu / llama-gguf-optimize
View on GitHub
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆19Jan 10, 2025Updated last year
AstraBert / resume-matcher
View on GitHub
Match your resume with a job, effortlessly
☆29Apr 23, 2025Updated last year
learning-at-home / collaborative-latent-diffusion
View on GitHub
Collaborative inference of latent diffusion via hivemind
☆12May 29, 2023Updated 3 years ago
princeton-pli / DySCO
View on GitHub
DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs
☆17May 30, 2026Updated last month
ThomasVuNguyen / K
View on GitHub
Developing K - a language model to generate OPENSCAD code from prompt
☆19Dec 3, 2025Updated 7 months ago
ytgui / PilotANN
View on GitHub
Memory-Bounded GPU Acceleration for Vector Search
☆33Dec 29, 2025Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KHao123 / LaSe-E2V
View on GitHub
The source code for "LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction"
☆10Jul 5, 2024Updated 2 years ago
hcompai / hai-cookbook
View on GitHub
H.AI cookbook provides code examples and guides to help developers use models developed by H Company.
☆81Feb 20, 2026Updated 5 months ago
janhq / space-thinker
View on GitHub
☆21Mar 25, 2025Updated last year
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
kohjingyu / multi-agent-computer-use
View on GitHub
Code for the multi-agent computer use project.
☆21Jul 3, 2026Updated 3 weeks ago
IBM / analog-foundation-models
View on GitHub
Code for paper "Analog Foundation Models"
☆36Mar 25, 2026Updated 3 months ago
mlfoundations / VisIT-Bench
View on GitHub
☆51Oct 29, 2023Updated 2 years ago
ThomasVuNguyen / MakeMe
View on GitHub
Create 3D files in the CLI with Small Language Model
☆44Oct 15, 2025Updated 9 months ago
facebookresearch / DejaVu
View on GitHub
Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
☆36May 2, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SalesforceAIResearch / text2data
View on GitHub
☆22Jun 2, 2026Updated last month
princeton-pli / STAT
View on GitHub
Skill-Targeted Adaptive Training
☆24Mar 12, 2026Updated 4 months ago
wangys16 / GOV-NeSF
View on GitHub
☆10Oct 18, 2024Updated last year
mmhamdy / open-language-models
View on GitHub
A list of language models with permissive licenses such as MIT or Apache 2.0
☆25Feb 28, 2025Updated last year
Al-aminI / GraphMem
View on GitHub
Production-Grade Agent Memory Framework for Agentic AI
☆16Apr 15, 2026Updated 3 months ago
CogComp / APSI
View on GitHub
Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction
☆11Oct 19, 2020Updated 5 years ago
meganoob1337 / llama-swap-vllm-boilerplate
View on GitHub
Dynamic LLM model swapping system with Docker, vLLM integration, and GPU acceleration. Supports GGUF & Hugging Face models with automatic…
☆22Mar 6, 2026Updated 4 months ago