rhymes-ai / Aria
Codebase for Aria - an Open Multimodal Native MoE
☆1,007Updated last month
Alternatives and similar repositories for Aria:
Users that are interested in Aria are comparing it to the libraries listed below
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation☆725Updated 6 months ago
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.☆770Updated this week
- OLMoE: Open Mixture-of-Experts Language Models☆634Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,465Updated 3 months ago
- ☆2,266Updated last week
- Next-Token Prediction is All You Need☆2,017Updated 4 months ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,868Updated last month
- Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"☆827Updated 6 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆977Updated last month
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆882Updated 4 months ago
- ☆1,338Updated 3 months ago
- Large Reasoning Models☆800Updated 3 months ago
- Scalable RL solution for advanced reasoning of language models☆1,338Updated last week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,649Updated last month
- FastVideo is a lightweight framework for accelerating large video diffusion models.☆1,186Updated this week
- VPTQ, A Flexible and Extreme low-bit quantization algorithm☆593Updated this week
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a human☆821Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space☆926Updated last month
- [CVPR2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,025Updated this week
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆287Updated last week
- Rethinking Step-by-step Visual Reasoning in LLMs☆259Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆315Updated 3 weeks ago