rhymes-ai / Aria
Codebase for Aria - an Open Multimodal Native MoE
☆787 Updated this week
Related projects
Alternatives and complementary repositories for Aria
- Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation ☆917 Updated last week
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation ☆675 Updated 3 months ago
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆809 Updated 4 months ago
- Agent S: an open agentic framework that uses computers like a human ☆571 Updated this week
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. ☆509 Updated last week
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ☆702 Updated 9 months ago
- Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling" ☆774 Updated 2 months ago
- ☆259 Updated last week
- ☆781 Updated 3 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆346 Updated last month
- OLMoE: Open Mixture-of-Experts Language Models ☆436 Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆328 Updated 4 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆755 Updated 2 weeks ago
- HPT - Open Multimodal LLMs from HyperGAI ☆312 Updated 5 months ago
- Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding ☆553 Updated last month
- Large Reasoning Models ☆492 Updated this week
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs ☆856 Updated last week
- Official repository for the paper PLLaVA ☆584 Updated 3 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆1,827 Updated 3 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆305 Updated last month
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas" ☆875 Updated last month
- LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs ☆1,462 Updated 2 weeks ago
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆478 Updated 5 months ago
- ☆920 Updated this week
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM ☆949 Updated 2 weeks ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆532 Updated 2 weeks ago
- Code for Quiet-STaR ☆641 Updated 2 months ago
- Next-Token Prediction is All You Need ☆1,793 Updated 2 weeks ago
- LLaVA-Interactive-Demo ☆351 Updated 3 months ago