Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥
☆37Feb 4, 2025Updated last year
Alternatives and similar repositories for ai-multimodal-timeline
Users that are interested in ai-multimodal-timeline are comparing it to the libraries listed below
Sorting:
- ☆12Updated this week
- ☆14Jul 11, 2024Updated last year
- ☆11Jul 30, 2024Updated last year
- ☆14Nov 23, 2024Updated last year
- ☆14Jan 17, 2026Updated 2 months ago
- Disease Pattern Miner is a free, open-source mining framework for interactively discovering sequential disease patterns in medical health…☆12Mar 21, 2019Updated 6 years ago
- ☆26Nov 26, 2025Updated 3 months ago
- An implementation of "Oblique Photogrammetry Supporting Procedural Tree Modeling in Urban Areas" using python☆17Aug 29, 2023Updated 2 years ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated 11 months ago
- ☆10Jan 20, 2021Updated 5 years ago
- ☆10Aug 16, 2024Updated last year
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 9 months ago
- A powerful and efficient API service utilizing LangGraph Agent with real-time streaming tokens via Websocket, built on FastAPI.☆21Jul 8, 2024Updated last year
- We introduce DiffH2O, a diffusion-based framework to synthesize dexterous hand-object interactions. DiffH2O generates realistic hand-obje…☆33Nov 21, 2025Updated 3 months ago
- ☆15Nov 9, 2025Updated 4 months ago
- ☆14Nov 12, 2024Updated last year
- All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025☆19Mar 6, 2025Updated last year
- Gigo is an end-to-end platform for learning to code and developing your skills. Gigo won't drop you off after some basic syntax, buckle u…☆26Aug 16, 2024Updated last year
- This repository is the official implementation of ED-NeRF.☆12Apr 24, 2024Updated last year
- Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"☆24Mar 12, 2026Updated last week
- The code of “DreamFuse: Adaptive Image Fusion with Diffusion Transformer”.☆25Jul 25, 2025Updated 7 months ago
- ☆19Nov 18, 2025Updated 4 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆47Feb 29, 2024Updated 2 years ago
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆64Oct 12, 2024Updated last year
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆30Mar 9, 2026Updated last week
- ☆22Feb 1, 2025Updated last year
- Awesome-LLMs Resources☆13Nov 12, 2024Updated last year
- The official repository for DreamSampler (ECCV24)☆37Oct 11, 2024Updated last year
- ☆35Updated this week
- Give your AI coding assistants access to Raygun so they can investigate, explain, and help resolve errors for you.☆19Mar 2, 2026Updated 2 weeks ago
- ☆22Mar 2, 2024Updated 2 years ago
- Use Remote Functions to tokenize data with DLP in BigQuery using SQL☆23May 29, 2025Updated 9 months ago
- Retrieve simplified versions of webpages, powered by Mozilla's Readability.js☆15Oct 14, 2018Updated 7 years ago
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆35Nov 29, 2024Updated last year
- Web-based tool converts GitHub repository contents into a single formatted text file☆14Mar 11, 2025Updated last year
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 3 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- [CVPR 2025] MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning☆18Sep 20, 2025Updated 6 months ago
- Example Code to Supplement the Label Studio Blog☆32Jan 6, 2026Updated 2 months ago