Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥
☆38Feb 4, 2025Updated last year
Alternatives and similar repositories for ai-multimodal-timeline
Users that are interested in ai-multimodal-timeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] Official repository for "Reviving ConvNeXt for Efficient Convolutional Diffusion Models"☆71Mar 26, 2026Updated 3 months ago
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction.☆23May 25, 2026Updated last month
- This tutorial introduces how to integrate C code into a Rust project, how to use Rust to compile dynamic and static libraries and how to …☆16May 18, 2025Updated last year
- ☆14Jul 11, 2024Updated last year
- ☆26Nov 26, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Procedural generation of random urban landscapes.☆20Oct 5, 2020Updated 5 years ago
- ☆15Jan 25, 2024Updated 2 years ago
- Code and dataset for the paper "Text2City: One-Stage Text-Driven Urban Layout Regeneration"☆15Jun 27, 2024Updated 2 years ago
- [ACM MM 2025] Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception☆13Apr 18, 2026Updated 2 months ago
- A powerful and efficient API service utilizing LangGraph Agent with real-time streaming tokens via Websocket, built on FastAPI.☆21Jul 8, 2024Updated last year
- Procedural map generation with GANs.☆19Aug 25, 2021Updated 4 years ago
- We introduce DiffH2O, a diffusion-based framework to synthesize dexterous hand-object interactions. DiffH2O generates realistic hand-obje…☆43Nov 21, 2025Updated 7 months ago
- An open source MCP proxy.☆18Jan 3, 2025Updated last year
- ☆17Nov 9, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆38Jun 12, 2025Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆56Oct 13, 2025Updated 8 months ago
- ☆14Nov 12, 2024Updated last year
- ☆15Dec 20, 2020Updated 5 years ago
- ☆12Mar 24, 2021Updated 5 years ago
- All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025☆21Mar 6, 2025Updated last year
- Automated agent using LangChain and Gmail API to classify and respond to incoming emails based on their content.☆14Oct 12, 2024Updated last year
- Tokun to can tokens☆17Jun 19, 2025Updated last year
- Add function calling to text-generation-inference☆13Oct 10, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is the official implementation of ED-NeRF.☆12Apr 24, 2024Updated 2 years ago
- The code of “DreamFuse: Adaptive Image Fusion with Diffusion Transformer”.☆29Jul 25, 2025Updated 11 months ago
- ☆19Nov 18, 2025Updated 7 months ago
- Demonstration of a web interface for inferring facebook/seamless-m4t-v2-large model via API calls, using Flask as the backend server.☆10Jan 23, 2024Updated 2 years ago
- a repository containing the details of natural language inference dataset in Hindi☆14Dec 28, 2020Updated 5 years ago
- This is a chatbot built using Gradio that can access Google Search and webpages to answer questions. Supports GPT-3.5, GPT-4, Claude 2, …☆13Aug 31, 2023Updated 2 years ago
- Internal diffusion for video inpainting☆16May 19, 2025Updated last year
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated 2 years ago
- ☆24Feb 1, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"☆30Mar 12, 2026Updated 3 months ago
- Zignite is a Cross-platform graphics engine built with Zig, featuring WebGPU rendering using GLFW for window management. It has WebAssemb…☆41Jul 5, 2025Updated 11 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition☆30Dec 3, 2025Updated 7 months ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆48Jun 1, 2026Updated last month
- HippoRAG implementation using APIs☆14Jun 6, 2024Updated 2 years ago
- [ICCV 2025] LIRA☆22Nov 25, 2025Updated 7 months ago
- ☆15Mar 24, 2021Updated 5 years ago