A self-contained multimodal AI agents lab built using MongoDB, Gemini and LangGraph.
☆68Sep 23, 2025Updated 7 months ago
Alternatives and similar repositories for multimodal-agents-lab
Users that are interested in multimodal-agents-lab are comparing it to the libraries listed below.
- ☆10Apr 21, 2026Updated last month
- GitHub Action for building an ARM Template from Bicep☆13Jun 18, 2022Updated 3 years ago
- ☆26Mar 22, 2026Updated 2 months ago
- Creates an Azure AI Studio hub, project and required dependent resources including Azure Open AI Service, Cognitive Search and more.☆33Oct 2, 2024Updated last year
- Basic codes of ml☆13Dec 2, 2019Updated 6 years ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- Multimodal Genuine Emotion and Expression Detection database☆12Jul 15, 2024Updated last year
- Samples, quickstarts, and developer resources for Azure Durable Task Scheduler — build reliable, fault-tolerant workflows with Durable Fu…☆54May 14, 2026Updated last week
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 4 months ago
- GeneticFS is a library for feature selection in Machine Learning using a Genetic Algorithm as an optimisation method.☆20Oct 8, 2019Updated 6 years ago
- Starter template with Bicep as infrastructure provider for Azure Developer CLI (azd).☆37Mar 27, 2026Updated last month
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- Generative AI Ops RAG project template☆43Apr 21, 2026Updated last month
- Building a multi-agent RAG system with advanced RAG methods☆13Jan 12, 2025Updated last year
- ☆13Feb 5, 2025Updated last year
- ☆29Mar 30, 2026Updated last month
- ☆31Apr 5, 2025Updated last year
- An agent built on the InternLM chat 7B base model that can call MMYOLO tools to complete visual tasks within images☆11Oct 30, 2024Updated last year
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- Bengali transformer using transformers☆22Apr 29, 2025Updated last year
- It is based on Image Processing. It has been implemented with "Python3" and "OpenCv3.3.0"☆19Oct 1, 2019Updated 6 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆27Aug 8, 2025Updated 9 months ago
- A browser based CadQuery server☆13Feb 18, 2025Updated last year
- ☆23Apr 7, 2020Updated 6 years ago
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆29Aug 15, 2025Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Assistant API to chat with tabular data and perform analytics in natural language.☆56Aug 30, 2024Updated last year
- SDXL API provides a seamless interface for image generation and retrieval using Stable Diffusion XL integrated with Cloudflare AI Workers…☆13Feb 29, 2024Updated 2 years ago
- Bangla Unicode Normalization☆23May 26, 2024Updated last year
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆14Feb 12, 2024Updated 2 years ago
- EEG-Audio-Video Dataset for Emotion Recognition in Conversations☆44Feb 3, 2026Updated 3 months ago
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆58Oct 7, 2025Updated 7 months ago
- An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.☆31Jun 26, 2022Updated 3 years ago
- project page of "VAD v2: LLM-Like Probabilistic Modeling in End-to-End Autonomous Driving"☆11Apr 9, 2026Updated last month
- GPT 4 Vision + TTS multimodal capabilities demo☆17Nov 15, 2023Updated 2 years ago
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- Closed-loop evaluation for end-to-end VLM autonomous driving agent☆25Mar 8, 2025Updated last year