A self-contained multimodal AI agents lab built using MongoDB, Gemini and LangGraph.
☆68Sep 23, 2025Updated 8 months ago
Alternatives and similar repositories for multimodal-agents-lab
Users that are interested in multimodal-agents-lab are comparing it to the libraries listed below.
- ☆10Apr 21, 2026Updated last month
- GitHub Action for building an ARM Template from Bicep☆13Jun 18, 2022Updated 3 years ago
- ☆26Mar 22, 2026Updated 2 months ago
- Creates an Azure AI Studio hub, project and required dependent resources including Azure Open AI Service, Cognitive Search and more.☆33Oct 2, 2024Updated last year
- A PyTorch implementation of A Deep Learning approach to Template Matching. Uses Hypernet + VGG to match the templates.☆12Dec 18, 2021Updated 4 years ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- The code of the paper: M. Karami, “HiGen: Hierarchical Graph Generative Networks”, arXiv preprint arxiv:2305.19337☆10Apr 9, 2024Updated 2 years ago
- ☆11Mar 1, 2019Updated 7 years ago
- Vehicle number plate detection using YOLO and OCR text extraction☆14Feb 8, 2020Updated 6 years ago
- Samples, quickstarts, and developer resources for Azure Durable Task Scheduler — build reliable, fault-tolerant workflows with Durable Fu…☆56Updated this week
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 4 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated last year
- Building a multi-agent RAG system with advanced RAG methods☆13Jan 12, 2025Updated last year
- ☆13Feb 5, 2025Updated last year
- ☆65Feb 5, 2024Updated 2 years ago
- ☆29Mar 30, 2026Updated 2 months ago
- ☆31Apr 5, 2025Updated last year
- The final coursework for AI in Mental Health @ PKU.☆22Jan 5, 2024Updated 2 years ago
- Builds an agent on the InternLM chat 7B base model that can call MMYOLO tools to complete visual tasks on images☆11Oct 30, 2024Updated last year
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆16Jun 1, 2026Updated last week
- Bengali transformer using transformers☆22Apr 29, 2025Updated last year
- It is based on Image Processing. It has been implemented with "Python3" and "OpenCv3.3.0"☆19Oct 1, 2019Updated 6 years ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆128Jan 22, 2026Updated 4 months ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆27Jun 4, 2026Updated last week
- A browser based CadQuery server☆13Feb 18, 2025Updated last year
- CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors☆26Jun 2, 2022Updated 4 years ago
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆29Aug 15, 2025Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Assistant API to chat with tabular data and perform analytics in natural language.☆56Aug 30, 2024Updated last year
- SDXL API provides a seamless interface for image generation and retrieval using Stable Diffusion XL integrated with Cloudflare AI Workers…☆13Feb 29, 2024Updated 2 years ago
- Bangla Unicode Normalization☆23May 26, 2024Updated 2 years ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆14Feb 12, 2024Updated 2 years ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Jan 29, 2024Updated 2 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks☆30Jun 22, 2022Updated 3 years ago
- mouse pet-ct image segmentation☆12Feb 19, 2023Updated 3 years ago
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆58Oct 7, 2025Updated 8 months ago
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆53May 11, 2026Updated last month
- GPT 4 Vision + TTS multimodal capabilities demo☆17Nov 15, 2023Updated 2 years ago
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago