The-Swarm-Corporation / Open-MAI-Dx-Orchestrator
An open source implementation of the paper "Sequential Diagnosis with Language Models" from Microsoft, built with the Swarms Framework
☆27Updated 2 weeks ago
Alternatives and similar repositories for Open-MAI-Dx-Orchestrator
Users who are interested in Open-MAI-Dx-Orchestrator are comparing it to the libraries listed below
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆93Updated last week
- ☆56Updated 8 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- This is the official page of WikiAutoGen, ICCV2025☆15Updated 3 weeks ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"☆84Updated last month
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆176Updated 3 weeks ago
- [ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs☆304Updated 2 months ago
- ☆61Updated 4 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆51Updated 6 months ago
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆73Updated last month
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆101Updated 9 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆102Updated 3 weeks ago
- ☆162Updated 2 months ago
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆21Updated 3 weeks ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆206Updated 6 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆55Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆131Updated 8 months ago
- The official GitHub Page for MiniMax☆48Updated 2 weeks ago
- Pixel-Level Reasoning Model trained with RL☆170Updated 3 weeks ago
- Visual Planning: Let's Think Only with Images☆258Updated 2 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆230Updated 3 weeks ago
- This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"☆169Updated 5 months ago
- [Fully open] [Encoder-free MLLM] Vision as LoRA☆316Updated last month
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆71Updated 7 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆86Updated 2 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆69Updated last week
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆125Updated 8 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆67Updated last month