Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement
☆17Nov 11, 2024Updated last year
Alternatives and similar repositories for MultiModal-ToT
Users that are interested in MultiModal-ToT are comparing it to the libraries listed below
Sorting:
- A forest of autonomous agents.☆20Jan 27, 2025Updated last year
- A simple reproducible template to implement AI research papers☆24Sep 9, 2024Updated last year
- Automate your blogging with AI-powered tools for creating, optimizing, and deploying content. Generate SEO-optimized articles effortlessl…☆12Aug 16, 2024Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Mar 11, 2024Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 5 months ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 9 months ago
- AthenaOS is a next generation AI-native operating system managed by Swarms of AI Agents☆36Jul 18, 2023Updated 2 years ago
- multi agent team with coding and data analysis capability to structure real estate investment plans and help with decision making.☆15Jun 11, 2024Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- A mini-app to solve the heat conduction equation☆15Jul 1, 2020Updated 5 years ago
- Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…☆15Dec 6, 2024Updated last year
- Ultra Fast Multi-Modality Vector Database☆18Feb 21, 2024Updated 2 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 5 months ago
- This repo hosts the code for the Fast Trainable Projection (FTP) project.☆12Nov 16, 2023Updated 2 years ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆146Jun 20, 2024Updated last year
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆23Aug 19, 2025Updated 7 months ago
- Inference Llama 2 in one file of pure C. Nahh wait, now fresh in Julia!☆25Aug 2, 2023Updated 2 years ago
- mit6.830 all-pass☆12Mar 25, 2022Updated 3 years ago
- My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆26Jan 16, 2024Updated 2 years ago
- ☆16Jul 2, 2022Updated 3 years ago
- ☆10Jul 13, 2024Updated last year
- ☆19Aug 15, 2018Updated 7 years ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆23Jun 16, 2025Updated 9 months ago
- Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%☆16Jun 20, 2023Updated 2 years ago
- Portable auto-vectorizable n-body benchmark☆20Feb 25, 2026Updated 3 weeks ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- The Swarm Ecosystem☆28Aug 1, 2024Updated last year
- A Monte Carlo Neutron Transport Mini-App☆15Apr 15, 2019Updated 6 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Feb 16, 2026Updated last month
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons☆98Mar 15, 2026Updated last week
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.☆22Jul 16, 2021Updated 4 years ago
- ☆33Jan 30, 2026Updated last month
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 5 months ago
- ☆24Oct 23, 2023Updated 2 years ago
- The AI Blog Article Generator is a Python-based tool that utilizes the Cohere API to generate high-quality, SEO-optimized blog articles. …☆20Feb 25, 2026Updated 3 weeks ago
- The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"☆21Jun 5, 2025Updated 9 months ago