joshbickett / multimodal-gamer
A framework to enable multimodal models to play games on a computer.
☆97 · Updated last year
Alternatives and similar repositories for multimodal-gamer
Users who are interested in multimodal-gamer are comparing it to the libraries listed below.
- Build your Swarm of Internet Agents using MultiOn ☆78 · Updated last year
- The next evolution of Agents ☆48 · Updated last week
- A Python package to dynamically load functions for OpenAI Assistant ☆54 · Updated last year
- ☆114 · Updated 5 months ago
- An automated tool for discovering insights from research paper corpora ☆138 · Updated 11 months ago
- ☆28 · Updated last year
- The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data) ☆1 · Updated 5 months ago
- ☆80 · Updated last year
- Improve your questions! The AI for Inquiry - QuestionImprover Agent is an LLM-driven "tool for thought" designed to enhance the depth and… ☆146 · Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 · Updated 4 months ago
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate ☆88 · Updated last year
- A continuously learning web-browsing AI agent that generalizes the Voyager architecture. ☆38 · Updated last year
- Transcribe and summarize videos using Whisper and LLMs on the Apple MLX framework ☆74 · Updated last year
- Starter app for creating an AI task completion agent with gmail capabilities. ☆27 · Updated 11 months ago
- auto fine tune of models with synthetic data ☆75 · Updated last year
- A framework for orchestrating AI agents using a mermaid graph ☆75 · Updated last year
- The open-source autonomous agent LLM initiative ☆91 · Updated last year
- CAMEL framework-based multi-agent system for task-driven and dynamic environments ☆94 · Updated last year
- Claude API Test Project ☆87 · Updated last year
- PolyGPT: An Overview of Agent-Based System Architecture for Autonomous Business Operations ☆83 · Updated last year
- ☆163 · Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V. ☆133 · Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ☆85 · Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 · Updated last year
- CLAIRe: Conversational Learning AI with Recall ☆67 · Updated last year
- GPT-4 Vision Chrome Extension ☆109 · Updated last year
- ☆86 · Updated 8 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications … ☆112 · Updated last year
- An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents ☆191 · Updated last year
- ☆36 · Updated last year