kyegomez / SIMALinks
Pytorch Implementation of Deepmind's SIMA: "Scaling Instructable Agents Across Many Simulated Worlds"
☆17Updated last year
Alternatives and similar repositories for SIMA
Users that are interested in SIMA are comparing it to the libraries listed below
Sorting:
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"☆40Updated 9 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024 Spotlight)☆66Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automati…☆25Updated 4 months ago
- ☆29Updated last year
- A vast array of Multi-Modal Embodied Robotic Foundation Models!☆26Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated 2 weeks ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94Updated 2 years ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"☆93Updated last week
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 2 weeks ago
- A forest of autonomous agents.☆19Updated 7 months ago
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆26Updated this week
- ☆19Updated last year
- The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"☆34Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- Implementation of the premier Text to Video model from OpenAI☆56Updated 9 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 6 months ago
- Pytorch implementation of the Gato paper from Deepmind☆12Updated 2 years ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…☆12Updated 2 weeks ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆12Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆36Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 9 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 6 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆16Updated this week