superagi / Veagle
Enhancement in Multimodal Representation Learning.
☆38Updated 6 months ago
Related projects: ⓘ
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆73Updated 2 months ago
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆93Updated last month
- ☆35Updated last year
- code for Optimus-1☆19Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆36Updated 5 months ago
- The Official Code Repository for GUI-World.☆33Updated last month
- ☆74Updated 9 months ago
- Flow of Reasoning: Efficient Training of LLM Policy with Diverse Thinking☆25Updated this week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆81Updated last month
- ☆38Updated 4 months ago
- ☆50Updated 2 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆75Updated 11 months ago
- ☆65Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆111Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- ☆37Updated last month
- Finetune any model on HF in less than 30 seconds☆56Updated last week
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 8 months ago
- A repository for research on medium sized language models.☆71Updated 3 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆23Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆32Updated 7 months ago
- Official implementation of ECCV24 paper: POA☆23Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- ☆12Updated 5 months ago
- ☆24Updated last week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆29Updated 7 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆30Updated last month
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆53Updated 3 months ago
- ☆62Updated 5 months ago
- ☆54Updated 8 months ago