neulab / MultiUI
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
☆37Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for MultiUI
- ☆44Updated last month
- ☆35Updated last year
- The Official Code Repository for GUI-World.☆37Updated 3 months ago
- ☆57Updated last month
- Official Repo for UGround☆93Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆37Updated 6 months ago
- FuseAI Project☆76Updated 2 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆46Updated 3 weeks ago
- DPO, but faster 🚀☆21Updated 2 weeks ago
- ☆55Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆116Updated last month
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 10 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆55Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 9 months ago
- ☆41Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆79Updated this week
- ☆40Updated this week
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging☆52Updated last month
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆42Updated 4 months ago
- This is the official repository for Inheritune.☆105Updated last month
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆85Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- HelloBench: evaluating long text generation capabilities of LLMs☆29Updated 3 weeks ago
- Reformatted Alignment☆112Updated last month