Self-hosted GPT-4V api
☆27Nov 6, 2023Updated 2 years ago
Alternatives and similar repositories for GPT-4V-API
Users that are interested in GPT-4V-API are comparing it to the libraries listed below
Sorting:
- ☆21May 24, 2024Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- ☆10May 20, 2019Updated 6 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- ☆15Jul 9, 2025Updated 7 months ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated last year
- ☆15Aug 18, 2022Updated 3 years ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- ☆17Feb 20, 2023Updated 3 years ago
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- Paper collections of the continuous effort start from World Models.☆204Jul 6, 2024Updated last year
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models☆18Oct 21, 2023Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19May 25, 2023Updated 2 years ago
- [ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.☆19Jun 2, 2024Updated last year
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆53Apr 7, 2025Updated 10 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- Paper collection on building and evaluating language model agents via executable language grounding☆365Apr 29, 2024Updated last year
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆23Sep 23, 2022Updated 3 years ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- ☆27Feb 26, 2023Updated 3 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Jul 9, 2025Updated 7 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Nov 16, 2023Updated 2 years ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆106Mar 14, 2024Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Oct 28, 2023Updated 2 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- ☆69Dec 15, 2024Updated last year
- WONDERBREAD benchmark + dataset for BPM tasks☆34Jul 30, 2025Updated 6 months ago
- ☆27Mar 6, 2023Updated 2 years ago
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering☆195Jan 14, 2024Updated 2 years ago
- ☆17Sep 1, 2024Updated last year
- ☆64Nov 28, 2022Updated 3 years ago
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆33Jul 18, 2023Updated 2 years ago