allenai / hci-alt-texts
Dataset and annotations for ASSETS 2022 publication
☆12 Updated 2 years ago
Alternatives and similar repositories for hci-alt-texts
Users who are interested in hci-alt-texts are comparing it to the repositories listed below
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆42 Updated 8 months ago
- ☆20 Updated 2 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆37 Updated 5 months ago
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main) ☆23 Updated last month
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning" ☆26 Updated last year
- ☆57 Updated 9 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆130 Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 8 months ago
- ☆12 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆89 Updated 6 months ago
- ☆25 Updated 2 years ago
- ☆62 Updated 11 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆44 Updated 11 months ago
- ☆61 Updated 3 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆105 Updated 2 weeks ago
- ☆23 Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆52 Updated this week
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆40 Updated 3 months ago
- ☆20 Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 9 months ago
- ☆45 Updated 10 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated last year
- ☆95 Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆58 Updated 6 months ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆23 Updated last year
- VisText is a benchmark dataset for semantically rich chart captioning. ☆93 Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆48 Updated last year
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models ☆21 Updated 3 weeks ago