allenai / hci-alt-textsLinks
Dataset and annotations for ASSETS 2022 publication
☆12Updated 3 years ago
Alternatives and similar repositories for hci-alt-texts
Users that are interested in hci-alt-texts are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆98Updated 10 months ago
- PyLate efficient inference engine☆64Updated 3 weeks ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated 11 months ago
- VisText is a benchmark dataset for semantically rich chart captioning.☆95Updated last month
- ☆57Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆100Updated 5 months ago
- Discovering Data-driven Hypotheses in the Wild☆113Updated 3 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆42Updated 6 months ago
- An attribution library for LLMs☆42Updated last year
- Get answers to research questions from 200M+ papers. Link to demo -☆206Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆24Updated 2 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆110Updated 3 months ago
- ☆17Updated 2 years ago
- This repository contains ScholarQABench data and evaluation pipeline.☆85Updated last month
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated last month
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated 11 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆123Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆109Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods.☆153Updated last year
- Pretraining Efficiently on S2ORC!☆170Updated 11 months ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆72Updated last year
- ☆81Updated 2 weeks ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last week
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆43Updated last year
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆47Updated 3 months ago
- AI Data Management & Evaluation Platform☆216Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year