minalee-research / coauthor-interfaceLinks
☆94Updated last year
Alternatives and similar repositories for coauthor-interface
Users that are interested in coauthor-interface are comparing it to the libraries listed below
Sorting:
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆70Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆122Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆109Updated 8 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- ☆44Updated 8 months ago
- ☆94Updated 6 months ago
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆207Updated 2 years ago
- Get answers to research questions from 200M+ papers. Link to demo -☆205Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆82Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆161Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 2 weeks ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated last year
- ☆239Updated 3 months ago
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆101Updated 5 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆184Updated last week
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆217Updated last year
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated 11 months ago
- ☆16Updated last year
- Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models☆17Updated 6 months ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆88Updated last year
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated 2 years ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"☆314Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆96Updated last year
- Based on the tree of thoughts paper☆48Updated last year