EdisonNi-hku / chatreportLinks
Github implementation of https://reports.chatclimate.ai/
☆22Updated 3 months ago
Alternatives and similar repositories for chatreport
Users that are interested in chatreport are comparing it to the libraries listed below
Sorting:
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆108Updated 11 months ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆51Updated 2 years ago
- ☆147Updated last year
- ☆58Updated 11 months ago
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆19Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆102Updated 2 years ago
- A framework for editing the CoTs for better factuality☆51Updated last year
- ☆59Updated 11 months ago
- ☆18Updated last year
- ☆68Updated 2 years ago
- ☆140Updated 2 years ago
- This the implementation of LeCo☆31Updated 8 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆42Updated last year
- ☆51Updated last year
- ☆36Updated last year
- ☆96Updated last year
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs.☆83Updated 10 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆94Updated 7 months ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆135Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆59Updated 4 months ago
- ☆16Updated 2 years ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated last year
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆99Updated 2 years ago
- A Toolkit for Table-based Question Answering☆113Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆82Updated 2 years ago
- Unofficial implementation of AlpaGasus☆93Updated 2 years ago
- ☆33Updated last year
- Generative Judge for Evaluating Alignment☆246Updated last year
- Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation☆21Updated 7 months ago
- Reformatted Alignment☆113Updated last year