MurongYue / LLM_MoT_cascade
This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REASONING".
☆17Updated 3 months ago
Related projects: ⓘ
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆45Updated 6 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆41Updated last month
- Official codebase for permutation self-consistency.☆16Updated 7 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆28Updated last month
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆37Updated last year
- Evaluate the Quality of Critique☆35Updated 3 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆34Updated 2 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆82Updated 2 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆26Updated 5 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models☆42Updated last week
- Supporting code for ReCEval paper☆26Updated this week
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆37Updated 2 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆49Updated 2 weeks ago
- AbstainQA, ACL 2024☆17Updated 3 weeks ago
- ☆11Updated 2 weeks ago
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆32Updated 8 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆28Updated 4 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆39Updated 4 months ago
- Codebase for [Paper] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆12Updated 6 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆39Updated 7 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆60Updated last month
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆33Updated 3 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆39Updated 6 months ago
- ☆26Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆22Updated last month
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆13Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆51Updated last year
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆19Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆81Updated 2 weeks ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆22Updated 11 months ago