RZFan525 / NLP-PhD-Application-In-The-World
The information of NLP PhD application in the world.
☆36Updated 8 months ago
Alternatives and similar repositories for NLP-PhD-Application-In-The-World:
Users that are interested in NLP-PhD-Application-In-The-World are comparing it to the libraries listed below
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated last year
- 计算语言学22-23学年秋季学期 课程大作业baseline实现☆37Updated 2 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆60Updated 9 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 5 months ago
- ☆86Updated last year
- ☆41Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆29Updated 9 months ago
- ☆73Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆109Updated 7 months ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated last year
- ☆16Updated 3 years ago
- ☆16Updated 6 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago
- ☆42Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆24Updated last year
- ☆18Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆23Updated last week
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆19Updated 5 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆41Updated 6 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆71Updated 2 years ago
- ☆25Updated 2 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Updated last year
- ☆39Updated last year
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆11Updated last year