Alsace08 / OOD-Math-Reasoning
Code and Data Repo for Paper "Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning"
☆11 · Updated 3 months ago
Related projects:
- Methods and evaluation for aligning language models temporally ☆24 · Updated 6 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆62 · Updated 7 months ago
- ☆22 · Updated last year
- [ACL 2024] A Prospector of Long-Dependency Data for Large Language Models ☆48 · Updated last month
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models" ☆21 · Updated 2 months ago
- self-adaptive in-context learning ☆42 · Updated last year
- ☆17 · Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆35 · Updated last year
- ☆26 · Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆20 · Updated 6 months ago
- ☆32 · Updated 8 months ago
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning ☆17 · Updated 10 months ago
- kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023) ☆12 · Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆56 · Updated 6 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆54 · Updated 9 months ago
- ☆21 · Updated last year
- Abacus Code LLM (珠算代码大模型) ☆18 · Updated this week
- ☆16 · Updated last year
- my commonly-used tools ☆46 · Updated last month
- ☆15 · Updated last week
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆45 · Updated 5 months ago
- A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models ☆31 · Updated 3 months ago
- Dive-into-LLMs Tutorial for Beginners ☆6 · Updated 4 months ago
- ☆13 · Updated 10 months ago
- ☆35 · Updated last year
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022 ☆18 · Updated 2 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw ☆13 · Updated 2 years ago
- ☆57 · Updated last year
- Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023). ☆21 · Updated 10 months ago
- ☆31 · Updated 3 months ago