tonywu95 / LIME
Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
☆27Updated 3 years ago
Alternatives and similar repositories for LIME:
Users that are interested in LIME are comparing it to the libraries listed below
- Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".☆13Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- ☆13Updated 3 years ago
- Emergent Communication Pretraining for Few-Shot Machine Translation☆13Updated 4 years ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆36Updated last year
- GHOSTS dataset☆38Updated last year
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆70Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆23Updated 11 months ago
- Neural Unification for Logic Reasoning over Language☆22Updated 3 years ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Updated 3 years ago
- Query-focused summarization data☆41Updated last year
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆22Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆26Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆58Updated last year
- ☆10Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆45Updated last year
- ☆32Updated 11 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Updated 2 years ago
- ☆13Updated 2 years ago
- A unified benchmark for math reasoning☆87Updated last year
- ☆45Updated last year
- Minimum Description Length probing for neural network representations☆18Updated last week
- ☆16Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆13Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆18Updated 2 years ago
- Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)☆44Updated 3 years ago
- Documenting large text datasets 🖼️ 📚☆11Updated last month
- ☆23Updated 4 months ago