julianmichael / debate
Debate interface, experiments, etc.
☆11Updated 6 months ago
Related projects: ⓘ
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆13Updated last year
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆34Updated last year
- Llemma formal2formal (tactic prediction) theorem proving experiments☆15Updated 11 months ago
- Minimum Description Length probing for neural network representations☆15Updated 11 months ago
- ☆16Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆32Updated this week
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆15Updated 3 weeks ago
- ☆11Updated 11 months ago
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆12Updated 7 months ago
- code for "Natural Language to Code Translation with Execution"☆39Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆30Updated 4 months ago
- ☆20Updated last week
- Harmonic Datasets☆26Updated 2 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆40Updated 8 months ago
- ☆8Updated last month
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating☆26Updated last week
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆23Updated 7 months ago
- Critique-out-Loud Reward Models☆17Updated 2 weeks ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆18Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆30Updated last year
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆11Updated 2 months ago
- This is the official repository for all the code of TheoremLlama☆26Updated 2 months ago
- Repository for Skill Set Optimization☆12Updated last month
- Code repo for MathAgent☆13Updated 9 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆27Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆14Updated 3 months ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements☆11Updated last week
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆14Updated 6 months ago