Guangxuan-Xiao / GSM8K-eval
☆29Updated last year
Alternatives and similar repositories for GSM8K-eval:
Users that are interested in GSM8K-eval are comparing it to the libraries listed below
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View☆46Updated 3 months ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆83Updated 7 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆65Updated 6 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆91Updated 9 months ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)☆33Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆103Updated 10 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆153Updated 9 months ago
- Accepted LLM Papers in NeurIPS 2024☆33Updated 3 months ago
- ☆48Updated last year
- ☆71Updated last month
- ☆47Updated 2 months ago
- ☆50Updated 3 weeks ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆31Updated 7 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆64Updated 5 months ago
- Official Code for Paper: Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications☆68Updated 3 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆126Updated 6 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆88Updated 8 months ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆33Updated 5 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆105Updated 6 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆283Updated 5 months ago
- ☆13Updated 11 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆29Updated last month
- ☆122Updated 6 months ago
- ☆17Updated 6 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆34Updated 3 months ago
- LLM Unlearning☆141Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆18Updated 3 months ago
- ☆16Updated 2 months ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆67Updated 10 months ago
- The repo for In-context Autoencoder☆104Updated 8 months ago