tianlwang / eval_gsm8k
☆17Updated 6 months ago
Alternatives and similar repositories for eval_gsm8k:
Users that are interested in eval_gsm8k are comparing it to the libraries listed below
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆14Updated last year
- ☆29Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆128Updated last year
- ☆50Updated 3 weeks ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆38Updated 11 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆126Updated 6 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆161Updated 11 months ago
- ☆20Updated 2 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆143Updated 6 months ago
- ☆24Updated 10 months ago
- Released code for our ICLR23 paper.☆63Updated last year
- ☆162Updated 6 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆106Updated 4 months ago
- ☆21Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆39Updated 3 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆91Updated 9 months ago
- Explore what LLMs are really leanring over SFT☆28Updated 10 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆66Updated 2 weeks ago
- ☆153Updated 7 months ago
- ☆48Updated last year
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆38Updated 7 months ago
- Code for `Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆17Updated 9 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆88Updated 2 weeks ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆103Updated 10 months ago
- ☆60Updated 2 years ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆105Updated 6 months ago
- A Survey on the Honesty of Large Language Models☆51Updated last month
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 9 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆29Updated 2 weeks ago
- 🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆65Updated last month