Haskely / gsm8k-rft-llama7b-u13b_evaluation
Tests the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b
☆15Aug 10, 2023Updated 2 years ago
Alternatives and similar repositories for gsm8k-rft-llama7b-u13b_evaluation
Users that are interested in gsm8k-rft-llama7b-u13b_evaluation are comparing it to the libraries listed below
- Compares the theoretical attention gradient with the PyTorch attention gradient☆15Apr 1, 2024Updated last year
- ☆31Jul 4, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- ☆15Jan 27, 2026Updated 3 weeks ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- ☆11Dec 28, 2023Updated 2 years ago
- ☆11Feb 28, 2024Updated last year
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- Code for the paper "Knowledge-Aware Federated Active Learning with Non-IID Data", ICCV2023☆10Sep 8, 2023Updated 2 years ago
- Source code of FedAttack.☆11Feb 9, 2022Updated 4 years ago
- Code and models for "Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework" (ACL 2022)☆12Jun 29, 2022Updated 3 years ago
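- The Summer Machine Learning Seminar is a discussion and exchange activity organized by instructor Zhang Xiang (张祥), with all students participating. Its aim is to help students consolidate basic machine learning algorithms and master their principles and usage. Students choose a topic, prepare slides, and teach it as a talk to all participating students and advisors.☆10Sep 19, 2018Updated 7 years ago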
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 2 years ago
- ☆10Nov 22, 2022Updated 3 years ago
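- Python web-scraping examples; the scraped sites include Douban, Meituan, Bilibili, image resources, classical Chinese poetry, and the Guangdong University of Technology official website, among others.☆12Apr 30, 2021Updated 4 years ago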
- ☆14May 7, 2024Updated last year
- ☆15May 31, 2024Updated last year
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 2 years ago
- ☆13Mar 29, 2023Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆64Feb 13, 2023Updated 3 years ago
- ☆16Feb 17, 2019Updated 7 years ago
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Oct 25, 2024Updated last year
- ☆17Jul 10, 2023Updated 2 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- ☆17Aug 29, 2025Updated 5 months ago
- Soft Mixture of Experts Vision Transformer, addressing MoE limitations as highlighted by Puigcerver et al., 2023.☆15Aug 13, 2023Updated 2 years ago
- PyTorch framework learning for deep learning☆14Jan 2, 2024Updated 2 years ago
- Repo for our Paper: Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments☆19Nov 25, 2022Updated 3 years ago
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆26Jan 27, 2026Updated 3 weeks ago
- ☆18Oct 6, 2022Updated 3 years ago
- different AI algorithms to solve board games☆19Nov 4, 2018Updated 7 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- MMD-ReID: A Simple but Effective solution for Visible-Thermal Person ReID☆17Jan 27, 2022Updated 4 years ago
- ☆19Sep 15, 2022Updated 3 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- learned cardinalities for databases☆16Apr 12, 2023Updated 2 years ago
- ☆28Mar 20, 2024Updated last year
- Single Player Monte Carlo Tree Search implementation☆18Jan 22, 2020Updated 6 years ago
- Data Augmentation on Graphs: A Technical Survey☆15Feb 12, 2023Updated 3 years ago