☆33Jul 4, 2024Updated last year
Alternatives and similar repositories for eval_gsm8k
Users that are interested in eval_gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Math evaluations of llama models.☆10Jan 3, 2024Updated 2 years ago
- compare the theory attention gradient with PyTorch attention gradient☆16Apr 1, 2024Updated 2 years ago
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Aug 10, 2023Updated 2 years ago
- The code used to train and run inference with MMDocIR☆33May 29, 2025Updated 11 months ago
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Feb 26, 2024Updated 2 years ago
- CMATH: Can your language model pass Chinese elementary school math test?☆56Jul 3, 2023Updated 2 years ago
- ☆15Aug 7, 2025Updated 9 months ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- ☆19Jan 3, 2025Updated last year
- All materials related to GNN☆13Jan 4, 2023Updated 3 years ago
- ☆13Oct 20, 2020Updated 5 years ago
- ☆37Nov 16, 2017Updated 8 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated last year
- Embedding-based evaluation metrics for dialogue generation.☆15Jan 8, 2023Updated 3 years ago
- [NAACL 2025 Findings] Code for "Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios"☆28Mar 5, 2025Updated last year
- ☆16Aug 19, 2024Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago
- A benchmark for testing memorization abilities of LMs☆24Oct 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.