☆31Jul 4, 2024Updated last year
Alternatives and similar repositories for eval_gsm8k
Users that are interested in eval_gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Math evaluations of llama models.☆10Jan 3, 2024Updated 2 years ago
- compare the theory attention gradient with PyTorch attention gradient☆16Apr 1, 2024Updated 2 years ago
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Aug 10, 2023Updated 2 years ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Auto math prover.☆11Jul 10, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- ☆17Feb 26, 2024Updated 2 years ago
- CMATH: Can your language model pass Chinese elementary school math test?☆53Jul 3, 2023Updated 2 years ago
- List of papers that applied graph network to NLP☆13Feb 26, 2019Updated 7 years ago
- ☆14Aug 7, 2025Updated 8 months ago
- The official code of Multi-player Nash Preference Optimization [ICLR 2026]☆35Feb 4, 2026Updated 2 months ago
- PyTorch Implementation of Variance Reduced Optimization Algorithms -- SARAH and SVRG.☆15Jul 11, 2021Updated 4 years ago
- All materials related to GNN☆13Jan 4, 2023Updated 3 years ago
- ☆13Oct 20, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Jan 16, 2025Updated last year
- ☆37Nov 16, 2017Updated 8 years ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?☆37Jun 5, 2025Updated 10 months ago
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Jan 13, 2024Updated 2 years ago
- Embedding-based evaluation metrics for dialogue generation.☆15Jan 8, 2023Updated 3 years ago
- ☆15Aug 19, 2024Updated last year
- A benchmark for testing memorization abilities of LMs☆22Oct 15, 2024Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆18Jan 8, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Oct 14, 2024Updated last year
- ☆11Jul 6, 2023Updated 2 years ago
- Easy wrapper for inserting LoRA layers in CLIP.☆40Jun 16, 2024Updated last year
- [Neurips 2025]StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models☆29Dec 4, 2025Updated 4 months ago
- A demo of the Mito Streamlit Spreadsheet☆18Aug 3, 2023Updated 2 years ago
- Accelerated Bregman Proximal Gradient Methods☆29Jun 12, 2023Updated 2 years ago
- ☆18Oct 29, 2022Updated 3 years ago
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Mirror Diffusion Models, NeurIPS 2023☆33Dec 1, 2023Updated 2 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 3 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- ☆44Feb 22, 2026Updated last month
- ☆15Apr 27, 2024Updated last year
- ☆10Nov 22, 2022Updated 3 years ago