tianlwang/eval_gsm8k

☆33

Alternatives and similar repositories for eval_gsm8k

Users who are interested in eval_gsm8k are comparing it to the repositories listed below.

henrykmichalewski / math-evals
Math evaluations of llama models.
☆10 · Jan 3, 2024 · Updated 2 years ago
Haskely / gsm8k-rft-llama7b-u13b_evaluation
Evaluates the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b
☆15 · Aug 10, 2023 · Updated 2 years ago
liutianlin0121 / decoding-time-realignment
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21 · Jun 17, 2024 · Updated 2 years ago
lasgroup / SafetyPolytope
Learning Safety Constraints for Large Language Models (ICML 2025)
☆35 · May 25, 2026 · Updated last month
Jometeorie / probing_llama
☆17 · Feb 26, 2024 · Updated 2 years ago
XiaoMi / cmath
CMATH: Can your language model pass Chinese elementary school math test?
☆56 · Jul 3, 2023 · Updated 3 years ago
thu-ml / TetraJet-v2-NVFP4Training
[ICML 2026 Spotlight] Official implementation of TetraJet-v2: Accurate NVFP4 Training for LLMs, with fully-NVFP4 linear layer with unbias…
☆17 · Jul 3, 2026 · Updated 3 weeks ago
NeuralSentinel / SafeInfer
☆23 · Jan 14, 2025 · Updated last year
THUKElab / CCL2023-CLTC-THU_KELab
This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…
☆15 · Nov 25, 2023 · Updated 2 years ago
ZZR0 / CodeAttack
Adversarial Attack for Pre-trained Code Models
☆10 · Jul 19, 2022 · Updated 4 years ago
SamuelHorvath / Variance_Reduced_Optimizers_Pytorch
PyTorch implementation of variance-reduced optimization algorithms: SARAH and SVRG.
☆15 · Jul 11, 2021 · Updated 5 years ago
yawen-d / DQN_Family_PyTorch
A repository of PyTorch implementations of DQN and its variants, based on the original paper.
☆13 · Nov 18, 2019 · Updated 6 years ago
yahshibu / nested-ner-tacl2020-flair
Implementation of Nested Named Entity Recognition using Flair
☆24 · Oct 29, 2021 · Updated 4 years ago
nuochenpku / LLaMA_Analysis
This is the official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆31 · Jan 13, 2024 · Updated 2 years ago
acl-org / acl-2025
☆15 · Aug 7, 2025 · Updated 11 months ago
spetryk / GALS
☆13 · Aug 14, 2022 · Updated 3 years ago
diqiuzhuanzhuan / openllm-func-call-synthesizer
openllm-func-call-synthesizer is an open-source data synthesis and annotation framework designed to generate high-quality function-callin…
☆20 · Jul 14, 2026 · Updated last week
uw-mad-dash / decoding-speculative-decoding
☆16 · Aug 19, 2024 · Updated last year
pittisl / ElasticTrainer
Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)
☆14 · Nov 1, 2023 · Updated 2 years ago
0x7o / RETRO-transformer
Easy-to-use Retrieval-Enhanced Transformer implementation
☆10 · Sep 30, 2022 · Updated 3 years ago
XMUDeepLIT / QGC
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆20 · Jun 12, 2024 · Updated 2 years ago
pgazmuri / GPTReactor
React CodeGen using GPT
☆12 · Feb 11, 2024 · Updated 2 years ago
sangminwoo / awesome-token-redundancy-reduction
Awesome papers on token redundancy reduction
☆14 · Mar 12, 2025 · Updated last year
sixws / Demo
A small game
☆14 · Aug 17, 2022 · Updated 3 years ago
chuhac / Reasoning-to-Defend
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12 · Aug 22, 2025 · Updated 11 months ago
microsoft / ARXGEN
Scripts to parse arXiv documents for NLP tasks
☆19 · Jun 12, 2023 · Updated 3 years ago
cgrpa / AzureOAIBalancer
AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi…
☆10 · Nov 3, 2023 · Updated 2 years ago
WenyiWU0111 / CoMEM
This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.
☆31 · Jul 3, 2025 · Updated last year
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32 · Oct 9, 2025 · Updated 9 months ago
NiuTrans / ForgettingCurve
A benchmark for testing memorization abilities of LMs
☆24 · Oct 15, 2024 · Updated last year
WHU-ZQH / PANDA
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
☆16 · Mar 28, 2023 · Updated 3 years ago
THU-KEG / SafetyNeuron
Data and code for the paper: Finding Safety Neurons in Large Language Models
☆29 · Jan 29, 2026 · Updated 5 months ago
Sugewud / Safe-Sora
[NeurIPS 2025] The official implementation of the paper "Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking"
☆20 · Oct 10, 2025 · Updated 9 months ago
YiyiyiZhao / siren
Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …
☆15 · Jun 14, 2026 · Updated last month
fvmassoli / cross-resolution-face-recognition
☆10 · Nov 22, 2022 · Updated 3 years ago
TencentAILabHealthcare / UMIX
☆18 · Oct 29, 2022 · Updated 3 years ago
maljazaery / llm-load-test-azure
☆10 · Sep 25, 2024 · Updated last year
zhenqincn / FedAPEN
This repository contains the official implementation of the paper entitled "FedAPEN: Personalized Cross-silo Federated Learning with…
☆14 · Dec 4, 2023 · Updated 2 years ago
lixintong1992 / Machine_Learning
My Machine Learning repository
☆10 · Apr 10, 2017 · Updated 9 years ago