☆60Sep 23, 2023Updated 2 years ago
Alternatives and similar repositories for GSM8K-eval
Users that are interested in GSM8K-eval are comparing it to the libraries listed below.
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated 2 years ago
- ☆14Jun 24, 2024Updated 2 years ago
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts☆15Feb 26, 2024Updated 2 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆47Mar 12, 2024Updated 2 years ago
- ☆27Oct 6, 2024Updated last year
- ☆14Jan 6, 2025Updated last year
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆28Jun 27, 2024Updated 2 years ago
- ☆10Oct 21, 2021Updated 4 years ago
- ☆88Nov 21, 2025Updated 7 months ago
- ☆16Mar 26, 2025Updated last year
- ☆12Dec 17, 2023Updated 2 years ago
- ☆46Oct 1, 2024Updated last year
- ☆47Dec 9, 2024Updated last year
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.☆17Aug 22, 2024Updated last year
- Quickly set up a deep learning environment on Ubuntu from scratch; continuously updated☆12Apr 18, 2023Updated 3 years ago
- Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"☆35Dec 6, 2024Updated last year
- A PyTorch Implementation of the EMNLP 2020 paper "Mitigating Gender Bias for Neural Dialogue Generation with Adversarial Learning"☆13Feb 20, 2021Updated 5 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 5 months ago
- We propose Bidirectional Evolutionary Search (BES), a search framework that couples forward candidate evolution with backward goal decomp…☆160May 28, 2026Updated last month
- Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.☆38Oct 2, 2024Updated last year
- This is a sample implementation of "Robust Graph Convolutional Networks Against Adversarial Attacks", KDD 2019.☆10Dec 8, 2020Updated 5 years ago
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 10 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆28Nov 20, 2024Updated last year
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 10 months ago
- ☆14Nov 21, 2023Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection☆19Oct 30, 2023Updated 2 years ago
- A Rust implementation of Yolo for object detection and tracking.☆10Nov 17, 2022Updated 3 years ago
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆14Nov 26, 2019Updated 6 years ago
- ☆20May 14, 2025Updated last year
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models☆15Nov 4, 2023Updated 2 years ago
- ☆13Mar 30, 2022Updated 4 years ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"☆45Oct 3, 2025Updated 8 months ago
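- https://xuruowei.com was created in her memory by her family, her friends, and her beloved 高策. 徐若薇 passed away on February 28, 2026. We hope to commemorate her life through this timeline: photos, stories, writing, music, and everything she loved. Walk along the path of her life and touch those warm moments once again.☆28Apr 1, 2026Updated 3 months ago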
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆33Oct 20, 2025Updated 8 months ago
- ☆16Apr 26, 2023Updated 3 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Oct 12, 2020Updated 5 years ago