☆108May 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for asta-bench
Users that are interested in asta-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated 3 months ago
- [ACL 2026] Repository of IPBench☆22Apr 6, 2026Updated 2 months ago
- 🔍 Enable AI assistants to search and access bioRxiv papers through a simple MCP interface.☆23Mar 18, 2025Updated last year
- Generating Protein Variants with Different Generative Models (HMM, VAE, ESM-2, ProtGPT2)☆11Mar 14, 2024Updated 2 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Jun 7, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simple correct&smooth implementation in PyTorch.☆13Nov 8, 2022Updated 3 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- ☆18May 15, 2023Updated 3 years ago
- ☆20Jul 10, 2025Updated 10 months ago
- A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning☆41Mar 12, 2026Updated 2 months ago
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆60May 8, 2026Updated 3 weeks ago
- Internal utility libraries for Pkl☆16May 29, 2026Updated last week
- PyTorch implementation of Basenji2.☆22Apr 29, 2025Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆27Apr 4, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated 2 years ago
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆54Dec 2, 2024Updated last year
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆68Sep 15, 2025Updated 8 months ago
- ☆23Feb 4, 2023Updated 3 years ago
- Multilingual and Multiculture Benchmark and LLM☆40May 18, 2026Updated 3 weeks ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Container-free RL framework for training software engineering agents☆59Mar 4, 2026Updated 3 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 8 months ago
- ☆26Jun 5, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Jun 28, 2023Updated 2 years ago
- ☆22Dec 3, 2025Updated 6 months ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆32Dec 11, 2025Updated 5 months ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- ☆35Oct 23, 2025Updated 7 months ago
- [NeurIPS 2021] "Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models" by Boxin Wang*, Chejian Xu*, Shuoh…☆13Apr 3, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- ☆293Updated this week
- This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".☆10Jun 2, 2023Updated 3 years ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- Curso de Deep Learning para programadores.☆21Aug 11, 2019Updated 6 years ago
- ☆87Sep 25, 2025Updated 8 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆23Mar 5, 2026Updated 3 months ago