☆104May 11, 2026Updated last week
Alternatives and similar repositories for asta-bench
Users that are interested in asta-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utility to record data from the FinalSpark live☆13Sep 16, 2025Updated 8 months ago
- ☆138May 11, 2026Updated last week
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated 2 months ago
- [ACL 2026] Repository of IPBench☆21Apr 6, 2026Updated last month
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆59May 8, 2026Updated last week
- Forecasting high-impact research topics via machine learning on evolving knowledge graphs☆51Nov 26, 2025Updated 5 months ago
- ☆42Aug 20, 2025Updated 8 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated 2 years ago
- ☆67May 2, 2026Updated 2 weeks ago
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆54Dec 2, 2024Updated last year
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments☆239Dec 16, 2025Updated 5 months ago
- Container-free RL framework for training software engineering agents☆58Mar 4, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 7 months ago
- ☆10Jun 28, 2023Updated 2 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆54Mar 2, 2026Updated 2 months ago
- ☆22Dec 3, 2025Updated 5 months ago
- ☆37Apr 21, 2026Updated 3 weeks ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- ☆43Apr 28, 2026Updated 2 weeks ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- [CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆65Feb 21, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- ☆36Oct 23, 2025Updated 6 months ago
- ☆280May 6, 2026Updated last week
- This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".☆10Jun 2, 2023Updated 2 years ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- Useful LLM contexts ready to be used in AIMagic☆32Apr 6, 2026Updated last month
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆241Mar 17, 2026Updated 2 months ago
- ☆87Sep 25, 2025Updated 7 months ago
- ☆64Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 7 months ago
- PyTorch Implementation of CLEAN-Contact: Contrastive Learning-enabled Enzyme Functional Annotation Prediction with Structural Inference☆11May 29, 2024Updated last year
- Replications data and code for "LaLonde (1986) after Nearly Four Decades: Lessons Learned"☆34Jun 14, 2024Updated last year
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- Code for the MTEB Arena☆24Jul 2, 2025Updated 10 months ago
- The official repository of the first version of ACE-Brain foundation model.☆76Mar 13, 2026Updated 2 months ago
- Starter for building with CopilotKit and LangGraph☆15Mar 12, 2026Updated 2 months ago