☆219Jun 17, 2025Updated 10 months ago
Alternatives and similar repositories for llm_benchmarks
Users that are interested in llm_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆129Sep 12, 2024Updated last year
- ☆133Sep 12, 2024Updated last year
- ☆128Sep 12, 2024Updated last year
- ☆121Sep 11, 2024Updated last year
- ☆115Sep 12, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆114Sep 12, 2024Updated last year
- ☆122Sep 11, 2024Updated last year
- ☆117Sep 12, 2024Updated last year
- ☆111Sep 12, 2024Updated last year
- ☆116Sep 12, 2024Updated last year
- ☆111Sep 12, 2024Updated last year
- ☆109Sep 12, 2024Updated last year
- ☆110Sep 12, 2024Updated last year
- ☆108Sep 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆111Sep 12, 2024Updated last year
- ☆110Sep 12, 2024Updated last year
- ☆109Sep 12, 2024Updated last year
- ☆136Sep 12, 2024Updated last year
- ☆139Sep 12, 2024Updated last year
- ☆52Oct 8, 2024Updated last year
- ☆90Oct 8, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆216Updated this week
- ☆77Jun 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Adversarial Counterfactual Temporal Inference Network☆78Jul 28, 2025Updated 9 months ago
- ☆120Jul 28, 2025Updated 9 months ago
- ☆107Sep 26, 2024Updated last year
- ☆113Dec 4, 2022Updated 3 years ago
- [AAAI 2025] Holistic Semantic Representation for Navigational Trajectory Generation☆18Mar 7, 2026Updated last month
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- [ICLR 2022] Official repository for "Knowledge Removal in Sampling-based Bayesian Inference"☆18Mar 15, 2022Updated 4 years ago
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago
- ☆25Aug 19, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆21Jan 28, 2023Updated 3 years ago
- A Survey of Direct Preference Optimization (DPO)☆95Jul 4, 2025Updated 9 months ago
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- Simple Graph Condensation☆13Feb 26, 2025Updated last year
- [ECCV 2022] Code for the paper, ReAct: Temporal Action Detection with Relational Queries☆39Oct 19, 2022Updated 3 years ago
- [ICLR 2022] Official repository for "Robust Unlearnable Examples: Protecting Data Against Adversarial Learning"☆49Jul 20, 2024Updated last year
- The official implementation of Spatiotemporal Gated Traffic Trajectory Simulation with Semantic-aware Graph Learning (Information Fusion …☆10May 6, 2024Updated last year