☆86Nov 21, 2025Updated 4 months ago
Alternatives and similar repositories for mistral-evals
Users that are interested in mistral-evals are comparing it to the libraries listed below.
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- ☆21Dec 14, 2024Updated last year
- ☆12Apr 18, 2025Updated 11 months ago
- simplest online-softmax notebook for explain Flash Attention☆16Jan 27, 2026Updated 2 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆19Sep 13, 2025Updated 7 months ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆16Aug 3, 2023Updated 2 years ago
- Datamodels for hugging face tokenizers☆106Updated this week
- The Community Guide to Open-Source AI Software Craft☆35Jul 16, 2025Updated 8 months ago
- ☆139May 29, 2025Updated 10 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 8, 2026Updated last week
- ☆13Jan 22, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 3 months ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- OpenTelemetry Python distribution for Uptrace☆27Mar 13, 2026Updated last month
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- [NeurIPS'23] Uncertainty Estimation for Safety-critical Scene Segmentation via Fine-grained Reward Maximization☆20Aug 4, 2024Updated last year
- ☆60Sep 23, 2023Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more.☆19Dec 11, 2025Updated 4 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- A Datasette plugin for making data visualizations with Observable Plot☆26Oct 21, 2025Updated 5 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆191Jun 8, 2025Updated 10 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆90Feb 17, 2025Updated last year
- A curated list of resources related to structured generation 🔥☆23Jul 25, 2025Updated 8 months ago
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025]☆131Nov 26, 2025Updated 4 months ago
- ☆12Jan 31, 2024Updated 2 years ago
- AMD’s C++ library for accelerating tensor primitives☆49Updated this week
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆149Aug 9, 2024Updated last year
- Eagle: Frontier Vision-Language Models with Data-Centric Strategies☆938Oct 25, 2025Updated 5 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated 2 months ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- Data for the MTEB leaderboard☆50Apr 7, 2026Updated last week
- Showcasing the power of Ruby on Rails.☆12Jun 7, 2020Updated 5 years ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 11 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆78May 31, 2025Updated 10 months ago