☆84Nov 21, 2025Updated 4 months ago
Alternatives and similar repositories for mistral-evals
Users that are interested in mistral-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- ☆21Dec 14, 2024Updated last year
- ☆12Apr 18, 2025Updated 11 months ago
- simplest online-softmax notebook for explain Flash Attention☆16Jan 27, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- codes for Efficient Test-Time Scaling via Self-Calibration☆19Sep 13, 2025Updated 6 months ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆16Aug 3, 2023Updated 2 years ago
- Datamodels for hugging face tokenizers☆104Mar 12, 2026Updated last week
- A Bayesian model for time-series count data with weekend effects and a lagged reporting process☆10Mar 7, 2022Updated 4 years ago
- ☆136May 29, 2025Updated 9 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- ☆13Jan 22, 2025Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆21May 28, 2024Updated last year
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- A Datasette plugin for making data visualizations with Observable Plot☆26Oct 21, 2025Updated 5 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆186Jun 8, 2025Updated 9 months ago
- This is the repository for the AAAI 21 paper [Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Lea…☆17Feb 28, 2023Updated 3 years ago
- 丢小墙小程序项目,使用腾讯云开发☆11Dec 10, 2022Updated 3 years ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆90Feb 17, 2025Updated last year
- Efficient Feature Extraction for High-resolution Video Frame Interpolation (BMVC 2022)☆13Aug 24, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Test-time-training on nearest neighbors for large language models☆49Apr 18, 2024Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025]☆128Nov 26, 2025Updated 4 months ago
- AMD’s C++ library for accelerating tensor primitives☆49Updated this week
- ☆12Jan 31, 2024Updated 2 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆19Aug 5, 2025Updated 7 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆149Aug 9, 2024Updated last year
- Eagle: Frontier Vision-Language Models with Data-Centric Strategies☆934Oct 25, 2025Updated 5 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last month
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation for paper TEVAD: Improved video anomaly detection with captions☆41Apr 5, 2023Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- Boosting 4-bit inference kernels with 2:4 Sparsity☆94Sep 4, 2024Updated last year
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆75May 31, 2025Updated 9 months ago
- Repositorio general para Bootcamps de Data Science en Coding Dojo☆11Nov 13, 2025Updated 4 months ago
- Multimodal language model benchmark, featuring challenging examples☆186Dec 18, 2024Updated last year