☆87Nov 21, 2025Updated 6 months ago
Alternatives and similar repositories for mistral-evals
Users that are interested in mistral-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Apr 18, 2025Updated last year
- simplest online-softmax notebook for explain Flash Attention☆17Jan 27, 2026Updated 4 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆20Sep 13, 2025Updated 9 months ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆16Aug 3, 2023Updated 2 years ago
- The Community Guide to Open-Source AI Software Craft☆37Jul 16, 2025Updated 10 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆139May 29, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 8 months ago
- ☆14Jan 22, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆43Dec 29, 2025Updated 5 months ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- ☆60Sep 23, 2023Updated 2 years ago
- Compare how fine-tuned AI video models interpret the same prompts☆14Jan 29, 2025Updated last year
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Datasette plugin for making data visualizations with Observable Plot☆26Oct 21, 2025Updated 7 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆68Oct 19, 2024Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆196Jun 8, 2025Updated last year
- This is the repository for the AAAI 21 paper [Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Lea…☆18Feb 28, 2023Updated 3 years ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆93Feb 17, 2025Updated last year
- Efficient Feature Extraction for High-resolution Video Frame Interpolation (BMVC 2022)☆14Aug 24, 2023Updated 2 years ago
- ☆67Updated this week
- Run evals using LLM☆27Jan 8, 2026Updated 5 months ago
- A curated list of resources related to structured generation 🔥☆24Jul 25, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated 2 years ago
- The Official Implementation of Ada-KV [NeurIPS 2025]☆135Nov 26, 2025Updated 6 months ago
- ☆12Jan 31, 2024Updated 2 years ago
- Example code using the DSPy framework.☆20May 30, 2024Updated 2 years ago
- LLM inference in C/C++☆33Updated this week
- Tokenizer 비교 실험☆11Jan 3, 2022Updated 4 years ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆151Aug 9, 2024Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 4 months ago
- Official implementation for paper TEVAD: Improved video anomaly detection with captions☆40Apr 5, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes☆19May 7, 2026Updated last month
- Showcasing the power of Ruby on Rails.☆12Jun 7, 2020Updated 6 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆96Sep 4, 2024Updated last year
- Automate the creation of high quality research papers in latex. Powered by Swarms 🤖☆11Dec 1, 2025Updated 6 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆148Sep 20, 2024Updated last year
- Multimodal language model benchmark, featuring challenging examples☆187Dec 18, 2024Updated last year
- Odysseus: Playground of LLM Sequence Parallelism☆81Jun 17, 2024Updated last year