☆86Nov 21, 2025Updated 5 months ago
Alternatives and similar repositories for mistral-evals
Users that are interested in mistral-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- ☆21Dec 14, 2024Updated last year
- simplest online-softmax notebook for explain Flash Attention☆16Jan 27, 2026Updated 3 months ago
- A Bayesian model for time-series count data with weekend effects and a lagged reporting process☆10Mar 7, 2022Updated 4 years ago
- The Community Guide to Open-Source AI Software Craft☆36Jul 16, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆139May 29, 2025Updated 11 months ago
- Python client SDK for Ultravox.☆16Dec 10, 2025Updated 4 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated 3 weeks ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- ☆14Jan 22, 2025Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- Code for paper: "Privately generating tabular data using language models".☆15Jun 13, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more.☆19Dec 11, 2025Updated 4 months ago
- SHUbeamer是为了帮助上海大学师生撰写演示文稿而编写的LaTex Beamer模版文件☆10Dec 1, 2021Updated 4 years ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆65Oct 19, 2024Updated last year
- A Datasette plugin for making data visualizations with Observable Plot☆26Oct 21, 2025Updated 6 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆191Jun 8, 2025Updated 10 months ago
- 丢小墙小程序项目,使用腾讯云开发☆11Dec 10, 2022Updated 3 years ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆92Feb 17, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Run evals using LLM☆27Jan 8, 2026Updated 3 months ago
- A curated list of resources related to structured generation 🔥☆23Jul 25, 2025Updated 9 months ago
- ☆24Jul 10, 2025Updated 9 months ago
- add attention mechanism in InvertedResidual block about shuffleNetV2☆10Mar 2, 2024Updated 2 years ago
- Eagle: Frontier Vision-Language Models with Data-Centric Strategies☆943Oct 25, 2025Updated 6 months ago
- DeepSeek Essentials, published by Packt☆33Apr 15, 2026Updated 2 weeks ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 3 months ago
- Official implementation for paper TEVAD: Improved video anomaly detection with captions☆40Apr 5, 2023Updated 3 years ago
- Data for the MTEB leaderboard☆53Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Apr 12, 2024Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆73May 13, 2025Updated 11 months ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆20Aug 5, 2025Updated 9 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆26May 13, 2025Updated 11 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆78May 31, 2025Updated 11 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year