A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
☆177Feb 26, 2026Updated last week
Alternatives and similar repositories for eureka-ml-insights
Users that are interested in eureka-ml-insights are comparing it to the libraries listed below
Sorting:
- This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions☆19May 21, 2025Updated 9 months ago
- ☆11Jun 21, 2025Updated 8 months ago
- ☆14Jun 4, 2025Updated 9 months ago
- Production-ready Infrastructure as Code, applications, pluggable components, and PlatformOps toolchains that empower organizations to ach…☆51Updated this week
- Super gorgeous, easy-to-use and convenient mind map application☆15Nov 5, 2024Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last month
- Quickstart sample for using the Azure AI Studio with the SDK or CLI options - and the PromptFlow framework.☆19May 28, 2024Updated last year
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆220Jun 23, 2025Updated 8 months ago
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 10 months ago
- ☆15Jul 31, 2025Updated 7 months ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- GitHub Copilot Adoption Plan - Workshops - Full Solution☆18Feb 18, 2026Updated 2 weeks ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 9 months ago
- Activate GenAI with Azure☆23Jan 26, 2026Updated last month
- This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the…☆45Jan 22, 2026Updated last month
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆44Nov 8, 2024Updated last year
- BikeSharing360 Cognitive Services Kiosk Demo App☆39Nov 28, 2022Updated 3 years ago
- ☆77Updated this week
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
- decontamination☆26Updated this week
- What Would Portland Do? Generative agent experience☆13Mar 13, 2024Updated last year
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining☆13Oct 22, 2021Updated 4 years ago
- Retail Search with AI☆14Feb 14, 2026Updated 3 weeks ago
- ☆11Apr 21, 2023Updated 2 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- 一个开源数学大模型项目,旨在探索大模型是否具有数学创造能力,以及大模型在前沿数学研究中的潜在能力。☆17May 16, 2025Updated 9 months ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- Dockerized LLM inference server with constrained output (JSON mode), built on top of vLLM and outlines. Faster, cheaper and without rate …☆27Feb 17, 2024Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Jul 9, 2025Updated 7 months ago
- ☆28Apr 28, 2024Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆91Jan 9, 2026Updated last month
- SK Multi agentic advanced orchestration example☆15Feb 20, 2026Updated 2 weeks ago
- A sample demo for building and testing react components and includes a set of unique features including AI component generation and autom…☆15Jun 27, 2024Updated last year
- This is a solution accelerator for creating personalized content recommendations based on user activity.☆13Mar 26, 2024Updated last year
- A simple example of VAEs with KANs☆12May 17, 2024Updated last year
- ☆13Aug 26, 2024Updated last year
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)☆44Oct 6, 2025Updated 5 months ago