definitive-io/human-eval-sampling-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/definitive-io/human-eval-sampling-benchmark)

definitive-io / human-eval-sampling-benchmark

OpenAI's human-eval sampling benchmark

☆13

Alternatives and similar repositories for human-eval-sampling-benchmark

Users that are interested in human-eval-sampling-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

definitive-io / code-indexer-loop
View on GitHub
Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…
☆175Apr 9, 2024Updated 2 years ago
definitive-io / openassistants
View on GitHub
Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️
☆76May 21, 2024Updated 2 years ago
groq / groq-changelog
View on GitHub
Groq Public Changelog
☆18May 6, 2026Updated 2 months ago
groq / groq-typescript
View on GitHub
The official Node.js / Typescript library for the Groq API
☆259Jul 18, 2026Updated last week
agiguild / agiguild
View on GitHub
AGI Guild
☆16Feb 18, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
groq / groq-python
View on GitHub
The official Python Library for the Groq API
☆613Jul 18, 2026Updated last week
webtimemachine / wtm2
View on GitHub
☆22Jan 11, 2025Updated last year
cividi / spatial-data-package-platform
View on GitHub
Django + Vue platform for publishing Spatial Data Packages.
☆12Mar 6, 2023Updated 3 years ago
mikedbjones / longtrends
View on GitHub
Package to download long-term Google Trends
☆16Jul 19, 2023Updated 3 years ago
groq / openbench
View on GitHub
Provider-agnostic, open-source evaluation infrastructure for language models
☆791Jun 26, 2026Updated 3 weeks ago
samirbajaj-zz / cs229-project
View on GitHub
Recommender System Project for CS229 at Stanford in Fall 2012
☆15Dec 9, 2012Updated 13 years ago
yisding / litllm
View on GitHub
☆19Aug 25, 2025Updated 10 months ago
rolfmorel / macht
View on GitHub
A 2048 clone in python with Terminal UI
☆28Dec 28, 2015Updated 10 years ago
yacineMTB / just-large-models
View on GitHub
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Sep 6, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dylibso / mcp-otel
View on GitHub
An example of distributed tracing an MCP enabled agent
☆15Feb 14, 2026Updated 5 months ago
ZachHandley / LlamaIndexAPI
View on GitHub
A Docker image with Llama Index, Lang Chain, and a few other popular AI packages installed by default
☆11Nov 19, 2025Updated 8 months ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
strangeloopcanon / ReflectGPT
View on GitHub
Add ability to interrupt own message
☆14Apr 21, 2024Updated 2 years ago
shoggoth13 / aityping
View on GitHub
☆13Jul 16, 2023Updated 3 years ago
jeroenjanssens / sample
View on GitHub
Filter lines from standard input according to some probability, with a given delay, and for a certain duration.
☆26Feb 17, 2023Updated 3 years ago
centminmod / centminmod-sysbench
View on GitHub
sysbench.sh benchmark tool written specifically for centminmod.com LEMP stack servers
☆23May 27, 2025Updated last year
mricon / howler
View on GitHub
Alert when users log in from new locations
☆42Jun 2, 2017Updated 9 years ago
orchest / orchest-examples
View on GitHub
Awesome Orchest projects, both official and submitted by the community.
☆25Aug 31, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
shroominic / funcchain
View on GitHub
⛓️ build cognitive systems, pythonic
☆341Nov 19, 2024Updated last year
suhashm / ParkMate
View on GitHub
Insight Data Engineering Project
☆15Jun 1, 2021Updated 5 years ago
charlesxu90 / Face_recognition
View on GitHub
Matlab code for face recognition (CS229 Course Project).
☆13Jun 17, 2014Updated 12 years ago
kyle8581 / LanguageModelsasCompilers
View on GitHub
Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…
☆23Apr 8, 2024Updated 2 years ago
GraphIndex-org / semantic-mapper
View on GitHub
☆22Aug 24, 2023Updated 2 years ago
groq / groq-desktop-beta
View on GitHub
Local Groq Desktop chat app with MCP support
☆398Jun 26, 2026Updated 3 weeks ago
Tachikoma000 / playgrounds_subgraph_connector
View on GitHub
The `PlaygroundsSubgraphConnector` is a tool designed for agents to seamlessly interface with and query subgraphs on The Graph's decentra…
☆13Oct 12, 2023Updated 2 years ago
databricks-industry-solutions / mfg-llm-qa-bot
View on GitHub
☆19Mar 5, 2024Updated 2 years ago
lgrees / resy-cli
View on GitHub
A CLI to easily schedule restaurant reservations in advance.
☆57Apr 20, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
daily-co / pcc-groq-llama
View on GitHub
Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio
☆72Sep 16, 2025Updated 10 months ago
lxe / llama-peft-tuner
View on GitHub
Tune LLaMa-7B on Alpaca Dataset using PEFT / LORA Based on @zphang's https://github.com/zphang/minimal-llama scripts.
☆25Mar 15, 2023Updated 3 years ago
vercel-labs / ai-sdk-starter-groq
View on GitHub
☆51Dec 12, 2025Updated 7 months ago
lucianotonet / groq-php
View on GitHub
A powerful PHP library for integration with the GroqCloud API
☆81Aug 8, 2025Updated 11 months ago
run-llama / liteparse-desktop
View on GitHub
Desktop app to parse PDF files locally and display markdown/bboxes. Powered by Tauri and liteparse
☆23Jul 8, 2026Updated 2 weeks ago
groq / groq-autosheet
View on GitHub
A browser spreadsheet with an integrated AI chat (with MCP support) powered by Groq inference
☆32Jul 16, 2026Updated last week
dataprofessor / builder
View on GitHub
☆24Jun 27, 2024Updated 2 years ago