☆43 Jan 18, 2025 Updated last year
Alternatives and similar repositories for eval
Users that are interested in eval are comparing it to the libraries listed below.
- ☆13 Nov 7, 2025 Updated 6 months ago
- Implementation of 12 AI agents evaluation techniques ☆43 Jul 31, 2025 Updated 9 months ago
- Custom nodes for ComfyUI to generate empty latent space compatible with Hunyuan models for both image and video generation. ☆10 Dec 29, 2024 Updated last year
- Basic rover demo from Raspberry Pi with remote teleop over LiveKit ☆18 Jul 10, 2025 Updated 10 months ago
- ☆26 May 28, 2025 Updated 11 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 Aug 30, 2024 Updated last year
- ☆12 Jan 7, 2025 Updated last year
- PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents ☆28 Mar 26, 2025 Updated last year
- Docker image for Dataiku Science Studio ☆10 Apr 20, 2017 Updated 9 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. ☆14 Jul 11, 2020 Updated 5 years ago
- ☆13 Oct 14, 2024 Updated last year
- Streamlit application to keep GPT3 Experimentation sane ☆23 Jul 14, 2021 Updated 4 years ago
- Reusable AI coding agent skills for building voice AI with LiveKit ☆52 Feb 25, 2026 Updated 2 months ago
- An experiment in permeable publishing. ☆11 Jan 17, 2017 Updated 9 years ago
- Playwright (with stealth) Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and Mor… ☆21 Apr 9, 2025 Updated last year
- Experiment deploying RStudio to Google AppEngine ☆11 Sep 3, 2017 Updated 8 years ago
- MCP server for scheduling and running AI prompts, HTTP/webhook requests, and shell commands ☆19 Updated this week
- ☆19 May 1, 2026 Updated 3 weeks ago
- TikTag: Breaking ARM's Memory Tagging Extension with Speculative Execution (IEEE S&P 2025) ☆86 Nov 25, 2024 Updated last year
- ☆16 Apr 29, 2025 Updated last year
- ☆29 Jun 12, 2025 Updated 11 months ago
- A Node.js package to fetch statistics from the Chrome Web Store ☆14 May 6, 2024 Updated 2 years ago
- [ICML 2026] effGen: Enabling Small Language Models as Capable Autonomous Agents ☆163 Updated this week
- A lightweight code assistant with tool-using capabilities built on HuggingFace's smolagents. ☆41 Jun 11, 2025 Updated 11 months ago
- MNIST accelerator using binary quantization on Xilinx pynq-z2 ☆14 Sep 4, 2024 Updated last year
- |LIVE NOW| Meerkat API Documentation ☆10 Oct 12, 2015 Updated 10 years ago
- Easy to understand self-driving example for MonsterBorg ☆13 Oct 29, 2019 Updated 6 years ago
- ☆14 Mar 11, 2025 Updated last year
- An open-source simulator framework for neural processing units ☆39 Mar 23, 2026 Updated 2 months ago
- A platform for managing the submission and review of research outputs ☆10 Dec 14, 2022 Updated 3 years ago
- ☆12 Sep 1, 2024 Updated last year
- Framework which makes large scale crawling of URLs with VisibleV8 easy. ☆11 Jan 28, 2026 Updated 3 months ago
- Generic Differential Evolution for Rust ☆21 Sep 12, 2016 Updated 9 years ago
- PHP7.1 Fluent, immutable SQL query builder. ☆12 Jul 22, 2019 Updated 6 years ago
- ☆30 Nov 4, 2025 Updated 6 months ago
- Sample files for fuzzing ImageMagick ☆19 May 10, 2017 Updated 9 years ago
- Use MCP tools with Gemini Live API ☆25 Oct 6, 2025 Updated 7 months ago
- SynthEHRella is a benchmarking package used for evaluating synthetic Electronic Health Records (EHR) data generation methods. ☆18 Sep 17, 2025 Updated 8 months ago
- Define AI tools in YAML with natural language schemas. All tool usage is automatically stored in Qdrant vector database, enabling semanti… ☆24 Jul 5, 2025 Updated 10 months ago