Automated testing and benchmarking for code generation agents.
☆18Jun 27, 2023Updated 3 years ago
Alternatives and similar repositories for agenteval
Users that are interested in agenteval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 3 years ago
- Give topics & subtopics and generate wiki articles in markdown language with your openai api key☆13Jul 14, 2023Updated 2 years ago
- Chaining AI & API agents to streamline software development and achieve goals collaboratively.☆24Mar 3, 2024Updated 2 years ago
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated 2 years ago
- Code generation with LLMs 🔗☆53Aug 4, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- URS Benchmark: Evaluating LLMs on User Reported Scenarios☆31May 30, 2025Updated last year
- Request and collect feedback on messages using reacjis☆20Feb 26, 2026Updated 4 months ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- OpenAI-powered JSON data generator.☆18Apr 7, 2023Updated 3 years ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Clarify your words with emojis☆12Aug 25, 2016Updated 9 years ago
- The Effect of Sampling Temperature on Problem Solving in Large Language Models☆25Nov 25, 2024Updated last year
- ☆16Mar 3, 2024Updated 2 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆22Jun 26, 2024Updated 2 years ago
- THINK LESS, SCREAM MORE!☆11Feb 17, 2016Updated 10 years ago
- Codes for the paper "CausalCite: A Causal Formulation of Paper Citations" (2023)☆16Jan 11, 2024Updated 2 years ago
- ☆10Oct 21, 2022Updated 3 years ago
- always amend and --force push☆12Nov 28, 2017Updated 8 years ago
- HTM Learning Algorithm Implementation for learning and generating musical sequences☆10Apr 14, 2015Updated 11 years ago
- Recursive self-improvement☆56Jan 27, 2024Updated 2 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 5 years ago
- Let's play with canvas drawing and WebAudio API, see if something interesting might appear. A first attempt to "Code like no one's watchi…☆11Jul 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [UNMAINTAINED] Tessel 1's getting started page☆32Oct 26, 2015Updated 10 years ago
- JavaScript wrapper for Giphy's API.☆16Jan 29, 2018Updated 8 years ago
- Data and graphs for repos and events from We Build SG☆16Aug 29, 2018Updated 7 years ago
- Web VJing for everyone.☆11May 26, 2016Updated 10 years ago
- Implementation of Variational Hierarchical User-based Conversation Model☆10Jul 2, 2021Updated 5 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated 2 years ago
- Basic Geometry and Linear Algebra library☆16Feb 14, 2023Updated 3 years ago
- Convert your Raspberry Pi into a DMX512 controller☆11Apr 14, 2024Updated 2 years ago
- Title says it all, doesn't it?☆21Aug 3, 2014Updated 11 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- New abstractions for Tessel Neopixels☆16Jul 15, 2020Updated 5 years ago
- RATT: A Thought Structure for Coherent and Correct LLM Reasoning☆15Jul 11, 2024Updated last year
- A Python client for the GraphSense REST interface.☆20Sep 5, 2025Updated 9 months ago
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- Render React components to DMX light systems.☆18Mar 24, 2018Updated 8 years ago
- Get ready to have your mind blown by the magic of vw CSS units and take your CSS acrobatics to the next level.☆15Jul 7, 2022Updated 3 years ago