Automated testing and benchmarking for code generation agents.
☆18Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for agenteval
Users that are interested in agenteval are comparing it to the libraries listed below
Sorting:
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 2 years ago
- Give topics & subtopics and generate wiki articles in markdown language with your openai api key☆13Jul 14, 2023Updated 2 years ago
- Chaining AI & API agents to streamline software development and achieve goals collaboratively.☆24Mar 3, 2024Updated 2 years ago
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated 2 years ago
- Benchmark results from code generation with LLMs☆17Sep 1, 2023Updated 2 years ago
- Code generation with LLMs 🔗☆53Aug 4, 2023Updated 2 years ago
- URS Benchmark: Evaluating LLMs on User Reported Scenarios☆30May 30, 2025Updated 9 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Feb 15, 2023Updated 3 years ago
- Request and collect feedback on messages using reacjis☆20Feb 26, 2026Updated 3 weeks ago
- Neural network class for molecular dynamics to predict potential energy, forces and non-adiabatic couplings.☆11Nov 10, 2022Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- OpenAI-powered JSON data generator.☆18Apr 7, 2023Updated 2 years ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Clarify your words with emojis☆12Aug 25, 2016Updated 9 years ago
- ☆16Mar 3, 2024Updated 2 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- FlexGen with docker☆29Mar 20, 2023Updated 3 years ago
- ☆16Nov 11, 2022Updated 3 years ago
- ☆22Jun 26, 2024Updated last year
- Codes for the paper "CausalCite: A Causal Formulation of Paper Citations" (2023)☆16Jan 11, 2024Updated 2 years ago
- ☆11Mar 10, 2023Updated 3 years ago
- always amend and --force push☆12Nov 28, 2017Updated 8 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- Open Pixel Control protocol☆15May 6, 2018Updated 7 years ago
- [DEPRICATED]Session exercises☆19Dec 19, 2024Updated last year
- Recursive self-improvement☆57Jan 27, 2024Updated 2 years ago
- Safari Reader Mode Source Code☆20Mar 5, 2024Updated 2 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- [UNMAINTAINED] Tessel 1's getting started page☆32Oct 26, 2015Updated 10 years ago
- JavaScript wrapper for Giphy's API.☆16Jan 29, 2018Updated 8 years ago
- The official code respository for "Rethinking the role of frames for SE(3)-invariant crystal structure modeling" (ICLR 2025)☆16Oct 16, 2025Updated 5 months ago
- Data and graphs for repos and events from We Build SG☆16Aug 29, 2018Updated 7 years ago
- Implementation of Variational Hierarchical User-based Conversation Model☆10Jul 2, 2021Updated 4 years ago
- Web VJing for everyone.☆11May 26, 2016Updated 9 years ago
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago
- Python implementation of SWEM (Simple Word-Embedding-based Methods)☆30Jun 21, 2022Updated 3 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Title says it all, doesn't it?☆21Aug 3, 2014Updated 11 years ago
- ☆17May 31, 2023Updated 2 years ago