cursor / evalLinks
☆138Updated 2 years ago
Alternatives and similar repositories for eval
Users that are interested in eval are comparing it to the libraries listed below
Sorting:
- Cognition's results and methodology on SWE-bench☆122Updated last year
- ☆106Updated 2 years ago
- ☆57Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- ☆80Updated 2 years ago
- Learn how to use logit bias with OpenAI models to create highly-powerful classifiers in minutes.☆34Updated 2 years ago
- Collection of ChatGPT plugins☆105Updated 2 years ago
- ☆45Updated 2 years ago
- GPT Index Extension for Google Chrome☆42Updated 2 years ago
- 🎸 Integrating AI plugins to LLMs☆229Updated 2 years ago
- Code generation with LLMs 🔗☆53Updated 2 years ago
- ☆85Updated 2 years ago
- ☆74Updated last year
- ☆36Updated last year
- Run and save the code in Chat-GPT directly in your browser, Supports upto 70+ languages.☆40Updated 11 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆78Updated last year
- My journey learning LangChain☆69Updated last year
- A Toolkit for Creating and Deploying LangChain Apps☆170Updated 2 years ago
- Aider's refactoring benchmark exercises based on popular python repos☆78Updated last year
- LLM finetuning☆42Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- ☆143Updated 2 years ago
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- ☆164Updated 4 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆137Updated 6 months ago
- 🐤 A minimal viable logger for Prompt/LLM Engineering. Use your IDE as Logging UI - a fast, simple, extensible, zero dependency Node.js l…☆144Updated last year
- automatically generate @openai plugins by specifying your API in markdown in smol-developer style☆118Updated 2 years ago
- Code examples from the ChatGPT plugin documentation.☆57Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆198Updated 2 years ago