jamesmurdza / humaneval-langchain
Benchmark results from code generation with LLMs
β17Updated last year
Alternatives and similar repositories for humaneval-langchain:
Users that are interested in humaneval-langchain are comparing it to the libraries listed below
- An LLM playground similar to the OpenAI API playgroundβ21Updated last year
- Website with current metrics on the fastest AI models.β40Updated 3 months ago
- Code generation with LLMs πβ54Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β22Updated last year
- Nexusflow function call, tool use, and agent benchmarks.β19Updated 2 months ago
- LLMs as Collaboratively Edited Knowledge Basesβ44Updated last year
- Automated testing and benchmarking for code generation agents.β18Updated last year
- MCP Server implementation for Claudeβ19Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hoursβ58Updated 6 months ago
- β18Updated 11 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particuβ¦β34Updated last year
- β51Updated 6 months ago
- Browser-based Voice Assistantβ44Updated last year
- β20Updated 3 weeks ago
- A Next.js version of Claude Aritfacts , inspired by llamacoderβ19Updated 4 months ago
- Opinionated Langchain setup with Qdrant vector store and Kong gatewayβ31Updated last year
- A semi-scalable system to scrape the chatgpt API to make input/output pairsβ38Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.β97Updated this week
- A function to do allβ35Updated 10 months ago
- β25Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1β¦β14Updated last year
- LLM finetuningβ42Updated last year
- g1: Using GPT-4o to create o1-like reasoning chainsβ20Updated 5 months ago
- A Python package to dynamically load functions for OpenAI Assistantβ55Updated last year
- Automatic conversation between 2 OpenAI GPT powered characters who participate in a Turing test together.β10Updated last year
- β20Updated 11 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated 7 months ago
- β36Updated last year