mrconter1 / PullRequestBenchmark
Evaluating LLMs' performance in PR reviews as an indicator of their capability in creating PRs.
☆12Updated last year
Alternatives and similar repositories for PullRequestBenchmark
Users who are interested in PullRequestBenchmark are comparing it to the repositories listed below
- Mistral7B playing DOOM☆139Updated last year
- Mistral7B playing DOOM☆29Updated last year
- Tokenflood is a load testing framework for simulating arbitrary loads on instruction-tuned LLMs☆44Updated 3 weeks ago
- Arduino-based USB rotary controller for arcade Arkanoid, Tempest, etc.☆76Updated last year
- One-Click RAG Implementation, Simple and Portable☆30Updated 4 months ago
- Platform for apps☆31Updated this week
- ☆35Updated last year
- Interactive Fiction in the Age of AI☆36Updated last week
- Implement recursion using English as the programming language and an LLM as the runtime.☆240Updated 2 years ago
- "It Runs Doom." "Zork?" "Yes." Wadzilla converts Doom WAD files into ZIL text format suitable for compilation to an Infocom-style game…☆39Updated last year
- Safe Python Code Execution Environment for Language Models☆17Updated this week
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser…☆28Updated 2 years ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆297Updated last month
- I was missing turboc, so I wanted to recreate and modernise the color scheme☆17Updated last year
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆190Updated last month
- Guards and protection agnostic to your model or provider☆40Updated last year
- ☆164Updated 10 months ago
- Experimental LLM Inference UX to aid in creative writing☆128Updated last year
- ☆127Updated 2 years ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Updated 6 months ago
- a curated list of data for reasoning ai☆141Updated last year
- Editor with LLM generation tree exploration☆83Updated 11 months ago
- Fork of primordial privateGPT being used as the back-end to the Memory Cache application☆31Updated 2 years ago
- AI Shell☆16Updated 2 years ago
- Fork of anarki Arc with changes to the news code to support twostopbits.com☆19Updated 3 weeks ago
- Large-Language-Model to Machine Interface project.☆19Updated 2 years ago
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆17Updated 5 years ago
- ☆115Updated last year
- Data about 349K OpenAI Custom GPTs☆149Updated last year
- ☆40Updated 4 months ago