Artefact2 / llm-evalLinks
A super simple web interface to perform blind tests on LLM outputs.
☆28Updated last year
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below
Sorting:
- Scripts to create your own moe models using mlx☆90Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Distributed Inference for mlx LLm☆93Updated last year
- Implementation of nougat that focuses on processing pdf locally.☆81Updated 7 months ago
- ☆161Updated 3 weeks ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆88Updated last year
- A guidance compatibility layer for llama-cpp-python☆36Updated last year
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated last week
- ☆38Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆31Updated 7 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 6 months ago
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆110Updated 2 years ago
- ☆86Updated last year
- ☆36Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆90Updated this week
- Generates grammer files from typescript for LLM generation☆38Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆112Updated last year
- ☆116Updated 8 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- GGUF implementation in C as a library and a tools CLI program☆284Updated this week
- Inference of Mamba models in pure C☆191Updated last year
- ☆46Updated last year
- Embedding models from Jina AI☆64Updated last year
- ☆88Updated 7 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Just a bunch of benchmark logs for different LLMs☆120Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago