Artefact2 / llm-eval
A super simple web interface to perform blind tests on LLM outputs.
☆28 Updated last year
Alternatives and similar repositories for llm-eval:
Users who are interested in llm-eval are comparing it to the repositories listed below:
- Scripts to create your own MoE models using MLX ☆89 Updated last year
- GRDN.AI app for garden optimization ☆70 Updated last year
- An implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- The DPAB-α Benchmark ☆19 Updated 3 months ago
- ☆38 Updated last year
- Testing LLM reasoning abilities with lineage relationship quizzes. ☆26 Updated last month
- ☆12 Updated 6 months ago
- LLM inference in C/C++ ☆71 Updated this week
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Implementation of nougat that focuses on processing PDFs locally. ☆81 Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆25 Updated 9 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆86 Updated last month
- Testing LLM reasoning abilities with family relationship quizzes. ☆62 Updated 2 months ago
- ☆66 Updated 10 months ago
- Distributed inference for MLX LLMs ☆87 Updated 8 months ago
- ☆53 Updated 11 months ago
- ☆112 Updated 3 months ago
- ☆153 Updated 9 months ago
- ☆31 Updated last year
- Eh, simple and works. ☆27 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- Embedding models from Jina AI ☆58 Updated last year
- Inference code for mixtral-8x7b-32kseqlen ☆99 Updated last year
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm… ☆43 Updated this week
- Generates grammar files from TypeScript for LLM generation ☆37 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- ☆30 Updated 9 months ago
- Port of Facebook's LLaMA model in C/C++ ☆20 Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year