talkiq/llm-evaluate

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/talkiq/llm-evaluate)

talkiq / llm-evaluate

☆11

Alternatives and similar repositories for llm-evaluate

Users that are interested in llm-evaluate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

talkiq / yaaredis
View on GitHub
Redis client for Python asyncio (w/ redis server, cluster, and sentinel support)
☆15Mar 23, 2023Updated 3 years ago
slab / nebulex_local_multilevel_adapter
View on GitHub
Nebulex adapter with a fast local L1 cache and a larger shared L2 cache
☆15Jun 20, 2023Updated 3 years ago
zoedsoupe / mentor
View on GitHub
LLM strutured outputs done in a composable and extensible way for elixir
☆22Jul 18, 2025Updated last year
noc-turne / LLM_Light_Testing
View on GitHub
本项目提出了一个基于python的大语言模型推理服务自动化测试框架，用于评估大语言模型的推理效果以及性能，具有易用性、易拓展性、高效性和可靠性等特点。
☆11Feb 26, 2026Updated 5 months ago
agoodway / livefilter
View on GitHub
A flexible and composable filtering library for LiveView using PgRest
☆26Jun 8, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cxysteven / MapBJ
View on GitHub
☆12Apr 13, 2017Updated 9 years ago
ahgraber / stopsloppypasta
View on GitHub
Sloppypasta | n. | Verbatim LLM output copy-pasted at someone, unread, unedited, and usually unrequested. From slop (low-quality AI-gener…
☆27Updated this week
YunjiaXi / InfoDeepSeek
View on GitHub
Code for InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation
☆19May 29, 2025Updated last year
JinBlack / bash-ai
View on GitHub
a command line tool that let you express commands in natural language using openai api..
☆40Dec 4, 2025Updated 7 months ago
Yushi-Hu / Multilingual-AWE
View on GitHub
☆12Nov 18, 2020Updated 5 years ago
rjmacarthy / gpt-code-reviewer
View on GitHub
Use ChatGPT to conduct code reviews on your pull requests.
☆15Jun 25, 2025Updated last year
zoulala / CCKS_QA
View on GitHub
REF//biendata.com/competition/CCKS2018_3/make-submission/
☆17Aug 12, 2018Updated 7 years ago
CognitiveAIGroup / IQTest
View on GitHub
☆16Sep 5, 2020Updated 5 years ago
EndingCredits / json2vec
View on GitHub
Implementation of Semantic Tree-structured Learning Algorithm (STRLA) for JSON data, aka. JSON Neural Network
☆21Apr 22, 2020Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
InstantMirage / Pathfinder2eModBG3
View on GitHub
A mod to convert BG3 to PF2 rules
☆22Updated this week
kazoo-classic / kazoo
View on GitHub
Kazoo v4.3 Community Fork
☆15Mar 15, 2026Updated 4 months ago
benjreinhart / server_sent_events
View on GitHub
Lightweight, ultra-fast, fully spec-conformant Server Sent Event parser
☆34May 20, 2026Updated 2 months ago
josevalim / schemecto
View on GitHub
Schemaless Ecto changesets with support for nesting and JSON Schemas
☆35Jan 13, 2026Updated 6 months ago
telephoneorg / docker-kazoo
View on GitHub
Kazoo Dockerized, for Kubernetes
☆20Dec 12, 2017Updated 8 years ago
taidopurason / tokenizer-extension
View on GitHub
☆15Dec 4, 2025Updated 7 months ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
ayminovitch / fine-tune-codebase
View on GitHub
Fine-Tune Codebase Model 🚀 A scalable and efficient tool for fine-tuning large language models (LLMs) on codebases. Supports LoRA, mixed…
☆42Jun 17, 2025Updated last year
aws-samples / evaluating-large-language-models-using-llm-as-a-judge
View on GitHub
☆22Jan 13, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mhansen / rtl_433_prometheus
View on GitHub
Prometheus time-series DB exporter for rtl_433 433MHz radio packet decoder
☆20Apr 8, 2025Updated last year
marcelog / elixir_mod_event
View on GitHub
Elixir client for the FreeSWITCH mod_event_socket
☆25Nov 20, 2023Updated 2 years ago
LM-Kit / lm-kit-net-samples
View on GitHub
.NET samples for LM-Kit.NET
☆48Updated this week
siatnlp / LegalQA
View on GitHub
A Chinese question answering dataset for legal advice.
☆26Apr 9, 2019Updated 7 years ago
samedhi / ctci
View on GitHub
Clojure solutions to "Cracking the Coding Interview"
☆31Jun 1, 2020Updated 6 years ago
pavanjava / qql
View on GitHub
SQL-like query language and CLI for Qdrant vector search engine
☆46Jun 13, 2026Updated last month
flowers2023 / lm-ken
View on GitHub
kenlm语言模型，并提供python的rest服务
☆30Aug 1, 2018Updated 7 years ago
dashbitco / nimble_ownership
View on GitHub
Lightweight ownership of resources across processes
☆65Oct 21, 2025Updated 9 months ago
georgeguimaraes / alike
View on GitHub
Semantic similarity testing for Elixir. Test LLM outputs, chatbots, and NLP in Elixir
☆44Jun 19, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yz1019117968 / ICPC-21-MMTrans
View on GitHub
Source Code for "A multi-modal transformer-based code summarization approach for smart contracts"
☆27Mar 16, 2021Updated 5 years ago
Joinn99 / RocketEval-ICLR
View on GitHub
🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist
☆17Aug 21, 2025Updated 11 months ago
ayushgupta4897 / embedDB
View on GitHub
EmbedDB is an ultra-lightweight vector database designed for rapid prototyping of semantic search and RAG applications. The entire implem…
☆21Mar 24, 2025Updated last year
Neutralzz / RefQA
View on GitHub
The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"
☆33Nov 25, 2020Updated 5 years ago
ayushgupta4897 / fast-dedupe
View on GitHub
A minimalist but optimized Python package for deduplication tasks leveraging RapidFuzz internally, enabling super-fast approximate duplic…
☆18Apr 2, 2025Updated last year
e0da / popout_for_youtube
View on GitHub
Popout for YouTube™ extension for Google Chrome™
☆26Nov 11, 2021Updated 4 years ago
drulabs / LocalDash
View on GitHub
Android local networking (NSD, Wi-Fi direct and Wi-Fi direct service discovery)
☆96Apr 16, 2019Updated 7 years ago