IlyaGusev / ping_pong_benchView external linksLinks
A benchmark for role-playing language models
☆115May 25, 2025Updated 8 months ago
Alternatives and similar repositories for ping_pong_bench
Users that are interested in ping_pong_bench are comparing it to the libraries listed below
Sorting:
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆19Aug 1, 2025Updated 6 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆19Feb 8, 2026Updated last week
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 5 months ago
- Language modeling and instruction tuning for Russian☆466Aug 20, 2024Updated last year
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- The tiniest sentence encoder for Russian language☆245Jul 25, 2024Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Oct 7, 2024Updated last year
- ☆25Nov 13, 2025Updated 3 months ago
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- This is the repo for DenseAttention and DANet - fast and conceptually simple modification of standard attention and Transformer☆19Dec 29, 2025Updated last month
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆67May 28, 2024Updated last year
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated 11 months ago
- ☆20Mar 25, 2025Updated 10 months ago
- Проект языковой модели для проведения морфемного анализа, сегментации и токенизации слов русского языка.☆16Jan 10, 2025Updated last year
- Memory Agent monorepo☆81Oct 9, 2025Updated 4 months ago
- Python package wrapping llama.cpp for on-device LLM inference☆100Oct 12, 2025Updated 4 months ago
- Fast CosyVoice3 inference with tensorRT and tensorRT-LLM☆46Jan 17, 2026Updated 3 weeks ago
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- URL shortener based on .NET 6☆18Sep 22, 2022Updated 3 years ago
- ☆15Jul 13, 2024Updated last year
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Sep 22, 2025Updated 4 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆172Dec 25, 2025Updated last month
- Latest SillyTavern version with Poe integration. This version only adds the Poe connection achieved by GlizzyChief to SillyTavern version…☆20Apr 27, 2024Updated last year
- ☆83Feb 28, 2025Updated 11 months ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 7 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 10 months ago
- [EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"☆38Aug 21, 2025Updated 5 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆54Sep 30, 2024Updated last year
- ROS2 x OpenTelemetry: End‑to‑End Telemetry for Robotics☆71Nov 16, 2025Updated 2 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Dec 8, 2025Updated 2 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Dec 10, 2024Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆37Jul 5, 2025Updated 7 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- ☆30Jun 25, 2024Updated last year
- A benchmark for emotional intelligence in large language models☆400Jul 26, 2024Updated last year