A benchmark for role-playing language models
☆118May 25, 2025Updated 11 months ago
Alternatives and similar repositories for ping_pong_bench
Users that are interested in ping_pong_bench are comparing it to the libraries listed below.
- ☆31Sep 23, 2024Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆22Aug 1, 2025Updated 9 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆20Feb 8, 2026Updated 3 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- ☆60Dec 17, 2025Updated 5 months ago
- ☆27Nov 13, 2025Updated 6 months ago
- ESPNet TTS with Streamlit GUI☆14Apr 30, 2023Updated 3 years ago
- Language modeling and instruction tuning for Russian☆462Aug 20, 2024Updated last year
- The tiniest sentence encoder for Russian language☆245Jul 25, 2024Updated last year
- An automated pipeline for evaluating LLMs for role-playing.☆208Sep 14, 2024Updated last year
- Training and data processing code for Saiga☆54Jan 2, 2026Updated 4 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- ☆19Sep 29, 2024Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- ☆27Nov 3, 2023Updated 2 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆195Apr 2, 2026Updated last month
- Open Character Training☆84Apr 4, 2026Updated last month
- best llms in russian☆62May 23, 2024Updated 2 years ago
- [EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"☆42Aug 21, 2025Updated 9 months ago
- Text readability calculator for Japanese learners 🇯🇵☆23Oct 29, 2025Updated 6 months ago
- This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanE…☆24Apr 16, 2025Updated last year
- The Repository for the SillyTavern thinking engine!☆89Mar 2, 2026Updated 2 months ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 10 months ago
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆17Jan 10, 2025Updated last year
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆213May 28, 2024Updated last year
- ☆293May 27, 2025Updated 11 months ago
- ASR over WS and POST/GET (FAST_API). Can use many RU ASR models.☆19May 12, 2026Updated last week
- ☆12Aug 15, 2023Updated 2 years ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆38Apr 1, 2025Updated last year
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- ☆51Sep 3, 2025Updated 8 months ago
- ☆20Mar 25, 2025Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic.☆38Jan 14, 2025Updated last year
- Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas☆1,021Apr 9, 2026Updated last month
- Memory Agent monorepo☆87Oct 9, 2025Updated 7 months ago