g0t4 / llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
☆19Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-colosseum
- best llms in russian☆39Updated 5 months ago
- Telegram bot for different language models. Supports system prompts and images☆39Updated 3 weeks ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆24Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆58Updated last month
- Effective LLM Alignment Toolkit☆87Updated 3 weeks ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆93Updated 2 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆57Updated last year
- ☆53Updated last month
- Foundational Model for Speech Recognition Tasks☆113Updated 5 months ago
- Using transformers to generate Russian poetry☆35Updated last year
- CLIP implementation for Russian language☆139Updated last year
- ☆30Updated this week
- Framework for processing and filtering datasets☆25Updated 3 months ago
- Простой расстановщик ударений с обработкой омографов☆97Updated 3 weeks ago
- ☆18Updated 2 years ago
- Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos☆36Updated 2 years ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93Updated last year
- ☆26Updated last month
- ☆21Updated last year
- Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB☆84Updated 10 months ago
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆35Updated 3 weeks ago
- EasyPortrait - Face Parsing and Portrait Segmentation Dataset☆28Updated 2 months ago
- Сайт проекта☆17Updated 2 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆15Updated last month
- Experimental adventure game with AI-generated content☆108Updated 11 months ago
- Thin wrapper around OpenAI Whisper API with streaming support☆87Updated last month
- Enterprise RAG Challenge to test accuracy of different LLM-driven assistants☆29Updated 2 months ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated 9 months ago