VikhrModels / DOoM
A benchmark for evaluating the ability of language models to solve math and physics problems in Russian
☆12Updated 3 months ago
Alternatives and similar repositories for DOoM
Users that are interested in DOoM are comparing it to the libraries listed below
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆43Updated 4 months ago
- Effective LLM Alignment Toolkit☆139Updated last month
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆28Updated 4 months ago
- A benchmark comparing Russian counterparts of ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated last year
- ☆13Updated last year
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 5 months ago
- Top ML papers of the week.☆38Updated this week
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 9 months ago
- OmniFusion — a multimodal model to communicate using text and images☆229Updated last year
- Telegram bot for different language models. Supports system prompts and images☆59Updated last month
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆43Updated 3 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 10 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆31Updated 5 months ago
- Question-answer bot, using Retrieval-Augmented Generation method.☆15Updated last year
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆27Updated 2 years ago
- Multi-node distributed LLM training framework☆15Updated 2 weeks ago
- T5-based text normalization for Russian☆22Updated last year
- ☆18Updated 4 months ago
- Framework for processing and filtering datasets☆27Updated last year
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆21Updated last year
- Creating multimodal multitask models☆50Updated 2 years ago
- ☆31Updated 10 months ago
- Project website☆18Updated 11 months ago
- ☆118Updated 4 years ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- Notebooks for word embeddings article on Habrahabr☆22Updated 8 years ago
- A simple text normalizer for use before speech synthesis☆33Updated last year
- ☆159Updated 5 months ago
- RAG pipeline implementation example for the Russian language☆21Updated 2 years ago
- ☆48Updated last month