VikhrModels / DOoMLinks
Бенчмарк для оценки способности языковых моделей решать математические и физические задачи на русском языке
☆12Updated 2 months ago
Alternatives and similar repositories for DOoM
Users that are interested in DOoM are comparing it to the libraries listed below
Sorting:
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆43Updated 3 months ago
- Effective LLM Alignment Toolkit☆137Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆26Updated 3 months ago
- ☆13Updated last year
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 8 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 9 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 5 months ago
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings