dustalov / evalica
Evalica, your favourite evaluation toolkit
☆36Updated last week
Alternatives and similar repositories for evalica:
Users who are interested in evalica are comparing it to the libraries listed below
- Effective LLM Alignment Toolkit☆128Updated 3 weeks ago
- ☆31Updated 7 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆42Updated last month
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 7 months ago
- ☆22Updated last year
- Augmentex — a library for augmenting texts with errors☆63Updated 10 months ago
- Framework for processing and filtering datasets☆27Updated 9 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 4 months ago
- First-of-its-kind benchmark for evaluating the protection capabilities of large language model (LLM) guard systems☆15Updated last week
- A benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat☆61Updated last year
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆34Updated 2 years ago
- Top ML papers of the week.☆31Updated this week
- AI-generated text boundary detection with RoFT☆24Updated 8 months ago
- This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanE…☆23Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆25Updated last month
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated 11 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated 3 months ago
- ☆26Updated this week
- ☆57Updated last year
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 2 months ago
- T5-based (Russian) text normalization☆20Updated last year
- ☆47Updated 2 weeks ago
- A simple text normalizer for use before speech synthesis☆32Updated 11 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 5 months ago
- Question answering in Russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆29Updated 2 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆94Updated 8 months ago
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆15Updated 4 months ago
- ☆12Updated last year