dustalov / evalicaLinks
Evalica, your favourite evaluation toolkit
☆38Updated 2 weeks ago
Alternatives and similar repositories for evalica
Users that are interested in evalica are comparing it to the libraries listed below
Sorting:
- Effective LLM Alignment Toolkit☆137Updated 3 weeks ago
- ☆31Updated 9 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆43Updated 3 months ago
- Top ML papers of the week.☆33Updated this week
- Framework for processing and filtering datasets☆27Updated 11 months ago
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf…☆39Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 9 months ago
- ☆22Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆155Updated 6 months ago
- Augmentex — a library for augmenting texts with errors☆65Updated last year
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated 2 years ago
- ☆70Updated 10 months ago
- Tools for gathering and analyzing Reddit data using LLMs - much simplified version of the Reddit Answers.☆60Updated 2 months ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆33Updated 3 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆161Updated 6 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated last year
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated last year
- Reinforcement Learning Library.☆29Updated 2 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 5 months ago
- ☆18Updated 8 months ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Updated 4 months ago
- ☆18Updated 3 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 8 months ago
- MMLU eval for RU/EN☆15Updated last year
- AI-generated text boundary detection with RoFT☆24Updated 10 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated 10 months ago
- Automatic hyperparameters tuning for topic models (ARTM approach) using evolutionary algorithms☆27Updated 5 months ago
- ☆53Updated 4 months ago
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Updated 3 months ago