dustalov / evalica
Evalica, your favourite evaluation toolkit
☆59 Updated last week
Alternatives and similar repositories for evalica
Users who are interested in evalica compare it to the libraries listed below.
- ☆31 Updated last year
- Effective LLM Alignment Toolkit ☆145 Updated 4 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆45 Updated 7 months ago
- Top ML papers of the week. ☆41 Updated this week
- Framework for processing and filtering datasets ☆28 Updated last year
- Convert MUSE from TensorFlow to PyTorch and ONNX ☆11 Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆165 Updated 9 months ago
- ☆22 Updated 2 years ago
- Tree-based indexes for neural search ☆32 Updated last year
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀 ☆94 Updated last year
- Datamodels for Hugging Face tokenizers ☆86 Updated this week
- NLP with Rust for Python 🦀🐍 ☆65 Updated 5 months ago
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf… ☆44 Updated last month
- ☆83 Updated 3 months ago
- Augmentex: a library for augmenting texts with errors ☆67 Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆62 Updated last year
- T5-based (Russian) text normalization ☆23 Updated last year
- Curriculum training of instruction-following LLMs with Unsloth ☆14 Updated 7 months ago
- This Russia Doesn't Exist ☆14 Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ☆68 Updated 4 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆160 Updated 10 months ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts). ☆35 Updated 3 years ago
- ☆70 Updated last year
- A benchmark for role-playing language models ☆107 Updated 5 months ago
- ☆55 Updated 7 months ago
- Efficiently computing & storing token n-grams from large corpora ☆26 Updated last year
- Tools for gathering and analyzing Reddit data using LLMs, a much simplified version of Reddit Answers. ☆67 Updated 5 months ago
- ☆43 Updated 2 weeks ago
- ☆20 Updated last year
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆19 Updated 8 months ago