Flexible evaluation tool for language models
☆58Mar 18, 2026Updated this week
Alternatives and similar repositories for flexeval
Users that are interested in flexeval are comparing it to the libraries listed below
Sorting:
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆17Jul 1, 2021Updated 4 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆125Nov 13, 2025Updated 4 months ago
- A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus☆10Jun 26, 2024Updated last year
- ☆29Apr 10, 2025Updated 11 months ago
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- Japanese instruction data (日本語指示データ)☆24Jul 13, 2023Updated 2 years ago
- ☆16Nov 19, 2023Updated 2 years ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 9 months ago
- 生成自動評価を行うためのPythonツール☆39Updated this week
- ☆35Dec 17, 2020Updated 5 years ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆56Sep 22, 2024Updated last year
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆87Updated this week
- ☆24Dec 15, 2023Updated 2 years ago
- 【2023年版】BERTによるテキスト分類☆236May 28, 2024Updated last year
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆111May 14, 2025Updated 10 months ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Nov 3, 2023Updated 2 years ago
- Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]☆26Aug 13, 2024Updated last year
- ☆19May 23, 2024Updated last year
- A soft and fast pattern matcher for billion-scale corpora.☆75Feb 26, 2025Updated last year
- JGLUE: Japanese General Language Understanding Evaluation☆337Mar 31, 2025Updated 11 months ago
- ☆11Sep 7, 2021Updated 4 years ago
- This is a repository of yohei's lecture pdf of 2018 Cookpad Summer Internship 5 DAY R&D.☆28Mar 4, 2019Updated 7 years ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark☆18Mar 3, 2026Updated 2 weeks ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated last year
- 📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information☆131Mar 15, 2023Updated 3 years ago
- Whisper of the arxiv: read comments in tex of papers☆33May 16, 2018Updated 7 years ago
- Tutorials for PyTorch Geometric(PyG)☆20Jan 6, 2020Updated 6 years ago
- Preferred Generation Benchmark☆92Mar 6, 2026Updated 2 weeks ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆121Oct 6, 2025Updated 5 months ago
- full text search engine based on compact data structures☆13Jan 26, 2015Updated 11 years ago
- Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)☆17May 16, 2022Updated 3 years ago
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 4 months ago
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- ☆16Mar 4, 2024Updated 2 years ago
- A simple implementation of SimCSE☆78Oct 31, 2022Updated 3 years ago
- Japanese-BPEEncoder☆41Sep 12, 2021Updated 4 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Nov 28, 2020Updated 5 years ago