SakanaAI / EDINET-Bench
Evaluating the performance of LLMs on challenging Japanese financial tasks.
☆18Updated 2 weeks ago
Alternatives and similar repositories for EDINET-Bench
Users who are interested in EDINET-Bench are comparing it to the repositories listed below.
- Japanese LLaMa experiment☆53Updated 6 months ago
- ☆26Updated 7 months ago
- ☆23Updated last year
- ☆16Updated last year
- ☆22Updated 4 months ago
- ☆60Updated last year
- Swallow Project large language model evaluation scripts☆17Updated 2 months ago
- [2024 edition] Text classification with BERT☆29Updated 11 months ago
- ☆34Updated 2 months ago
- ☆16Updated 9 months ago
- ☆47Updated 6 months ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆106Updated 4 months ago
- Extracts text from Aozora Bunko in a clean, usable form☆11Updated 4 years ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆20Updated 5 months ago
- Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral☆17Updated 5 months ago
- A Python tool for automatic evaluation of generated text☆26Updated last week
- ☆15Updated 4 months ago
- A lightweight framework for evaluating visual-language models.☆30Updated last week
- Mixtral-based Ja-En (En-Ja) Translation model☆19Updated 5 months ago
- ☆15Updated 9 months ago
- ☆17Updated last year
- ☆16Updated 5 months ago
- ☆11Updated last year
- ☆29Updated last year
- Japanese instruction data☆24Updated last year
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆20Updated last year
- A soft and fast pattern matcher for billion-scale corpora.☆57Updated 4 months ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆24Updated last year
- Japanese Language Model Financial Evaluation Harness☆75Updated last month
- ☆50Updated last year