SakanaAI / EDINET-BenchLinks
Evaluating the performance of LLMs on Japanese challenging financial tasks.
☆24Updated 3 months ago
Alternatives and similar repositories for EDINET-Bench
Users that are interested in EDINET-Bench are comparing it to the libraries listed below
Sorting:
- Swallowプロジェクト 事後学習済み大規模言語モデル 評価フレームワーク☆21Updated 2 weeks ago
 - Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆118Updated 3 weeks ago
 - ☆61Updated last year
 - Japanese LLaMa experiment☆53Updated 2 weeks ago
 - ☆27Updated 11 months ago
 - A Chrome extension that helps you translate Kaggle notebook with translate engine like Google Translate.☆34Updated 7 months ago
 - ☆38Updated 6 months ago
 - Swallowプロジェクト 大規模言語モデル 評価スクリプト☆22Updated last month
 - Ongoing Research Project for continaual pre-training LLM(dense mode)☆42Updated 8 months ago
 - ☆24Updated last year
 - ☆23Updated 9 months ago
 - ☆50Updated last year
 - Support Continual pre-training & Instruction Tuning forked from llama-recipes☆33Updated last year
 - ☆49Updated 10 months ago
 - Project of llm evaluation to Japanese tasks☆90Updated last week
 - A lightweight framework for evaluating visual-language models.☆38Updated last week
 - JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆37Updated last month
 - 生成自動評価を行うためのPythonツール☆31Updated 2 weeks ago
 - ☆20Updated last year
 - 【2024年版】BERTによるテキスト分類☆29Updated last year
 - Flexible evaluation tool for language models☆52Updated 3 weeks ago
 - ☆139Updated this week
 - Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 9 months ago
 - ☆22Updated 2 years ago
 - ☆17Updated last month
 - ☆16Updated last year
 - Mixtral-based Ja-En (En-Ja) Translation model☆19Updated 9 months ago
 - ☆16Updated 10 months ago
 - Preferred Generation Benchmark☆85Updated last week
 - A soft and fast pattern matcher for billion-scale corpora.☆63Updated 8 months ago