Northern-System-Service / gpt4-autoeval
GPT-4 を用いて、言語モデルの応答を自動評価するスクリプト
☆16Updated 7 months ago
Alternatives and similar repositories for gpt4-autoeval:
Users that are interested in gpt4-autoeval are comparing it to the libraries listed below
- Japanese Language Model Financial Evaluation Harness☆70Updated last month
- ☆113Updated 2 weeks ago
- Preferred Generation Benchmark☆60Updated last week
- ☆44Updated last month
- ☆25Updated 2 months ago
- LLMとLoRAを用いたテキスト分類☆96Updated last year
- ☆83Updated last year
- 【2024年版】BERTによるテキスト分類☆27Updated 6 months ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆14Updated 6 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆119Updated 2 months ago
- ☆55Updated 7 months ago
- alpacaデータセットを日本語化したものです☆89Updated last year
- ☆167Updated 7 months ago
- Japanese LLaMa experiment☆52Updated last month
- ☆22Updated last year
- ☆15Updated 4 months ago
- ☆14Updated 4 months ago
- ☆22Updated last year
- ☆25Updated 3 weeks ago
- ☆17Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆149Updated 4 months ago
- LLaVA-JP is a Japanese VLM trained by LLaVA method☆59Updated 6 months ago
- Japanese instruction data (日本語指示データ)☆22Updated last year
- DeepLearningのAttentionモデルをPytorchの低レベルAPIを使って1から制作しようという試みのリポジトリです。☆46Updated last year
- ☆33Updated 5 months ago
- ☆52Updated 7 months ago
- Exploring Japanese SimCSE☆67Updated last year
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆47Updated 3 months ago
- ☆50Updated last year
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆17Updated 2 months ago