Northern-System-Service / gpt4-autoeval
GPT-4 を用いて、言語モデルの応答を自動評価するスクリプト
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for gpt4-autoeval
- Japanese Language Model Financial Evaluation Harness☆66Updated 3 weeks ago
- LLMとLoRAを用いたテキスト分類☆93Updated last year
- Japanese LLaMa experiment☆52Updated 8 months ago
- alpacaデータセットを日本語化したものです☆89Updated last year
- ☆82Updated last year
- ☆102Updated this week
- ☆164Updated 5 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆118Updated 3 weeks ago
- ☆13Updated 2 months ago
- ☆50Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆146Updated 2 months ago
- ☆52Updated 5 months ago
- 【2024年版】BERTによるテキスト分類☆24Updated 4 months ago
- LLaVA-JP is a Japanese VLM trained by LLaVA method☆55Updated 4 months ago
- ☆21Updated last year
- Exploring Japanese SimCSE☆62Updated last year
- Japanese-BPEEncoder☆39Updated 3 years ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆44Updated 2 months ago
- DeepLearningのAttentionモデルをPytorchの低レベルAPIを使って1から制作しようという試みのリポジトリです。☆43Updated last year
- ☆52Updated 5 months ago
- ☆142Updated last year
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆24Updated last month
- ☆25Updated 5 months ago
- ☆33Updated 3 months ago
- ☆22Updated 11 months ago
- A Slack Bot for summarizing arXiv papers, powered by OpenAI LLMs.☆68Updated last year
- Japanese instruction data (日本語指示データ)☆22Updated last year
- ☆168Updated last month
- 「大規模言語モデル入門」(2023)と「大規模言語モデル入門Ⅱ〜生成型LLMの実装と評価」(2024)のGitHubリポジトリ☆332Updated last month
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆48Updated 8 months ago