rioyokotalab / Megatron-Llama2
2023 ABCI Llama-2 継続学習プロジェクト
☆13Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Megatron-Llama2
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆21Updated 6 months ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆32Updated 8 months ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆12Updated 9 months ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆10Updated 4 months ago
- ☆41Updated 9 months ago
- Project of llm evaluation to Japanese tasks☆76Updated last month
- ☆51Updated 5 months ago
- Japanese LLaMa experiment☆50Updated 8 months ago
- Ongoing Research Project for continaual pre-training LLM(dense mode)☆27Updated 2 weeks ago
- A framework for few-shot evaluation of language models.☆17Updated last week
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆117Updated 2 weeks ago
- ☆100Updated this week
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆42Updated last month
- ☆43Updated last year
- codebase release for EMNLP2023 paper publication☆19Updated 8 months ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆23Updated last month
- Finetune mistral-7b-instruct for sentence embeddings☆70Updated 6 months ago
- Checkpointable dataset utilities for foundation model training☆32Updated 9 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Mamba training library developed by kotoba technologies☆67Updated 9 months ago
- A simple implementation of SimCSE☆74Updated 2 years ago
- 🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.☆43Updated 5 months ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆25Updated 8 months ago
- ☆33Updated 3 months ago
- LLM構築用の日本語チャットデータセット☆78Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated this week
- ☆14Updated 2 months ago
- Reward Model framework for LLM RLHF☆58Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆72Updated last month