rioyokotalab / Megatron-Llama2

2023 ABCI Llama-2 継続学習プロジェクト

☆13

Related projects ⓘ

Alternatives and complementary repositories for Megatron-Llama2

leia-llm / leia
LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
☆21Updated 6 months ago
kotoba-tech / kotoba-recipes
Support Continual pre-training & Instruction Tuning forked from llama-recipes
☆32Updated 8 months ago
iwiwi / epochraft-hf-fsdp
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆12Updated 9 months ago
swallow-llm / swallow-evaluation
Swallowプロジェクト大規模言語モデル評価スクリプト
☆10Updated 4 months ago
llm-jp / llm-jp-corpus
☆41Updated 9 months ago
wandb / llm-leaderboard
Project of llm evaluation to Japanese tasks
☆76Updated last month
llm-jp / llm-jp-sft
☆51Updated 5 months ago
lighttransport / japanese-llama-experiment
Japanese LLaMa experiment
☆50Updated 8 months ago
okoge-kaz / llm-recipes
Ongoing Research Project for continaual pre-training LLM(dense mode)
☆27Updated 2 weeks ago
huggingface / lm-evaluation-harness
A framework for few-shot evaluation of language models.
☆17Updated last week
HojiChar / HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆117Updated 2 weeks ago
llm-jp / llm-jp-eval
☆100Updated this week
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated 9 months ago
sociocom / JMED-LLM
JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models
☆42Updated last month
jungokasai / IgakuQA
☆43Updated last year
IBM / ensemble-instruct
codebase release for EMNLP2023 paper publication
☆19Updated 8 months ago
hotchpotch / JQaRA
JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット
☆23Updated last month
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆70Updated 6 months ago
iwiwi / epochraft
Checkpointable dataset utilities for foundation model training
☆32Updated 9 months ago
CarperAI / decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆25Updated last year
kotoba-tech / kotomamba
Mamba training library developed by kotoba technologies
☆67Updated 9 months ago
hppRC / simple-simcse
A simple implementation of SimCSE
☆74Updated 2 years ago
algomatic-inc / awesome-ai-agents-guide
🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.
☆43Updated 5 months ago
nlp-waseda / JMMLU
日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark
☆25Updated 8 months ago
ku-nlp / ja-vicuna-qa-benchmark
☆33Updated 3 months ago
masanorihirano / llm-japanese-dataset
LLM構築用の日本語チャットデータセット
☆78Updated 9 months ago
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆114Updated this week
ce-lery / japanese-mistral-300m-recipe
☆14Updated 2 months ago
explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆58Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆72Updated last month