ShinoharaHare / LLM-Training
A distributed training framework for large language models powered by Lightning.
☆21Updated last month
Alternatives and similar repositories for LLM-Training:
Users that are interested in LLM-Training are comparing it to the libraries listed below
- Lightweight and Effective Chinese LLM.☆22Updated 2 months ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆49Updated 11 months ago
- Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition syst…☆79Updated 7 months ago
- ☆12Updated 4 months ago
- ☆45Updated last week
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆167Updated 3 weeks ago
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆136Updated 2 years ago
- ☆56Updated last week
- Code for paper "Patch-Level Training for Large Language Models"☆82Updated 5 months ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆10Updated last year
- finetune llama2 with traditional chinese dataset☆38Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆51Updated 2 months ago
- Official github repo for TMMLU+, Large scale traditional chinese massive multitask language understanding☆46Updated 9 months ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆64Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆83Updated 7 months ago
- JAX implementation of the bart-base model☆30Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated last year
- ☆12Updated last year
- just collections about Llama2☆44Updated 7 months ago
- ☆17Updated 3 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆90Updated last year
- 台灣閩南語大型語言模型 (Taiwanese Hokkien LLMs)☆35Updated 8 months ago
- LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets☆36Updated 6 months ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆83Updated 2 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- ROUGE score calculator with traditional chinese word segmentation☆9Updated 4 years ago
- ☆13Updated 3 months ago
- The official repository of Quamba☆39Updated 3 weeks ago
- LightThinker: Thinking Step-by-Step Compression☆35Updated last week
- ASR text preprocessing utility☆21Updated 8 months ago