ShinoharaHare / LLM-TrainingLinks
A distributed training framework for large language models powered by Lightning.
☆24Updated 5 months ago
Alternatives and similar repositories for LLM-Training
Users that are interested in LLM-Training are comparing it to the libraries listed below
Sorting:
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM.☆52Updated 6 months ago
- Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition syst…☆86Updated 5 months ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆58Updated last year
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆106Updated 4 months ago
- just collections about Llama2☆44Updated last year
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆35Updated 6 months ago
- Evaluation code for benchmarking VLMs in traditional chinese understanding☆13Updated 2 weeks ago
- ☆50Updated last month
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆137Updated 2 years ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆172Updated last year
- 台灣閩南語大型語言模型 (Taiwanese Hokkien LLMs)☆53Updated last year
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆92Updated 2 weeks ago
- Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"☆21Updated 3 weeks ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆271Updated 11 months ago
- Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.☆85Updated 2 years ago
- ☆20Updated last year
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆53Updated 10 months ago
- ☆76Updated 3 months ago
- finetune llama2 with traditional chinese dataset☆39Updated 2 years ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆124Updated last year
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆119Updated 5 months ago
- Leaderboard and code for "Speech-IFEval", Interspeech 2025☆23Updated 7 months ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆17Updated last year
- Official release of StyleTalk dataset.☆70Updated last year
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆20Updated 4 months ago
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆31Updated this week
- Code for DeSTA2.5-Audio, general-purpose LALM☆128Updated 3 weeks ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20Updated 7 months ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Updated last year
- Python script for manipulating the existing tokenizer.☆21Updated 3 weeks ago