Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
☆87Jan 30, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-Pretrain-SFT
Users that are interested in LLM-Pretrain-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…☆27Aug 7, 2024Updated last year
- ☆28Apr 9, 2025Updated last year
- [IEEE TKDE] A LLM-based Recommender System with user&item Tokenizers and a generative retrieval paradigm.☆30Mar 11, 2026Updated 3 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code implementation of synthetic continued pretraining☆162Jan 6, 2025Updated last year
- Official Code for "Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai"☆22May 9, 2025Updated last year
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 5 years ago
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- ☆15Oct 11, 2019Updated 6 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Apr 29, 2024Updated 2 years ago
- Papers about event extraction and event relation extraction☆13May 17, 2023Updated 3 years ago
- 签证官揭开关于美国学生签证申请的谣言☆11May 30, 2018Updated 8 years ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆36Mar 10, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 6 years ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆18Jul 23, 2024Updated last year
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Big Data Resources and References☆13Sep 4, 2024Updated last year
- Graph Neural Networks for Drug Efficacy Prediction☆12Sep 11, 2022Updated 3 years ago
- This repository is the implementation of the ProTACT architecture, introduced in the paper "Prompt- and Trait Relation-aware Cross-prompt…☆22Feb 5, 2025Updated last year
- ☆15Apr 11, 2024Updated 2 years ago
- Simple Shell using C (Process API)☆11Nov 9, 2017Updated 8 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Jun 17, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ParetoDrug☆11Sep 3, 2024Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Aug 20, 2024Updated last year
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- Python implementation of the 15 puzzle game☆10Dec 30, 2016Updated 9 years ago
- ☆13Dec 29, 2021Updated 4 years ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated last year
- ☆10Dec 23, 2020Updated 5 years ago
- ☆20Aug 14, 2025Updated 10 months ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆613Apr 19, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Training hybrid models for dummies.☆31Nov 1, 2025Updated 7 months ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- ☆13Nov 11, 2022Updated 3 years ago
- A Java library for manipulating JSGF Grammars.☆12Dec 30, 2021Updated 4 years ago
- ☆12Feb 22, 2023Updated 3 years ago
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s…☆12Sep 11, 2022Updated 3 years ago