xyjigsaw/LLM-Pretrain-SFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xyjigsaw/LLM-Pretrain-SFT)

xyjigsaw / LLM-Pretrain-SFT

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

☆87

Alternatives and similar repositories for LLM-Pretrain-SFT

Users that are interested in LLM-Pretrain-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

daehuikim / WikiOFGraph
View on GitHub
Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…
☆15Nov 25, 2024Updated last year
ksm26 / Pretraining-LLMs
View on GitHub
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…
☆27Aug 7, 2024Updated last year
sunericd / SpatialAgingClock
View on GitHub
☆28Apr 9, 2025Updated last year
Quhaoh233 / TokenRec
View on GitHub
[IEEE TKDE] A LLM-based Recommender System with user&item Tokenizers and a generative retrieval paradigm.
☆30Mar 11, 2026Updated 3 months ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZitongYang / Synthetic_Continued_Pretraining
View on GitHub
Code implementation of synthetic continued pretraining
☆162Jan 6, 2025Updated last year
parinzee / seed-free-synthetic-instruct
View on GitHub
Official Code for "Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai"
☆22May 9, 2025Updated last year
BridgetteSong / BunchedLPCnet
View on GitHub
This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.
☆14Jun 17, 2021Updated 5 years ago
crypto-code / Music-Representation-Comparison
View on GitHub
This is the repo with the code to conduct a comparative analysis of different audio representation models.
☆12Aug 31, 2023Updated 2 years ago
MlWoo / WaveRNN-TF
View on GitHub
☆15Oct 11, 2019Updated 6 years ago
doheejin / SB_loss_PA
View on GitHub
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22Apr 29, 2024Updated 2 years ago
YangLock / EventPapersArchive
View on GitHub
Papers about event extraction and event relation extraction
☆13May 17, 2023Updated 3 years ago
ricefryegg / F1-Visa-Myths
View on GitHub
签证官揭开关于美国学生签证申请的谣言
☆11May 30, 2018Updated 8 years ago
fanqiwan / Explore-Instruct
View on GitHub
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
☆36Mar 10, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
erogol / TTS_tf
View on GitHub
WIP Tensorflow implementation of https://github.com/mozilla/TTS
☆15Apr 11, 2020Updated 6 years ago
zcli-charlie / ZIQI-Eval
View on GitHub
ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models
☆18Jul 23, 2024Updated last year
dathudeptrai / FastSpeech2
View on GitHub
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Aug 12, 2020Updated 5 years ago
NajiElKotob / Awesome-BigData
View on GitHub
Big Data Resources and References
☆13Sep 4, 2024Updated last year
zqfang / drugai
View on GitHub
Graph Neural Networks for Drug Efficacy Prediction
☆12Sep 11, 2022Updated 3 years ago
doheejin / ProTACT
View on GitHub
This repository is the implementation of the ProTACT architecture, introduced in the paper "Prompt- and Trait Relation-aware Cross-prompt…
☆22Feb 5, 2025Updated last year
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
danisfermi / myShell
View on GitHub
Simple Shell using C (Process API)
☆11Nov 9, 2017Updated 8 years ago
Lexsi-Labs / aligntune
View on GitHub
Aligntune : A Modular Toolkit for Post Training Alignment of LLMs
☆37Jun 17, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CNDOTA / ParetoDrug
View on GitHub
ParetoDrug
☆11Sep 3, 2024Updated last year
AI-S2-Lab / EmoPP
View on GitHub
[NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
☆22Aug 20, 2024Updated last year
ChristophAlt / fewrel
View on GitHub
Few-Shot Relation Extraction with AllenNLP
☆12Jan 27, 2019Updated 7 years ago
xeliot / 15PuzzleGame
View on GitHub
Python implementation of the 15 puzzle game
☆10Dec 30, 2016Updated 9 years ago
GIST-CSBL / DeSIDE-DDI
View on GitHub
☆13Dec 29, 2021Updated 4 years ago
JackKuo666 / LLM-BioDataExtractor
View on GitHub
This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…
☆17May 23, 2025Updated last year
sangho1130 / KOR_HCC
View on GitHub
☆10Dec 23, 2020Updated 5 years ago
chenyaofo / CCA-Attention
View on GitHub
☆20Aug 14, 2025Updated 10 months ago
ymcui / Chinese-Mixtral
View on GitHub
中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）
☆613Apr 19, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Muennighoff / FLAN
View on GitHub
Provides a minimal implementation to extract FLAN datasets for further processing
☆11Feb 1, 2023Updated 3 years ago
Zyphra / zcookbook
View on GitHub
Training hybrid models for dummies.
☆31Nov 1, 2025Updated 7 months ago
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
nayeon7lee / factuality_enhanced_lm_hf
View on GitHub
☆13Nov 11, 2022Updated 3 years ago
tsengia / JSGFKit
View on GitHub
A Java library for manipulating JSGF Grammars.
☆12Dec 30, 2021Updated 4 years ago
amine0110 / preporcess-volume-medical-imaging
View on GitHub
☆12Feb 22, 2023Updated 3 years ago
Nerdward / Mlops_project
View on GitHub
An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s…
☆12Sep 11, 2022Updated 3 years ago