Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
☆87Jan 30, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-Pretrain-SFT
Users that are interested in LLM-Pretrain-SFT are comparing it to the libraries listed below
Sorting:
- Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…☆27Aug 7, 2024Updated last year
- ☆26Apr 9, 2025Updated 11 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Code implementation of synthetic continued pretraining☆156Jan 6, 2025Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated last week
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- ☆13Apr 7, 2025Updated 11 months ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- ☆13May 26, 2023Updated 2 years ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆19Mar 3, 2025Updated last year
- [ICASSP'23] PAGE: A Position-Aware Graph-based model for Emotion cause entailment☆16Jun 1, 2023Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Feb 3, 2023Updated 3 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Aug 20, 2024Updated last year
- Source code and dataset of EMNLP2017 paper "Incorporating Relation Paths in Neural Relation Extraction".☆39Nov 25, 2019Updated 6 years ago
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆13Jul 12, 2024Updated last year
- This repository provides the code for implementing RPG described in our KDD'25 paper "Generating Long Semantic IDs in Parallel for Recomm…☆119Sep 8, 2025Updated 6 months ago
- Python implementation of the 15 puzzle game☆10Dec 30, 2016Updated 9 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Jun 25, 2025Updated 8 months ago
- Neural-Guided Room Layout Generation with Bubble Diagram Constraints☆13May 19, 2023Updated 2 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆16May 23, 2025Updated 9 months ago
- ☆11Nov 12, 2025Updated 4 months ago
- ☆20Aug 14, 2025Updated 7 months ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆610Apr 30, 2024Updated last year
- Cross-Domain Deep Code Search with Few-Shot Learning☆11Jul 5, 2023Updated 2 years ago
- Cell-type Assignment and Module Extraction based on a heterogeneous graph neural network.☆10Oct 30, 2023Updated 2 years ago
- A repo for my miscellaneous codes☆11Dec 21, 2016Updated 9 years ago
- Shan Natural Language Processing tools inspired by PythaiNLP☆14Mar 1, 2026Updated 2 weeks ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Sep 16, 2020Updated 5 years ago
- NTIRE 2019 submission repository☆55Jan 6, 2020Updated 6 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- Xfce desktop including wine, playonlinux and pulseaudio.☆11Aug 11, 2022Updated 3 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- This repository contains the implementation of a Deep Deterministic Policy Gradient (DDPG) algorithm applied to solve the Reacher environ…☆12Apr 8, 2023Updated 2 years ago
- [🎖️1등(장관상) 솔루션] 2022 국립국어원 인공 지능 언어 능력 평가 (쇼핑몰 리뷰 데이터 속성 기반 감성 분석 : Aspect-Based Sentiment Analysis)☆11Jun 6, 2023Updated 2 years ago