Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
☆87Jan 30, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-Pretrain-SFT
Users that are interested in LLM-Pretrain-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- Code implementation of synthetic continued pretraining☆158Jan 6, 2025Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated last month
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- ☆15Oct 11, 2019Updated 6 years ago
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆36Mar 10, 2024Updated 2 years ago
- ☆14Apr 7, 2025Updated last year
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 6 years ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆17Jul 23, 2024Updated last year
- ☆11Jul 11, 2023Updated 2 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Graph Neural Networks for Drug Efficacy Prediction☆12Sep 11, 2022Updated 3 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- Simple Shell using C (Process API)☆11Nov 9, 2017Updated 8 years ago
- A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.☆10Nov 22, 2022Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated last month
- JSGF Deducer based on JSGF grammar and WFST☆11Jan 11, 2018Updated 8 years ago
- ParetoDrug☆11Sep 3, 2024Updated last year
- [ICASSP'23] PAGE: A Position-Aware Graph-based model for Emotion cause entailment☆16Jun 1, 2023Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Aug 20, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Source code and dataset of EMNLP2017 paper "Incorporating Relation Paths in Neural Relation Extraction".☆39Nov 25, 2019Updated 6 years ago
- A script for merging a LLM model and a LoRA☆13Jun 22, 2023Updated 2 years ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- ☆11Sep 27, 2018Updated 7 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆413Jun 25, 2025Updated 10 months ago
- ☆13Dec 29, 2021Updated 4 years ago
- ☆15Sep 27, 2024Updated last year
- ☆10Dec 23, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆15Nov 14, 2022Updated 3 years ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆609Apr 19, 2026Updated last week
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 5 months ago
- Physical animations through reinforcement learning in Unity☆15Jun 10, 2021Updated 4 years ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆97Nov 15, 2023Updated 2 years ago
- Bias-controlled 3D generative framework for structure-based ligand design☆17Nov 2, 2022Updated 3 years ago