SeunghyunSEO/optimized_hf_llama_class_for_training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SeunghyunSEO/optimized_hf_llama_class_for_training)

SeunghyunSEO / optimized_hf_llama_class_for_training

☆47

Alternatives and similar repositories for optimized_hf_llama_class_for_training

Users that are interested in optimized_hf_llama_class_for_training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apoorvkh / torchrunx
View on GitHub
Easily run PyTorch on multiple GPUs & machines
☆60May 2, 2026Updated 2 months ago
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
huggingface / peft-pytorch-conference
View on GitHub
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆15Oct 16, 2023Updated 2 years ago
JustlyAI / lmss_entity_extractor
View on GitHub
Tool to apply Legal Matter Specification Standard (LMSS) to documents
☆12Aug 15, 2024Updated last year
dream3d-ai / torch-submit
View on GitHub
☆10Dec 21, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 10 months ago
JungHoyoun / PromptCompressor
View on GitHub
☆12Apr 29, 2024Updated 2 years ago
davanstrien / data-for-fine-tuning-llms
View on GitHub
☆80Jun 5, 2024Updated 2 years ago
zirui-ray-liu / Exact
View on GitHub
☆21Mar 23, 2022Updated 4 years ago
upskyy / Paper-Review
View on GitHub
Paper Review about Speech Recognition · NLP
☆10Mar 25, 2021Updated 5 years ago
tunib-ai / joker
View on GitHub
AI model designed to test the effectiveness in handling external ethical attacks.
☆11Feb 9, 2026Updated 5 months ago
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
michaelfeil / candle-flash-attn-v3
View on GitHub
☆15Dec 21, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
detail-novelist / novelist-triton-server
View on GitHub
Deploy KoGPT with Triton Inference Server
☆14Nov 18, 2022Updated 3 years ago
upskyy / Automatic-Speech-Recognition-Models
View on GitHub
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10Jan 21, 2022Updated 4 years ago
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
Alignment-Lab-AI / datagen
View on GitHub
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32Sep 22, 2024Updated last year
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
cat-state / tinypar
View on GitHub
☆20Jul 12, 2023Updated 3 years ago
Nordeus / heroic-rl
View on GitHub
Reinforcement Learning Agent that plays Heroic - Magic Duel
☆15Jun 23, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
minyoungg / LTE
View on GitHub
☆71Jul 11, 2024Updated 2 years ago
nano-R1 / resources
View on GitHub
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Mar 16, 2025Updated last year
songys / AwesomeKorean_Speech
View on GitHub
음성인식과 신호처리
☆14Sep 12, 2021Updated 4 years ago
Kaixiong-Zhou / EGNN
View on GitHub
Energetic GraphNeural Networks (EGNN) implementation based on Dirichlet Energy Constrained Learning.
☆27Nov 1, 2021Updated 4 years ago
astramind-ai / BitMat
View on GitHub
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆155Oct 15, 2024Updated last year
TsinghuaC3I / FS-GEN
View on GitHub
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13Nov 19, 2024Updated last year
joey00072 / ohara
View on GitHub
Collection of autoregressive model implementation
☆84Jun 10, 2026Updated last month
phillip0726 / NaverBlog-Twitter-Youtube-crawling
View on GitHub
We can crawl NaverBlog, Twitter, Youtube!!
☆13Sep 13, 2019Updated 6 years ago
ahxt / G2R
View on GitHub
[WWW2022] Geometric Graph Representation Learning via Maximizing Rate Reduction
☆26May 27, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
naver-ai / elva
View on GitHub
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …
☆20Mar 13, 2026Updated 4 months ago
scaleapi / mrt
View on GitHub
https://scale.com/research/mrt
☆20Mar 16, 2026Updated 4 months ago
deepspeedai / deepspeed-gpt-neox
View on GitHub
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆21Nov 28, 2022Updated 3 years ago
abacaj / train-with-fsdp
View on GitHub
☆93Oct 5, 2023Updated 2 years ago
gpt4life / alpagasus
View on GitHub
Unofficial implementation of AlpaGasus
☆94Sep 23, 2023Updated 2 years ago
JonasGeiping / linear_cross_entropy_loss
View on GitHub
A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.
☆75Aug 2, 2024Updated last year
lovit / petitions_archive
View on GitHub
청와대 국민청원 데이터 아카이브
☆16Aug 29, 2020Updated 5 years ago