Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆21May 27, 2024Updated last year
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11May 16, 2025Updated 10 months ago
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Sep 28, 2024Updated last year
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆17Mar 26, 2025Updated last year
- ☆30Sep 11, 2024Updated last year
- ☆24Feb 4, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆25Jan 3, 2026Updated 2 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Code and notebooks and data for the paper "Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large L…☆12Jan 23, 2024Updated 2 years ago
- [🎖️1등(장관상) 솔루션] 2022 국립국어원 인공 지능 언어 능력 평가 (쇼핑몰 리뷰 데이터 속성 기반 감성 분석 : Aspect-Based Sentiment Analysis)☆11Jun 6, 2023Updated 2 years ago
- Experiment with NVIDIA Triton and Whisper☆15Apr 29, 2024Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆31Jul 12, 2025Updated 8 months ago
- ☆15Jan 21, 2025Updated last year
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Expanded KR-BERT by adding more training data☆13Apr 23, 2021Updated 4 years ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 4 months ago
- Jupyter notebooks for course Finetuning Large Language Models, taught by Sharon Zhou (Lamini) and Andrew Ng (DeepLearning.AI).☆16Oct 21, 2023Updated 2 years ago
- Training PyTorch Faster-RCNN on custom dataset☆14Jun 2, 2021Updated 4 years ago
- Data Science & Machine Learning Project applied to Healthcare☆16Dec 1, 2021Updated 4 years ago
- ☆12Jul 26, 2024Updated last year
- Annotation builder to use segmentation in Mask_RCNN, even if your annotations are rectangular instead of polygon.☆15Feb 16, 2022Updated 4 years ago
- 🧀 KoBART summarization using pytorch☆13Jun 7, 2023Updated 2 years ago
- Python port to the normalizer in https://github.com/twitter/twitter-korean-text☆13Apr 26, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Nov 13, 2025Updated 4 months ago
- ☆43Jan 27, 2026Updated 2 months ago
- Semantic and Instance Segmentation on iOS Using a Flask API — DeepLabV3+ and Mask R-CNN☆20Oct 3, 2020Updated 5 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Numbeo Unofficial API☆16Oct 16, 2022Updated 3 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- An agent for CUDA compute-communication kernel co-design☆33Updated this week
- memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B☆21May 26, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- diffusers with search engine☆12Jan 13, 2026Updated 2 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆62Feb 22, 2026Updated last month
- Low memory full parameter finetuning of LLMs☆54Jul 18, 2025Updated 8 months ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- Polish datsets for grammatical error correction☆12Oct 13, 2023Updated 2 years ago
- Open-source framework for turning expert knowledge into PII-free synthetic conversational data and production-ready LoRA adapters.☆58Updated this week