Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆21May 27, 2024Updated 2 years ago
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Programming and numeric computing platform for math modeling and visualization with fully functional programming language☆12May 11, 2024Updated 2 years ago
- Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.☆21Oct 12, 2024Updated last year
- Transformer Architecture written with CUDA, C++ and LibTorch.☆10Jul 26, 2025Updated 10 months ago
- ☆31Sep 11, 2024Updated last year
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- There is a Bengali Tutorial blog post for this repository. If you understand Bengali then check it out.☆11Jan 19, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 4 months ago
- ☆11Feb 22, 2023Updated 3 years ago
- Experiment with NVIDIA Triton and Whisper☆15Apr 29, 2024Updated 2 years ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆32Jul 12, 2025Updated 10 months ago
- Expanded KR-BERT by adding more training data☆13Apr 23, 2021Updated 5 years ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 6 months ago
- Resources on personal finance and investing!☆13Aug 29, 2021Updated 4 years ago
- Traffic Light recognition using FasterRCNN in Pytorch☆11Jul 23, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12May 3, 2026Updated 3 weeks ago
- ☆15Jan 12, 2025Updated last year
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- 🧀 KoBART summarization using pytorch☆13Jun 7, 2023Updated 2 years ago
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Apr 1, 2026Updated last month
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆26Jun 22, 2024Updated last year
- Semantic and Instance Segmentation on iOS Using a Flask API — DeepLabV3+ and Mask R-CNN☆20Oct 3, 2020Updated 5 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the homework repository of Deep Learning for Human Language Processing☆10Oct 5, 2020Updated 5 years ago
- Numbeo Unofficial API☆17Oct 16, 2022Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 3 years ago
- ☆33Oct 30, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated 2 years ago
- diffusers with search engine☆12Jan 13, 2026Updated 4 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆63Apr 12, 2026Updated last month
- Low memory full parameter finetuning of LLMs☆54Jul 18, 2025Updated 10 months ago
- e-books☆16Jul 20, 2018Updated 7 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 6 years ago
- Fine tuned llama 3 models for context based question answering in bengali language.☆18Oct 14, 2024Updated last year
- ☆26Oct 13, 2023Updated 2 years ago
- Fine tuning Mistral-7b with PEFT(Parameter Efficient Fine-Tuning) and LoRA(Low-Rank Adaptation) on Puffin Dataset(multi-turn conversation…☆12Nov 23, 2023Updated 2 years ago