Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆21May 27, 2024Updated 2 years ago
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below.
- ☆11May 16, 2025Updated last year
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Sep 28, 2024Updated last year
- Simple shared preference project for Android. You can check the tutorial in Bengali☆12Oct 31, 2019Updated 6 years ago
- ☆11Apr 15, 2022Updated 4 years ago
- Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.☆21Oct 12, 2024Updated last year
- ☆13Apr 3, 2026Updated 2 months ago
- ☆24Apr 29, 2026Updated last month
- There is a Bengali Tutorial blog post for this repository. If you understand Bengali then check it out.☆11Jan 19, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 5 months ago
- Data Augmentation Toolkit for Korean text.☆52Nov 16, 2021Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s…☆12Sep 11, 2022Updated 3 years ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆15May 1, 2024Updated 2 years ago
- Code and notebooks and data for the paper "Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large L…☆12Jan 23, 2024Updated 2 years ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆32Jul 12, 2025Updated 11 months ago
- Machine Learning and Deep Learning Tutorial☆17Jan 4, 2026Updated 5 months ago
- [WIP] Better (FP8) attention for Hopper☆34Feb 24, 2025Updated last year
- Expanded KR-BERT by adding more training data☆13Apr 23, 2021Updated 5 years ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 7 months ago
- Traffic Light recognition using FasterRCNN in Pytorch☆11Jul 23, 2023Updated 2 years ago
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- Annotation builder to use segmentation in Mask_RCNN, even if your annotations are rectangular instead of polygon.☆15Feb 16, 2022Updated 4 years ago
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Apr 1, 2026Updated 2 months ago
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆27Jun 22, 2024Updated last year
- 2021 ~ present. NLP study notes☆20Feb 13, 2026Updated 4 months ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- ☆33Oct 30, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- Code for the MTEB leaderboard☆31Feb 4, 2025Updated last year
- This is a sample Flutter Weather Forecast App for Android and iOS, built without using any state management packages.☆32Aug 18, 2021Updated 4 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Check my Bengali tutorial post of Android File Upload service from this URL☆29Dec 22, 2020Updated 5 years ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆64Apr 12, 2026Updated 2 months ago
- Low memory full parameter finetuning of LLMs☆54Jul 18, 2025Updated 11 months ago
- e-books☆16Jul 20, 2018Updated 7 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 6 years ago
- ☆17Jan 31, 2025Updated last year
- Fine-tuned Llama 3 models for context-based question answering in the Bengali language.☆18Oct 14, 2024Updated last year