mallik3006 / LLM_fine_tuning_llama3_8b
Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆19May 27, 2024Updated last year
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below
- ☆11May 16, 2025Updated 9 months ago
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Nov 13, 2025Updated 3 months ago
- Numbeo Unofficial API☆15Oct 16, 2022Updated 3 years ago
- [🎖️1st place (Minister's Award) solution] 2022 National Institute of Korean Language AI Language Ability Evaluation (aspect-based sentiment analysis of shopping mall review data)☆11Jun 6, 2023Updated 2 years ago
- A simple repository that showcases a few LLM evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago
- User Management Application built with Spring Boot, Thymeleaf & MySQL Database☆12Dec 20, 2024Updated last year
- An end-to-end ML project using MLflow for experiment tracking and model registry, Prefect for workflow orchestration, and S3 for artifacts s…☆12Sep 11, 2022Updated 3 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Data Science & Machine Learning Project applied to Healthcare☆15Dec 1, 2021Updated 4 years ago
- diffusers with search engine☆12Jan 13, 2026Updated last month
- Multi-hop Evidence Retrieval for Cross-document Relation Extraction☆11Sep 1, 2023Updated 2 years ago
- https://deep-learning-101.github.io/Natural-Language-Processing Natural Language Processing☆13Feb 1, 2026Updated 2 weeks ago
- Active learning symbolic regression CFD + AI = Wow☆17Apr 21, 2022Updated 3 years ago
- Training PyTorch Faster-RCNN on custom dataset☆14Jun 2, 2021Updated 4 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated last month
- ☆15Jan 21, 2025Updated last year
- ☆14Jan 12, 2025Updated last year
- An example project demonstrating how to perform OCR with multi-modal LLMs☆10Mar 14, 2024Updated last year
- ☆12Feb 22, 2023Updated 2 years ago
- Experiment with NVIDIA Triton and Whisper☆15Apr 29, 2024Updated last year
- ☆12Dec 24, 2024Updated last year
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023)☆17Oct 26, 2023Updated 2 years ago
- Functions that use Markov chains to generate random sentences.☆15Feb 1, 2020Updated 6 years ago
- ☆11Aug 15, 2023Updated 2 years ago
- ☆12Jul 14, 2021Updated 4 years ago
- ☆21Apr 4, 2025Updated 10 months ago
- ☆55Sep 5, 2025Updated 5 months ago
- Data Augmentation Toolkit for Korean text.☆52Nov 16, 2021Updated 4 years ago
- The project aims to detect ships in million-pixel satellite images using different object detection algorithms. This makes use of variou…☆15Jun 28, 2020Updated 5 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Fine-tuning Mistral-7b with PEFT (Parameter-Efficient Fine-Tuning) and LoRA (Low-Rank Adaptation) on the Puffin dataset (multi-turn conversation…☆13Nov 23, 2023Updated 2 years ago
- Ghost blog configured for IBM Bluemix☆15Mar 18, 2016Updated 9 years ago
- A project comparing the implementations of a basic AI agent using Langchain and PydanticAI frameworks☆17Jan 27, 2025Updated last year
- Scrapy-based audio website crawler☆12Nov 11, 2016Updated 9 years ago
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- Dippy Synthetic Speech Subnet☆17Sep 11, 2025Updated 5 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- Feature selection for tabular datasets using advanced filter and wrapper methods☆18Mar 9, 2025Updated 11 months ago