Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆21May 27, 2024Updated last year
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below
Sorting:
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Nov 13, 2025Updated 3 months ago
- Numbeo Unofficial API☆15Oct 16, 2022Updated 3 years ago
- A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated 11 months ago
- Scrapy抓取豆瓣图书☆10Aug 19, 2016Updated 9 years ago
- ☆12Jul 26, 2024Updated last year
- diffusers with search engine☆12Jan 13, 2026Updated last month
- ☆10May 14, 2020Updated 5 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Multi-hop Evidence Retrieval for Cross-document Relation Extraction☆11Sep 1, 2023Updated 2 years ago
- Data Science & Machine Learning Project applied to Healthcare☆16Dec 1, 2021Updated 4 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Training PyTorch Faster-RCNN on custom dataset☆14Jun 2, 2021Updated 4 years ago
- Basic openAI chat Bot on neo4j knowledge graph☆12Oct 4, 2023Updated 2 years ago
- ☆15Jan 12, 2025Updated last year
- Active learning symbolic regression CFD + AI = Wow☆17Apr 21, 2022Updated 3 years ago
- ☆12Feb 22, 2023Updated 3 years ago
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated 2 months ago
- https://deep-learning-101.github.io/Natural-Language-Processing Natural Language Processing (自然語言處理)☆14Mar 2, 2026Updated last week
- ☆11Aug 15, 2023Updated 2 years ago
- Language Collection☆14Dec 20, 2025Updated 2 months ago
- ☆12Jul 14, 2021Updated 4 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Not Actively Maintained - A simple web crawler desgined to showcase scalability with Scala and GridGain☆24Aug 18, 2020Updated 5 years ago
- Functions used Markov Chains to generate random sentences.☆15Feb 1, 2020Updated 6 years ago
- ☆22Apr 4, 2025Updated 11 months ago
- Data Augmentation Toolkit for Korean text.☆52Nov 16, 2021Updated 4 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- Coral Edge TPU compilable version of DeepLab V3☆14Jan 4, 2023Updated 3 years ago
- 基于scrapy的音频网站爬取☆12Nov 11, 2016Updated 9 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- A project comparing the implementations of a basic AI agent using Langchain and PydanticAI frameworks☆17Jan 27, 2025Updated last year
- Utilizing nbdev in Google Colaboratory☆14Apr 12, 2023Updated 2 years ago
- A python library for prediction of drug metabolites☆19Mar 19, 2018Updated 7 years ago
- ☆13Nov 18, 2014Updated 11 years ago
- Keeping track of all bitcoin seized and sold by the US Marshals & GSA☆15Mar 31, 2023Updated 2 years ago
- ☆17Jan 31, 2025Updated last year
- An open-source project building a customizable ChatGPT-like clone. Built with Django and Next.js, it features chat history, streaming res…☆16Mar 5, 2024Updated 2 years ago
- ☆15Mar 17, 2021Updated 4 years ago