Code example for pretraining an LLM with vanilla PyTorch training loop
☆10Jun 6, 2024Updated last year
Alternatives and similar repositories for LLMs-Pretraining-with-PyTorch
Users that are interested in LLMs-Pretraining-with-PyTorch are comparing it to the libraries listed below
Sorting:
- Variable Selection Network with PyTorch☆11May 29, 2024Updated last year
- ☆35Feb 20, 2026Updated last week
- Heart rate prediction from altitude changes, speed and cadence during running. Data collection was made using Garmin sports watch, fit f…☆25Jun 6, 2023Updated 2 years ago
- TensorFlow JS Experiments at Google I/O Extended Hanoi 2018☆20Jan 4, 2019Updated 7 years ago
- Fine-tuning Large Language Models (LLMs) for Text Classification Task☆35Jun 6, 2024Updated last year
- PyTorch solution of Vietnamese Named Entity Recognition task with Google AI's BERT model.☆23Dec 8, 2022Updated 3 years ago
- learn deep learning with framework pytorch☆35Nov 26, 2021Updated 4 years ago
- ☆16May 2, 2025Updated 10 months ago
- ITMO AI Talent Hub Speech Recognition and Generation course☆13May 22, 2025Updated 9 months ago
- Token classification using Phobert Models for Vietnamese☆13Jul 8, 2022Updated 3 years ago
- The Object-Oriented-Programming (OOP) version of the "Coffee Machine Project" from Dr. Angela Yu's Python Bootcamp (London App Brewery)☆16Jan 7, 2023Updated 3 years ago
- ☆36Apr 25, 2021Updated 4 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- various tools to download, convert and process the full text of scientific articles☆10Apr 2, 2024Updated last year
- Code of the exercises, tests and solutions for the programming tracks in exercism.org☆10Aug 22, 2025Updated 6 months ago
- ☆17Mar 28, 2025Updated 11 months ago
- ☆16Sep 4, 2025Updated 5 months ago
- Deep Unsupervised Learning Course Tracking☆10Oct 23, 2020Updated 5 years ago
- Read Image Files Concurrently☆17Sep 17, 2024Updated last year
- EMNLP 2025 | RouterLens☆28Sep 15, 2025Updated 5 months ago
- ☆12Apr 27, 2023Updated 2 years ago
- License plate reader source code☆10May 14, 2024Updated last year
- Demo of crawl 20 years lottery data and do EDA☆10May 6, 2021Updated 4 years ago
- Easy OCR demo + Invoice for Youtube☆11Jul 15, 2020Updated 5 years ago
- Script download ebook from VNUHCM Library☆12Mar 11, 2023Updated 2 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing☆10Jun 1, 2022Updated 3 years ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- Extract data insights and visualisations with natural language☆13May 27, 2024Updated last year
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- ☆13Jul 25, 2020Updated 5 years ago
- Demo of using Airflow☆11Jun 24, 2022Updated 3 years ago
- Chia tiền điện nước dễ dàng☆18Jan 2, 2026Updated 2 months ago
- Automatize local data analysis with team of tool-using GPT agents☆15Apr 1, 2024Updated last year
- Hackintosh GL552VX Mojave☆13May 23, 2019Updated 6 years ago
- ☆12Jul 15, 2021Updated 4 years ago
- This is the idea from scikit-learn to implement the task of multi-label for Chinese text.☆13Apr 17, 2017Updated 8 years ago
- 🧠 ResNet: Deep Residual Learning for Image Recognition☆10Sep 18, 2021Updated 4 years ago
- CS231n Convolutional Neural Networks for Visual Recognition☆12Aug 17, 2021Updated 4 years ago