Code example for pretraining an LLM with vanilla PyTorch training loop
☆10 · Jun 6, 2024 · Updated last year
Alternatives and similar repositories for LLMs-Pretraining-with-PyTorch
Users that are interested in LLMs-Pretraining-with-PyTorch are comparing it to the libraries listed below.
- Variable Selection Network with PyTorch ☆11 · May 29, 2024 · Updated last year
- ☆40 · Mar 12, 2026 · Updated last week
- ☆16 · Sep 4, 2025 · Updated 6 months ago
- PyTorch solution of Vietnamese Named Entity Recognition task with Google AI's BERT model. ☆23 · Dec 8, 2022 · Updated 3 years ago
- TensorFlow JS Experiments at Google I/O Extended Hanoi 2018 ☆20 · Jan 4, 2019 · Updated 7 years ago
- Token classification using Phobert Models for Vietnamese ☆13 · Jul 8, 2022 · Updated 3 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆31 · Sep 12, 2025 · Updated 6 months ago
- Fine-tuning Large Language Models (LLMs) for Text Classification Task ☆35 · Jun 6, 2024 · Updated last year
- Script to download ebooks from the VNUHCM Library ☆12 · Mar 11, 2023 · Updated 3 years ago
- Heart rate prediction from altitude changes, speed and cadence during running. Data collection was made using Garmin sports watch, fit f… ☆25 · Jun 6, 2023 · Updated 2 years ago
- “Simplicity is the ultimate sophistication” ☆20 · Nov 24, 2025 · Updated 3 months ago
- ITMO AI Talent Hub Speech Recognition and Generation course ☆13 · Mar 14, 2026 · Updated last week
- Demo of using Airflow ☆11 · Jun 24, 2022 · Updated 3 years ago
- CS231n Convolutional Neural Networks for Visual Recognition ☆12 · Aug 17, 2021 · Updated 4 years ago
- MMLU eval for RU/EN ☆15 · Jul 31, 2023 · Updated 2 years ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆39 · Mar 8, 2026 · Updated 2 weeks ago
- various tools to download, convert and process the full text of scientific articles ☆10 · Apr 2, 2024 · Updated last year
- Easy OCR demo + Invoice for Youtube ☆11 · Jul 15, 2020 · Updated 5 years ago
- EMNLP 2025 | RouterLens ☆29 · Sep 15, 2025 · Updated 6 months ago
- Demo of crawling 20 years of lottery data and doing EDA ☆11 · May 6, 2021 · Updated 4 years ago
- [DEPRECATED] AutoCrawler - automate extracting main information from website ☆16 · Jun 10, 2021 · Updated 4 years ago
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home ☆17 · May 17, 2025 · Updated 10 months ago
- An interactive GUI application to inpaint images ☆19 · May 6, 2022 · Updated 3 years ago
- ☆16 · May 2, 2025 · Updated 10 months ago
- Full stack Arduino temperature monitor ☆11 · Oct 31, 2017 · Updated 8 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 · Jun 1, 2024 · Updated last year
- Deep Unsupervised Learning Course Tracking ☆10 · Oct 23, 2020 · Updated 5 years ago
- ☆13 · Jul 25, 2020 · Updated 5 years ago
- Learn deep learning with the PyTorch framework ☆35 · Nov 26, 2021 · Updated 4 years ago
- F^3 is a Python-based framework for valuing forward-looking financial products on Heterogeneous Parallel Computing Platforms ☆13 · Jan 5, 2016 · Updated 10 years ago
- ☆36 · Apr 25, 2021 · Updated 4 years ago
- 🧠 ResNet: Deep Residual Learning for Image Recognition ☆10 · Sep 18, 2021 · Updated 4 years ago
- Breakout board for the 74HC4051 8-channel analog multiplexer/demultiplexer ☆12 · May 18, 2022 · Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering ☆110 · Dec 25, 2022 · Updated 3 years ago
- Demo of deploying a YOLOv6 model as an API ☆10 · Aug 6, 2022 · Updated 3 years ago
- ☆12 · Nov 1, 2023 · Updated 2 years ago
- The Object-Oriented-Programming (OOP) version of the "Coffee Machine Project" from Dr. Angela Yu's Python Bootcamp (London App Brewery) ☆16 · Jan 7, 2023 · Updated 3 years ago
- Large Language Models (LLMs) Learning Resources ☆19 · Jun 16, 2024 · Updated last year
- Labeled dataset of questions about people in Vietnamese ☆16 · Jul 30, 2015 · Updated 10 years ago