Code example for pretraining an LLM with vanilla PyTorch training loop
☆10 · Jun 6, 2024 · Updated last year
Alternatives and similar repositories for LLMs-Pretraining-with-PyTorch
Users that are interested in LLMs-Pretraining-with-PyTorch are comparing it to the libraries listed below.
- Variable Selection Network with PyTorch ☆11 · May 29, 2024 · Updated last year
- ☆51 · Mar 12, 2026 · Updated 2 months ago
- ☆16 · Sep 4, 2025 · Updated 8 months ago
- PyTorch solution of Vietnamese Named Entity Recognition task with Google AI's BERT model. ☆23 · Dec 8, 2022 · Updated 3 years ago
- TensorFlow JS Experiments at Google I/O Extended Hanoi 2018 ☆20 · Jan 4, 2019 · Updated 7 years ago
- Token classification using Phobert Models for Vietnamese ☆13 · Jul 8, 2022 · Updated 3 years ago
- Fine-tuning Large Language Models (LLMs) for Text Classification Task ☆35 · Jun 6, 2024 · Updated last year
- Script download ebook from VNUHCM Library ☆12 · Mar 11, 2023 · Updated 3 years ago
- Heart rate prediction from altitude changes, speed and cadence during running. Data collection was made using Garmin sports watch, fit f… ☆26 · Jun 6, 2023 · Updated 2 years ago
- “Simplicity is the ultimate sophistication” ☆19 · Nov 24, 2025 · Updated 5 months ago
- ITMO AI Talent Hub Speech Recognition and Generation course ☆13 · Apr 16, 2026 · Updated last month
- Demo of using Airflow ☆11 · Jun 24, 2022 · Updated 3 years ago
- CS231n Convolutional Neural Networks for Visual Recognition ☆12 · Aug 17, 2021 · Updated 4 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆34 · Sep 12, 2025 · Updated 8 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆47 · Apr 22, 2026 · Updated last month
- MMLU eval for RU/EN ☆16 · Jul 31, 2023 · Updated 2 years ago
- AI Video Generator API — Veo 3, Openai Sora by GeminiGenAI. Create stunning AI videos with Google’s Veo 3 at up to 97% lower cost. Featur… ☆46 · Nov 7, 2025 · Updated 6 months ago
- various tools to download, convert and process the full text of scientific articles ☆10 · Apr 2, 2024 · Updated 2 years ago
- Easy OCR demo + Invoice for Youtube ☆11 · Jul 15, 2020 · Updated 5 years ago
- [EMNLP 2025] RouterLens ☆29 · Sep 15, 2025 · Updated 8 months ago
- Demo of crawl 20 years lottery data and do EDA ☆11 · May 6, 2021 · Updated 5 years ago
- [DEPRECATED] AutoCrawler - automate extracting main information from website ☆16 · Jun 10, 2021 · Updated 4 years ago
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home ☆19 · May 17, 2025 · Updated last year
- An interactive GUI application to inpaint images ☆19 · May 6, 2022 · Updated 4 years ago
- Full stack Arduino temperature monitor ☆11 · Oct 31, 2017 · Updated 8 years ago
- Deep Unsupervised Learning Course Tracking ☆10 · Oct 23, 2020 · Updated 5 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 · Jun 1, 2024 · Updated last year
- ☆13 · Jul 25, 2020 · Updated 5 years ago
- learn deep learning with framework pytorch ☆35 · Nov 26, 2021 · Updated 4 years ago
- F^3 is Python-based framework for valuing forward looking financial products on Heterogeneous Parallel Computing Platforms ☆13 · Jan 5, 2016 · Updated 10 years ago