Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆25Aug 7, 2024Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users who are interested in Pretraining-LLMs are comparing it to the libraries listed below
- A comprehensive summary of the concepts and requirements of the CompTIA Security+ SY0-701 certification, based on the CompTIA Security Plus SY0-701 Exam Objectives document. Provides focused information…☆14Mar 14, 2025Updated 11 months ago
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)☆87Jan 30, 2024Updated 2 years ago
- An n-body simulation of our solar system written in Python☆11Dec 6, 2021Updated 4 years ago
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- Program to plot a Ramachandran plot of all dihedral angles from a given PDB file. Background is empirically generated from the peptides …☆12Feb 25, 2025Updated last year
- ☆15Oct 14, 2025Updated 4 months ago
- Simple Flutter State Management☆10Oct 30, 2025Updated 4 months ago
- Insurance Fraud Detection using Machine Learning☆13May 7, 2024Updated last year
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- ☆12Apr 21, 2025Updated 10 months ago
- ☆10Aug 15, 2022Updated 3 years ago
- Trading signals processing solution that supports signals filtering and posting to broker or exchanges that are not integrated into your …☆10May 9, 2021Updated 4 years ago
- ☆11Dec 5, 2024Updated last year
- ☆10Jul 1, 2023Updated 2 years ago
- Tiny AI model embedded in NES ROMs to generate character names in-game.☆29Sep 28, 2025Updated 5 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- RAG-based Web Scraping☆14Jul 22, 2024Updated last year
- NMT based SMILES to IUPAC Translator☆16Jul 16, 2025Updated 7 months ago
- ☆27Aug 14, 2025Updated 6 months ago
- Demos for the 2022 Many Electron Collaboration Workshop on PySCF☆12Jun 21, 2022Updated 3 years ago
- A visualization experience of AI/ML academic papers hosted on arXiv - for project work at the University of California, Berkeley MIDS pro…☆10Feb 10, 2023Updated 3 years ago
- HIP: Hessians with Interatomic Potentials☆28Feb 6, 2026Updated 3 weeks ago
- Ready to go Deno Starter Kit for back-end web server development.☆10Jul 24, 2024Updated last year
- ☆14Feb 6, 2025Updated last year
- ☆11Nov 12, 2025Updated 3 months ago
- ☆11Jan 8, 2026Updated last month
- OMNI-P2x: A universal neural network potential for excited states☆12Feb 26, 2026Updated last week
- ☆10Jan 3, 2023Updated 3 years ago
- macrogpt: full pretraining of a large model (1b3, 32 layers), multi-GPU deepspeed / single-GPU adafactor☆15Nov 30, 2023Updated 2 years ago
- Example smart contract setup for mapping an end-user wallet to an ownable smart contract as its 'proxy wallet'.☆10Apr 17, 2023Updated 2 years ago
- QUESTDB: A Database of Highly-Accurate Excitation Energies☆18Dec 9, 2025Updated 2 months ago
- Force Fields☆14Oct 25, 2022Updated 3 years ago
- Contains Jupyter notebooks and other materials prepared for the course Numerical Methods offered at TIFR Hyderabad (https://moldis-group.…☆12Dec 26, 2022Updated 3 years ago
- An Elixir wrapper of the ollama REST API☆12Feb 14, 2025Updated last year
- A program to automatically perform microkinetic modeling and generate microkinetic volcano plots for homogeneous catalysis reactions usin…☆14Jan 1, 2026Updated 2 months ago
- What Would Portland Do? Generative agent experience☆13Mar 13, 2024Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- An Image Recognition tutorial written for the HyperionDev blog☆10Dec 19, 2017Updated 8 years ago