Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆27 · Aug 7, 2024 · Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users who are interested in Pretraining-LLMs are comparing it to the libraries listed below.
- ⚡️ Design, generate, iterate and live preview Next.js pages with 🤖 GPT! Would now be considered a v0 clone. Quick hack to generate & serve… ☆31 · Mar 3, 2024 · Updated 2 years ago
- ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models. ☆19 · Feb 25, 2025 · Updated last year
- A no-string API framework for deploying schema-based reasoning into third-party apps ☆23 · Mar 6, 2026 · Updated 2 weeks ago
- GPL-based sources for Intel NNP-I card ☆18 · Nov 13, 2023 · Updated 2 years ago
- Benchmark Large Language Models Reliably On Your Data ☆18 · Dec 27, 2025 · Updated 2 months ago
- Code and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm" ☆11 · Aug 15, 2024 · Updated last year
- ☆12 · Feb 22, 2023 · Updated 3 years ago
- A collection of examples for learning how to use Golang. ☆14 · May 4, 2019 · Updated 6 years ago
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s… ☆12 · Sep 11, 2022 · Updated 3 years ago
- tuimorphic choose-your-own-adventure story game ☆18 · Mar 3, 2026 · Updated 3 weeks ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet… ☆12 · Feb 22, 2024 · Updated 2 years ago
- Python wrapper around Yuta Mori's implementation of SA-IS suffix array construction. ☆12 · Oct 26, 2012 · Updated 13 years ago
- ☆38 · Jun 14, 2025 · Updated 9 months ago
- A forum for the NomadNetwork ☆18 · Mar 6, 2026 · Updated 2 weeks ago
- Expanded KR-BERT by adding more training data ☆13 · Apr 23, 2021 · Updated 4 years ago
- Calculate allowed interactions in QED ☆10 · Nov 2, 2022 · Updated 3 years ago
- A Node.js REST interface for an Apache Spark REPL which can execute JavaScript. ☆17 · Mar 21, 2016 · Updated 10 years ago
- For converting LLM datasets from one format into another. ☆22 · Nov 12, 2025 · Updated 4 months ago
- No Longer Maintained - Frame.js is a flow control library and script loader for Javascript applications ☆74 · Jan 18, 2014 · Updated 12 years ago
- Uses a pretrained language model (from huggingface) to provide a metric for easily finding sentences with similar meaning ☆13 · Jul 6, 2023 · Updated 2 years ago
- Named-Entity-Recognition Workshop ☆16 · May 27, 2019 · Updated 6 years ago
- Micro neural network with multi-dimensional layers, multi-shaped data, fully or locally meshing, conv2D, unconv2D, Qlearning, ... for tes… ☆10 · Jan 8, 2021 · Updated 5 years ago
- Implementation of learning rate finder in TensorFlow ☆12 · Mar 4, 2019 · Updated 7 years ago
- Code for "What really matters in matrix-whitening optimizers?" ☆23 · Oct 31, 2025 · Updated 4 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed ☆21 · May 27, 2024 · Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 · Nov 4, 2024 · Updated last year
- This project has moved to GitLab.com ☆11 · Jun 4, 2018 · Updated 7 years ago
- TensorFlow tf.metrics tutorial ☆12 · Aug 30, 2018 · Updated 7 years ago
- A very simple app that displays random inspirational quotations. Written in Python using the Kivy library for cross-platform support (And… ☆13 · Nov 2, 2018 · Updated 7 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset ☆14 · Feb 2, 2025 · Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors ☆11 · Apr 14, 2022 · Updated 3 years ago
- An n-body simulation of our solar system written in Python ☆11 · Dec 6, 2021 · Updated 4 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation ☆14 · Jan 2, 2026 · Updated 2 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation. ☆15 · Mar 22, 2023 · Updated 3 years ago
- ☆13 · Aug 28, 2018 · Updated 7 years ago
- Codes for Evolving Plastic ANNs ☆14 · Dec 18, 2022 · Updated 3 years ago
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization) ☆18 · Jan 9, 2025 · Updated last year
- The official baseline implementations for Chronocept ☆10 · Dec 21, 2025 · Updated 3 months ago
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models ☆19 · Sep 11, 2024 · Updated last year