Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
☆26Aug 7, 2024Updated last year
Alternatives and similar repositories for Pretraining-LLMs
Users that are interested in Pretraining-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)☆87Jan 30, 2024Updated 2 years ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated last year
- ☆20Aug 14, 2025Updated 9 months ago
- Proof of concept extension of sendme to use global content discovery☆22Jan 17, 2025Updated last year
- Cell-type Assignment and Module Extraction based on a heterogeneous graph neural network.☆10Oct 30, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Reverse engineered API for Quora's Poe - access Claude (Anthropic) and ChatGPT (OpenAI)☆15Sep 8, 2023Updated 2 years ago
- A collection of example for learning how to use Golang.☆14May 4, 2019Updated 7 years ago
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s…☆12Sep 11, 2022Updated 3 years ago
- 🤯 Thoughts Lab 是基于 uni-app 开发的『工具集小程序』,采用通用型分层架构,支持多端发布、跨框架迁移 🤞☆16Dec 13, 2022Updated 3 years ago
- [🎖️1등(장관상) 솔루션] 2022 국립국어원 인공 지능 언어 능력 평가 (쇼핑몰 리뷰 데이터 속성 기반 감성 분석 : Aspect-Based Sentiment Analysis)☆11Jun 6, 2023Updated 2 years ago
- ☆17Nov 12, 2025Updated 6 months ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- Tweet from Kivy on Android☆12Oct 8, 2013Updated 12 years ago
- ☆17Oct 14, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Cell-type Assignment and Module Extraction based on a heterogeneous graph neural network.☆13Jan 27, 2024Updated 2 years ago
- Pin any tweet your Twitter profile.☆18May 15, 2024Updated 2 years ago
- Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework (ICLR 2023)☆19Jun 14, 2023Updated 2 years ago
- rnsh is a command-line utility written in Python that facilitates shell sessions over Reticulum networks and aims to provide a similar ex…☆18Apr 26, 2026Updated last month
- A forum for the NomadNetwork☆18May 8, 2026Updated 2 weeks ago
- Javascript implementation of the Reticulum Network Stack☆22Feb 10, 2025Updated last year
- VT3D: a versatile Visualization Toolbox for 3D spatial transcriptomics atlas☆14Nov 6, 2024Updated last year
- 该存储库,主要存放根据自己所掌握的NGS差异表达分析技术方法为主,具体的实验以及方案设计等将不会存放在此存储库。☆15May 19, 2020Updated 6 years ago
- Debian package manager☆16Aug 29, 2009Updated 16 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 6 months ago
- Jupyter notebooks for course Finetuning Large Language Models, taught by Sharon Zhou (Lamini) and Andrew Ng (DeepLearning.AI).☆16Oct 21, 2023Updated 2 years ago
- Pretrained Language Model(from huggingface)을 사용하여 간단하게 비슷한 의미를 가지는 문장을 찾을 수 있는 metric을 제공☆13Jul 6, 2023Updated 2 years ago
- My implementation of the 15 puzzle game using the Pygame module with Python!☆11Oct 7, 2020Updated 5 years ago
- A Simple tool to organize my roadmaps.☆18May 20, 2026Updated last week
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 6 months ago
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- This project has moved to GitLab.com☆11Jun 4, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An R/Bioconductor package to identify higher-order molecular phenotypes☆19Jan 15, 2024Updated 2 years ago
- Implementation of the WWW'23 paper "Toward Degree Bias in Embedding-Based Knowledge Graph Completion"☆15Jun 17, 2023Updated 2 years ago
- OnLine Spectral Search ENgine for Proteomics big data using Apache Spark, Python/Flask, and AngularJS☆15Sep 14, 2015Updated 10 years ago
- Irc-style Chat Room for Reticulum Nomadnet☆29Oct 2, 2025Updated 7 months ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 4 years ago
- Program to plot a Ramachandran plot of all dihedral angles from a given PDB file. Background is empirically generated from the peptides …☆13Feb 25, 2025Updated last year
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago