Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, fine-tuning, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.
☆23Apr 16, 2026Updated this week
Alternatives and similar repositories for training-custom-llama
Users who are interested in training-custom-llama are comparing it to the libraries listed below.
- Benchmarks for different streaming models: iterators, generators, sequences, transducers, etc.☆15Aug 15, 2022Updated 3 years ago
- Top down operator precedence parser (also known as Pratt parser) implementation for OCaml. (Unreleased)☆11Feb 27, 2018Updated 8 years ago
- Popup dialog boxes for Bootstrap☆16Feb 9, 2024Updated 2 years ago
- runtime library and code-generator for BARE (https://baremessages.org/)☆23Aug 16, 2024Updated last year
- 🏴 A collection of awesome things for the tech community in Edinburgh☆18Apr 11, 2024Updated 2 years ago
- A meta-language for OCaml. (Unreleased)☆26Aug 29, 2025Updated 7 months ago
- Photo Gallery is a self-hosted performant application to organize your photos. Built for speed with React and Go, explore your photos qui…☆24Updated this week
- A Windows CLI tool for copying files from/to a portable device via MTP.☆18May 15, 2022Updated 3 years ago
- Web app to play the Core + Twin Shadows campaigns in solo or co-op modes!☆13Jun 19, 2019Updated 6 years ago
- Java logging viewer☆21Mar 11, 2017Updated 9 years ago
- "Snake's Food Hunt" is a competitive AI-driven game where two snakes learn to navigate, collect food, and avoid collisions using Deep Q-Le…☆10Nov 18, 2025Updated 5 months ago
- A web service for graph visualisation of authors and their coauthor communities of the DBLP Computer Science Bibliography☆13Jun 7, 2017Updated 8 years ago
- IO independent postgres protocol implementation☆18May 29, 2023Updated 2 years ago
- Datalog engine based on DuckDB☆10Mar 8, 2023Updated 3 years ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- Simple, secure and composable abstraction for construction of efficient data flows.☆15Apr 21, 2017Updated 8 years ago
- Disk space usage visualizer☆35Feb 1, 2026Updated 2 months ago
- Plugin for Qmmp (Qt-based Multimedia Player) to search and play music directly from YouTube.☆29Oct 4, 2023Updated 2 years ago
- BitVector Implementation in Nim☆17May 16, 2022Updated 3 years ago
- A floating offshore wind farm simulation and flow control framework using FLORIS, MoorPy, and deep reinforcement learning☆21Jan 28, 2026Updated 2 months ago
- A fast and frugal tree classifier for sklearn☆16Apr 8, 2026Updated last week
- line based patch, input is a unified diff☆24Nov 6, 2025Updated 5 months ago
- Ocaml library to access Amazon S3☆51Sep 30, 2025Updated 6 months ago
- Convert file extensions to MIME types☆23Oct 4, 2023Updated 2 years ago
- RESTful API for an online store☆12Mar 25, 2021Updated 5 years ago
- ☆11Apr 10, 2024Updated 2 years ago
- This plugin helps you to use kibana's notifications more usefully.☆32May 15, 2018Updated 7 years ago
- WIP kernel for Nubia Z17 mini NX569J NX569H, Z17 mini S NX589 and Z17 Lite NX591☆16Mar 17, 2019Updated 7 years ago
- Experiments with the tv box H96 MAX V58 that runs the new Rockchip RK3588 SoC.☆21Aug 11, 2023Updated 2 years ago
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆50Feb 5, 2026Updated 2 months ago
- AppOptics APM Instrumentation Agent for Node.js☆11Sep 3, 2024Updated last year
- Downloading videos from YouTube in C++☆26Nov 14, 2019Updated 6 years ago
- An open-source robot reinforcement learning platform using stable-baselines and OpenAI Gym☆10Mar 24, 2023Updated 3 years ago
- Cash is a Unix shell that is embedded within Objective Caml. It's a Caml implementation of (an as large as possible subset of) the API of…☆11Sep 7, 2013Updated 12 years ago
- [DEPRECATED] Utils library for Brazilian-specific businesses.☆14Jul 21, 2021Updated 4 years ago
- SQLite extension library, written in Lua and C. Based on EAV/CR storage, implements most of data schema refactoring patterns and more☆17Dec 26, 2020Updated 5 years ago
- An experiment in functionally-reactive Flux☆13Apr 28, 2015Updated 10 years ago
- Complete, typesafe representation of Vega-Lite in OCaml☆10Nov 13, 2017Updated 8 years ago
- OCaml implementation of sets as hash tables by Jean-Christophe Filliatre☆10Feb 13, 2025Updated last year