Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, fine-tuning, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.
☆22Mar 24, 2026Updated this week
Alternatives and similar repositories for training-custom-llama
Users who are interested in training-custom-llama are comparing it to the libraries listed below.
- Benchmarks for different streaming models: iterators, generators, sequences, transducers, etc.☆15Aug 15, 2022Updated 3 years ago
- Top down operator precedence parser (also known as Pratt parser) implementation for OCaml. (Unreleased)☆11Feb 27, 2018Updated 8 years ago
- Popup dialog boxes for Bootstrap☆16Feb 9, 2024Updated 2 years ago
- runtime library and code-generator for BARE (https://baremessages.org/)☆23Aug 16, 2024Updated last year
- 🏴 A collection of awesome things for the tech community in Edinburgh☆18Apr 11, 2024Updated last year
- A meta-language for OCaml. (Unreleased)☆26Aug 29, 2025Updated 7 months ago
- Photo Gallery is a self-hosted performant application to organize your photos. Built for speed with React and Go, explore your photos qui…☆25Apr 12, 2025Updated 11 months ago
- A Windows CLI tool for copying files from/to a portable device via MTP.☆17May 15, 2022Updated 3 years ago
- Web app to play the Core + Twin Shadows campaigns in solo or co-op modes!☆13Jun 19, 2019Updated 6 years ago
- Java logging viewer☆21Mar 11, 2017Updated 9 years ago
- "Snake's Food Hunt" is a competitive AI-driven game where two snakes learn to navigate, collect food, and avoid collisions using Deep Q-Le…☆10Nov 18, 2025Updated 4 months ago
- A web service for graph visualisation of authors and their coauthor communities of the DBLP Computer Science Bibliography☆13Jun 7, 2017Updated 8 years ago
- IO independent postgres protocol implementation☆18May 29, 2023Updated 2 years ago
- Datalog engine based on DuckDB☆10Mar 8, 2023Updated 3 years ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- Simple, secure and composable abstraction for construction of efficient data flows.☆15Apr 21, 2017Updated 8 years ago
- Disk space usage visualizer☆32Feb 1, 2026Updated last month
- Plugin for Qmmp (Qt-based Multimedia Player) to search and play music directly from YouTube.☆29Oct 4, 2023Updated 2 years ago
- BitVector Implementation in Nim☆17May 16, 2022Updated 3 years ago
- A floating offshore wind farm simulation and flow control framework using FLORIS, MoorPy, and deep reinforcement learning☆20Jan 28, 2026Updated 2 months ago
- A fast and frugal tree classifier for sklearn☆16Feb 9, 2026Updated last month
- line based patch, input is a unified diff☆24Nov 6, 2025Updated 4 months ago
- OCaml library to access Amazon S3☆51Sep 30, 2025Updated 6 months ago
- Convert file extensions to MIME types☆23Oct 4, 2023Updated 2 years ago
- RESTful API for an online store☆12Mar 25, 2021Updated 5 years ago
- ☆11Apr 10, 2024Updated last year
- This plugin helps you to use kibana's notifications more usefully.☆32May 15, 2018Updated 7 years ago
- WIP kernel for Nubia Z17 mini NX569J NX569H, Z17 mini S NX589 and Z17 Lite NX591☆16Mar 17, 2019Updated 7 years ago
- Experiments with the tv box H96 MAX V58 that runs the new Rockchip RK3588 SoC.☆21Aug 11, 2023Updated 2 years ago
- AppOptics APM Instrumentation Agent for Node.js☆11Sep 3, 2024Updated last year
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆49Feb 5, 2026Updated last month
- Downloading videos from YouTube in C++☆26Nov 14, 2019Updated 6 years ago
- An open source robot reinforcement learning platform using stable-baselines and OpenAI Gym☆10Mar 24, 2023Updated 3 years ago
- Cash is a Unix shell that is embedded within Objective Caml. It's a Caml implementation of (an as large as possible subset of) the API of…☆11Sep 7, 2013Updated 12 years ago
- SQLite extension library, written in Lua and C. Based on EAV/CR storage, implements most of data schema refactoring patterns and more☆17Dec 26, 2020Updated 5 years ago
- [DEPRECATED] Utils library for Brazilian-specific businesses.☆14Jul 21, 2021Updated 4 years ago
- An experiment in functionally-reactive Flux☆13Apr 28, 2015Updated 10 years ago
- Complete, typesafe representation of Vega-Lite in OCaml☆10Nov 13, 2017Updated 8 years ago
- OCaml implementation of sets as hash tables by Jean-Christophe Filliatre☆10Feb 13, 2025Updated last year