Pretraining data reconstruction scripts for Apertus
☆124Oct 27, 2025Updated 6 months ago
Alternatives and similar repositories for pretrain-data
Users that are interested in pretrain-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Feb 25, 2024Updated 2 years ago
- Muon fsdp 2☆56Aug 8, 2025Updated 8 months ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 4 months ago
- The test set for Koala☆45Mar 31, 2023Updated 3 years ago
- An app which uses inpainting to create an infinitely scrolling image☆11Jun 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Super simple script to create Debian packages☆12Jan 11, 2022Updated 4 years ago
- API client for Aleph, supports bulk entity and document upload.☆29Mar 5, 2026Updated 2 months ago
- A license system and code obfuscator for Python.☆28Apr 17, 2021Updated 5 years ago
- Ladakh☆11Jan 27, 2023Updated 3 years ago
- [DEPRECATED] Ethereum Verified Contracts☆12Jun 29, 2018Updated 7 years ago
- standard form private license for developers☆13May 16, 2021Updated 4 years ago
- ☆33Apr 26, 2026Updated last week
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Python 3 alternative command line interface for AWS Route 53; enables simple record management and dynamic DNS☆11Apr 24, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆57Feb 10, 2025Updated last year
- ☆12Mar 15, 2024Updated 2 years ago
- Configurator provides a GUI interface for your application, dynamically generated from a JSON file.☆18May 7, 2024Updated last year
- Logrotate integration for Capistrano☆14Nov 12, 2025Updated 5 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 3 weeks ago
- win32 native frontend for llama-cli☆14Nov 2, 2024Updated last year
- The New Kids☆13Feb 22, 2026Updated 2 months ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Badgers: Bad Data Generators☆14Jan 29, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A living document about DIY room correction☆15Feb 10, 2020Updated 6 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- ☆10Jan 9, 2025Updated last year
- ☆70Mar 10, 2026Updated last month
- Web-based spreadsheet editor with support for real-time collaboration☆17Mar 17, 2022Updated 4 years ago
- Python Client Library for FROST.☆12Apr 29, 2026Updated last week
- capistrano deploy script for puma with nginx☆14Apr 28, 2019Updated 7 years ago
- ☆11Nov 2, 2024Updated last year
- an advanced level shifting neighbor☆11Dec 6, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 6 months ago
- An OS X kernel module that protects a userland process from being terminated in any way☆14Dec 7, 2015Updated 10 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Example code for the NNGeometry PyTorch library☆10Aug 20, 2025Updated 8 months ago
- ☆10Oct 16, 2017Updated 8 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- ☆20Jul 4, 2025Updated 10 months ago