PyTorch Implementation of GPT-2
☆33Sep 4, 2024Updated last year
Alternatives and similar repositories for gpt2-from-scratch
Users that are interested in gpt2-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Apr 27, 2026Updated last week
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Interpretating the latent space representations of attention head outputs for LLMs☆39Aug 13, 2024Updated last year
- Elixir: Train a Large Language Model on a Small GPU Cluster☆15Jun 8, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 45+ production-ready tutorials on data science, MLOps, and AI tools. All code is executable and adaptable for real projects.☆24Apr 7, 2026Updated last month
- Bitcoin wallet for AI agents. Stablecoins in, Bitcoin out. Keys in hardware enclaves. Works with OpenClaw, Claude Code, or any agent harn…☆31Mar 21, 2026Updated last month
- ☆26Apr 22, 2026Updated 2 weeks ago
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆20Aug 30, 2024Updated last year
- Machine Learning from Human Preferences☆32Mar 23, 2026Updated last month
- Localizing Memorized Sequences in Language Models☆22Oct 15, 2025Updated 6 months ago
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- ☆15Jun 25, 2024Updated last year
- OpenClaw Daily News (with Ollama + Telegram Quick Setup Guide) | 每日新聞兼快速安裝指南☆41Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This after-effects script helps users to build composition structure for twixtor effect over one or more layers with only a single click,…☆13Mar 20, 2022Updated 4 years ago
- Radial Basis Function based N-dimensional scattered data interpolation for interpolating between unlimited multi-dimensional vectors in N…☆14Mar 2, 2021Updated 5 years ago
- ☆29Dec 15, 2025Updated 4 months ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆36Nov 13, 2025Updated 5 months ago
- ☆33Apr 1, 2026Updated last month
- VST that combines the classic mdaPiano and EPiano in a new plug-in☆22Oct 10, 2025Updated 6 months ago
- A Python package to make Stable Diffusion Image Generation ridiculously easy☆18Jul 1, 2024Updated last year
- A machine-readable constitution for AI — Soul’s creativity hardened into ÆON☆17Sep 14, 2025Updated 7 months ago
- An AlphaZero engine for Saiblo Connect4, featuring a pure Python implementation of key KataGo techniques.☆16Apr 21, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Kubernetes 集群部署 - 一镜到底纯手工部署 K8S 学习集群工作原理☆16Jan 12, 2025Updated last year
- GPS-Denied Indoor Navigation System for Drones Autonomous drone navigation indoors without GPS using optical flow, IMU, and lidar sensor …☆27Nov 20, 2025Updated 5 months ago
- Error monitor for Spring Boot☆15Nov 26, 2021Updated 4 years ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 10 months ago
- lightweight and scalable whole-body teleoperation framework for humanoid robots☆93Apr 21, 2026Updated 2 weeks ago
- ☆11Jan 24, 2025Updated last year
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Safe OS process execution for Elixir. Zero zombie processes, NIF-based backpressure, PTY support, and cgroup isolation.