PyTorch Implementation of GPT-2
☆34Sep 4, 2024Updated last year
Alternatives and similar repositories for gpt2-from-scratch
Users who are interested in gpt2-from-scratch are comparing it to the libraries listed below.
- A library for simplifying training with multi-GPU setups in the HuggingFace / PyTorch ecosystem.☆16Updated this week
- ☆14Feb 5, 2025Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- ☆17Apr 6, 2026Updated last month
- Interpreting the latent space representations of attention head outputs for LLMs☆39Aug 13, 2024Updated last year
- ☆18Apr 9, 2025Updated last year
- Point Cloud Annotation Tool. Built with PPTK and PyQt.☆15May 18, 2023Updated 3 years ago
- ☆27Apr 22, 2026Updated last month
- Code repository of the paper "Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups" https://proceedings.mlr.press/…☆11Jul 20, 2022Updated 3 years ago
- This is a repository of Binary General Matrix Multiply (BGEMM) using a customized CUDA kernel. Thanks to FP6-LLM for the wheels!☆20Aug 30, 2024Updated last year
- Unsupervised multi-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- Machine Learning from Human Preferences☆33Mar 23, 2026Updated 2 months ago
- Genarris is a random molecular crystal structure generator.☆31May 22, 2026Updated last week
- This After Effects script helps users build a composition structure for the Twixtor effect over one or more layers with only a single click,…☆13Mar 20, 2022Updated 4 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆22Oct 15, 2024Updated last year
- Multi-agent investing agent using Claude Agent SDK☆14Oct 3, 2025Updated 7 months ago
- Baidu Maps coordinate picker tool☆12Jan 27, 2018Updated 8 years ago
- Stream Asian dramas, series and movies from multiple providers. Powered by TMDB for metadata search☆30Updated this week
- VST that combines the classic mdaPiano and EPiano in a new plug-in☆23Oct 10, 2025Updated 7 months ago
- A Python package to make Stable Diffusion Image Generation ridiculously easy☆18Jul 1, 2024Updated last year
- A machine-readable constitution for AI — Soul’s creativity hardened into ÆON☆17Sep 14, 2025Updated 8 months ago
- A summary of theory and practice learning materials for deep learning beginners☆13Apr 19, 2019Updated 7 years ago
- Deep Learning for Energy Efficient Beamforming in MU-MISO Networks: A GAT-based Approach☆15Apr 22, 2023Updated 3 years ago
- Deep Learning course final project. 12th semester.☆18Apr 24, 2025Updated last year
- Error monitor for Spring Boot☆15Nov 26, 2021Updated 4 years ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 11 months ago
- lightweight and scalable whole-body teleoperation framework for humanoid robots☆103May 22, 2026Updated last week
- Simple and easy tool for installing and managing openFrameworks libraries and projects☆24Aug 24, 2014Updated 11 years ago
- ☆11Jan 24, 2025Updated last year
- Client side vector search using EmbeddingGemma with Web AI (LiteRT.js, TensorFlow.js, and Transformers.js)☆109Mar 27, 2026Updated 2 months ago
- This repository is dedicated to diving into the world of machine learning through daily projects, tutorials, and insights.☆14Nov 19, 2024Updated last year
- Safe OS process execution for Elixir. Zero zombie processes, NIF-based backpressure, PTY support, and cgroup isolation.☆51Apr 17, 2026Updated last month
- A professional list of Papers on AI for Spatial Interpolation in AI conferences and journals.☆18Jul 29, 2024Updated last year
- Some common CUDA kernel implementations (Not the fastest).☆29Dec 5, 2025Updated 5 months ago
- 🥪 Mess portal where owners can set their weekly menu, price, time, and students can purchase their desired coupons, with a QR code syste…☆11Jun 2, 2023Updated 2 years ago
- ☆10Jul 11, 2022Updated 3 years ago
- OllamaFX is a native, lightweight, and professional JavaFX desktop client for Ollama. Run Llama 3, Mistral, and Phi-3 locally with maximu…☆69May 21, 2026Updated last week
- Let's make good things!☆13Aug 22, 2018Updated 7 years ago
- ImageNet1k-pretrained SE(2) Equivariant Vision Models☆17May 6, 2024Updated 2 years ago