PyTorch Implementation of GPT-2
☆34Sep 4, 2024Updated last year
Alternatives and similar repositories for gpt2-from-scratch
Users that are interested in gpt2-from-scratch are comparing it to the libraries listed below.
- A library for simplifying training with multi-GPU setups in the HuggingFace / PyTorch ecosystem.☆16Jun 10, 2026Updated last week
- ☆14Feb 5, 2025Updated last year
- Elixir: Train a Large Language Model on a Small GPU Cluster☆16Jun 8, 2023Updated 3 years ago
- ☆18Apr 9, 2025Updated last year
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆20Aug 30, 2024Updated last year
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆20Mar 11, 2025Updated last year
- ☆11Jan 24, 2025Updated last year
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- A machine learning solution for extracting key entity values (weight, volume, dimensions) from product images.☆18Sep 17, 2024Updated last year
- 🥪 Mess portal where owners can set their weekly menu, price, time, and students can purchase their desired coupons, with a QR code syste…☆11Jun 2, 2023Updated 3 years ago
- LLM powered drawio live editor☆59Dec 10, 2025Updated 6 months ago
- MAFIA: Multiple Application Framework for GPU architectures☆28Jan 21, 2022Updated 4 years ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆12May 27, 2025Updated last year
- Repository of useful 'stuff' for the MineRL BASALT Challenge☆16Mar 21, 2023Updated 3 years ago
- macOS userland driver for Apple Pro Speakers☆27Jan 15, 2024Updated 2 years ago
- Computes innocently excludable and includable sets of alternatives☆13Oct 14, 2021Updated 4 years ago
- [ECCV 2024] Official implementation of "Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset"☆11Aug 13, 2024Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆67Oct 27, 2024Updated last year
- ☆19Jan 10, 2026Updated 5 months ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- source code for NeurIPS'24 paper "Towards Calibrated Robust Fine-Tuning of Vision-Language Models"☆15Oct 31, 2025Updated 7 months ago
- repo of paper implementations☆20Feb 25, 2025Updated last year
- Testing OpenAI Whisper models on a Raspberry Pi 5☆32Jul 30, 2024Updated last year
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases.☆18Oct 7, 2022Updated 3 years ago
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.☆64Nov 24, 2025Updated 6 months ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- The official implementation of "A2XP: Towards Private Domain Generalization".☆15Jun 14, 2024Updated 2 years ago
- Create and retrieve PDFs using HTTP GET and POST requests☆20Apr 29, 2023Updated 3 years ago
- Asrock BC-250 info and mods☆64Nov 28, 2025Updated 6 months ago
- ☆20Nov 5, 2021Updated 4 years ago
- Script to rotate webserver log file to AWS S3☆28Jul 10, 2014Updated 11 years ago
- ☆21Feb 24, 2023Updated 3 years ago
- Figures with precise control over overall width, plot aspect ratio, between-plot spacing, and colorbar dimensions.☆18Dec 31, 2021Updated 4 years ago
- Tel Aviv Birdwatching 🛴 https://twitter.com/ido_co/status/1080883756184023041☆21Apr 29, 2023Updated 3 years ago
- Java-based app to manage vintage Apple QuickTake 100/150 cameras☆49Apr 12, 2026Updated 2 months ago
- Computing the greatest common divisor with transformers, source code for the paper https://arxiv.org/abs/2308.15594☆14Aug 11, 2025Updated 10 months ago
- This is the implementation of our CVPR'23 paper On the Pitfall of Mixup for Uncertainty Calibration. In the paper, we conduct a series of…☆17Mar 19, 2023Updated 3 years ago
- SQL storage for CertMagic/Caddy TLS data.☆19Nov 11, 2022Updated 3 years ago
- Generative Model for Neural Networks☆24Jul 2, 2020Updated 5 years ago