Transformer from Scratch in PyTorch
☆17 · Mar 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for Transformer-from-Scratch
Users that are interested in Transformer-from-Scratch are comparing it to the libraries listed below
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script ☆14 · Jan 12, 2026 · Updated last month
- asw.cluster R package for calculating group faultlines ☆12 · Aug 20, 2023 · Updated 2 years ago
- This UE4 project contains the Telekinesis Mechanic for Control ☆11 · Jul 26, 2020 · Updated 5 years ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 · Jan 1, 2025 · Updated last year
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆29 · Sep 12, 2025 · Updated 5 months ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends. ☆15 · May 23, 2024 · Updated last year
- A simple web-app for generating glassmorphism UI effect! ☆12 · Aug 5, 2023 · Updated 2 years ago
- Fine-tuning GPT-2 to generate research paper abstracts ☆12 · Apr 28, 2021 · Updated 4 years ago
- The Kingdom Hearts 3 randomizer and garden of assemblage mod. ☆10 · Jan 3, 2023 · Updated 3 years ago
- Simulating the fractional quantum Hall effect with neural network variational Monte Carlo ☆20 · Sep 12, 2025 · Updated 5 months ago
- ☆11 · Feb 6, 2018 · Updated 8 years ago
- Implement Fluid Simulation (FLIP) on Unreal Engine 5 with NVIDIA GVDB Library ☆12 · Nov 30, 2023 · Updated 2 years ago
- ☆12 · Nov 1, 2023 · Updated 2 years ago
- Finetuning Mask2Former on semantic segmentation using custom dataset ☆15 · May 31, 2024 · Updated last year
- Compression primitives for uplink compression in Federated Learning that are compatible with Secure Aggregation. ☆10 · Jul 27, 2022 · Updated 3 years ago
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 6 months ago
- Download TikTok videos online with TikTok Video Downloader. Completely free. ☆13 · Sep 17, 2025 · Updated 5 months ago
- ☆10 · Aug 26, 2022 · Updated 3 years ago
- Jupyter notebooks from our weekly (or so) hackathons ☆11 · Dec 3, 2024 · Updated last year
- Code for Learning idiolectal style variation in online register ☆10 · May 18, 2023 · Updated 2 years ago
- Deepseek-CoT ☆10 · Oct 6, 2024 · Updated last year
- Unofficial Reproduction: Capacity estimation of lithium-ion batteries based on adaptive empirical wavelet transform and long short-term m… ☆12 · Oct 28, 2024 · Updated last year
- Code for Semi-crowdsourced Clustering with Deep Generative Models ☆12 · Dec 9, 2022 · Updated 3 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 · Dec 30, 2024 · Updated last year
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions ☆10 · May 16, 2018 · Updated 7 years ago
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models ☆13 · Mar 6, 2025 · Updated 11 months ago
- Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management ☆18 · Jan 7, 2026 · Updated last month
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes… ☆17 · Jan 7, 2025 · Updated last year
- a neural network trainer for weebs ☆14 · Feb 23, 2026 · Updated last week
- Flan T5 LLM fine-tuning by attaching a regression model to the last hidden layer activations. Runs on Colab with an A100 40GB ☆13 · Mar 24, 2023 · Updated 2 years ago
- ☆11 · Jul 25, 2021 · Updated 4 years ago
- Sqlite3-based logging for Python ☆15 · May 27, 2024 · Updated last year
- ☆11 · Oct 21, 2017 · Updated 8 years ago
- A copy of the DirectX Headers from MinGW-64. ☆13 · Sep 7, 2023 · Updated 2 years ago
- An application that brings together several anime streaming platforms ☆10 · Mar 1, 2025 · Updated last year
- Completed Unreal Engine replay system tutorial (blueprint version) ☆13 · Jun 3, 2018 · Updated 7 years ago
- ☆12 · Feb 2, 2024 · Updated 2 years ago
- hudi-spark-utilities-plus ☆11 · Jul 29, 2022 · Updated 3 years ago