Transformers from scratch using PyTorch & NumPy.
β50Feb 7, 2025Updated last year
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a small autograd engine, made purely from numpy and python.β27Sep 17, 2024Updated last year
- π£ Ml Design Patterns interview questions and answers to help you prepare for your next machine learning and data science interview in 20β¦β18Jan 4, 2026Updated 4 months ago
- β88Jan 24, 2026Updated 3 months ago
- Graph data models for RAG applicationsβ17Mar 28, 2024Updated 2 years ago
- β21Sep 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Qwen .5B reasoning model trained on OpenR1-Math-220kβ14Oct 11, 2025Updated 6 months ago
- β21Jun 1, 2024Updated last year
- β12Feb 11, 2026Updated 2 months ago
- Multidimensional Dictionary Learningβ10Sep 27, 2017Updated 8 years ago
- Repository for the Introduction to Machine Learning and Deep Learning course as part of the International Graduate Summer School in Matheβ¦β11Aug 8, 2019Updated 6 years ago
- Nonlinear SVGD for Learning Diversified Mixture Modelsβ13Jan 23, 2019Updated 7 years ago
- β15Apr 2, 2024Updated 2 years ago
- β12May 28, 2025Updated 11 months ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of themβ41Jan 23, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with featureβ¦β10May 27, 2020Updated 5 years ago
- Tutorials on how to Query Dataβ13Jan 7, 2023Updated 3 years ago
- AI-driven storytelling systemβ11Apr 24, 2025Updated last year
- rho_VAE: an autoregressive parametrization of the VAE encoderβ16Sep 17, 2019Updated 6 years ago
- Client-Server chat app that translate (without reasoning) messages based on chosen languages via a simple mapβ14Jul 18, 2024Updated last year
- β12Oct 25, 2023Updated 2 years ago
- Perplexity Lite using Langgraph, Tavily, and GPT-4.β14Jan 11, 2024Updated 2 years ago
- Pytorch implementation for the paper: Data augmentation with norm-AE and selective pseudo-labelling for unsupervised domain adaptationβ14Mar 23, 2023Updated 3 years ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorchβ416Nov 11, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquiredβ¦β10Apr 14, 2023Updated 3 years ago
- rl from zero pretrain, can it be done? yes.β292Sep 28, 2025Updated 7 months ago
- CATransformers is a framework for joint neural network and hardware architecture search.β22Mar 17, 2026Updated last month
- Jupyter Notebook on Conditional GANβ13May 9, 2019Updated 7 years ago
- In this small project we will predict the email that in which folder it will go in spam or primary.β11Jul 5, 2016Updated 9 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabiβ11Mar 27, 2017Updated 9 years ago
- Internet never forgots and now thought police never failsβ14Jul 27, 2025Updated 9 months ago
- learningggggggg π³β619Apr 2, 2025Updated last year
- β330Jan 23, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β14May 13, 2020Updated 5 years ago
- β18May 4, 2025Updated last year
- Simple AI CLI that generates docs, unit tests and README.md filesβ14Mar 8, 2026Updated 2 months ago
- Simple ffmpeg-using multimedia decoderβ13Oct 4, 2019Updated 6 years ago
- Prithvi is an in-memory key-value database built from scratch in Java, without relying on external frameworks. It provides basic data stoβ¦β89Aug 2, 2025Updated 9 months ago
- β11Oct 8, 2015Updated 10 years ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instancβ¦β10Oct 29, 2025Updated 6 months ago