π§ A study guide to learn about Transformers
β12Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for How-Transformers-Work
Users that are interested in How-Transformers-Work are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background aβ¦β13Oct 30, 2024Updated last year
- Code for the article series on building a Python compiler and interpreterβ11Feb 13, 2025Updated last year
- Build RAG for free with local LLMs using Ollamaβ13Apr 22, 2024Updated 2 years ago
- A generic, composable multi-dimensional array library.β12Updated this week
- Tutorial for how to build BERT from scratchβ102May 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- various tools to download, convert and process the full text of scientific articlesβ10Apr 2, 2024Updated 2 years ago
- β15Jan 31, 2022Updated 4 years ago
- Mixture of Experts from scratchβ13Apr 12, 2024Updated 2 years ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4jβ19Aug 19, 2024Updated last year
- Sthaan uses AI to create digital addresses with local language support in voice/text, making it easier for people to find and reach locatβ¦β12Nov 17, 2024Updated last year
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ10Nov 5, 2020Updated 5 years ago
- Integrating Elixir, Mix and OTPβ57May 12, 2014Updated 11 years ago
- β14Jan 26, 2012Updated 14 years ago
- β22Jan 10, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- β16Jul 7, 2025Updated 9 months ago
- β12May 28, 2025Updated 11 months ago
- An implementation of the Latent Skill Embedding modelβ10Feb 19, 2016Updated 10 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Trainingβ18Nov 25, 2021Updated 4 years ago
- C++20 N-dimensional Matrix class for hobby projectβ23Nov 11, 2021Updated 4 years ago
- βοΈ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.β18Dec 20, 2025Updated 4 months ago
- β10Sep 10, 2023Updated 2 years ago
- β33Jan 17, 2025Updated last year
- A OCR Project for Reading New and Old Kannada Textsβ10Aug 31, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Gentle Principled Introduction to Deep Reinforcement Learningβ19Apr 4, 2025Updated last year
- β16Mar 24, 2023Updated 3 years ago
- Scaler Academy - Software Engineering Bootcampβ12Jan 8, 2024Updated 2 years ago
- β13Dec 15, 2022Updated 3 years ago
- Trained a 114 million Parameter LLM from Scratch.β19Jul 21, 2024Updated last year
- Internet bandwidth monitor by domainβ15Nov 13, 2016Updated 9 years ago
- Intelehealth's Doctor Web Applicationβ17Updated this week
- Convert Standard M2 format to parallel sentences.β22Jun 20, 2020Updated 5 years ago
- Using C++'s type system effectively, safely, expressivelyβ24Oct 10, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A talk+workshop on Accelerating Your Security Learning in 2017 given at null Bangalore 2017β13Jan 23, 2017Updated 9 years ago
- Lyric Generation using AIβ13Apr 29, 2019Updated 7 years ago
- β15Jan 11, 2024Updated 2 years ago
- Re-implementation of Andrej Karpathy's nanoGPTβ18Feb 16, 2023Updated 3 years ago
- This is the repo for the Oreilly workshopβ37Dec 10, 2025Updated 4 months ago
- How to Train Your Advisor: Steering Black-Box LLMs with Advisor Modelsβ74Feb 5, 2026Updated 3 months ago
- An LLM enabled XML generator for Indian laws in the LegalDocML and LegalRuleML formatsβ20Sep 6, 2024Updated last year