Understanding Large Language Transformer Architecture like a child
☆29Apr 3, 2024Updated 2 years ago
Alternatives and similar repositories for Understanding-Transformers-Step-by-Step-math-example
Users that are interested in Understanding-Transformers-Step-by-Step-math-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆203Aug 23, 2024Updated last year
- ☆38Jul 21, 2024Updated last year
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Jul 6, 2025Updated 9 months ago
- alternative way to calculating self attention☆18May 25, 2024Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆549Aug 3, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Apr 22, 2024Updated last year
- mini project based on embedded systems iot devices☆12May 1, 2024Updated last year
- A Study in Artificial Intelligence - Simple scripts that explore capabilities provided by neural networks (NN), generative pre-trained tr…☆12Feb 17, 2025Updated last year
- ☆19Aug 7, 2024Updated last year
- This project provides an AI-driven test case generator using FastAPI. The application accepts a GitHub repository name and generates test…☆20Jun 7, 2024Updated last year
- A local LLM chatbot using Code Llama☆23Feb 12, 2024Updated 2 years ago
- Content Moderation API for Online Chat Application☆12Dec 29, 2021Updated 4 years ago
- We have listed some of the free and powerful GenAI APIs and explore their benefit and usage.☆15Feb 3, 2024Updated 2 years ago
- ☆14Nov 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Creating the DeepSeek V3 model from scratch☆27Mar 28, 2025Updated last year
- In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will gener…☆225Jun 23, 2024Updated last year
- Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it c…☆36Jan 19, 2024Updated 2 years ago
- ☆11Sep 25, 2022Updated 3 years ago
- ☆17Dec 15, 2025Updated 4 months ago
- ☆15Apr 21, 2024Updated last year
- Make your first Pull Request on Hacktoberfest 2021. Don't forget to spread love and if you like give us a ⭐️⭐️⭐️☆18Oct 3, 2023Updated 2 years ago
- Champion at Brainhack TIL 2023: Team 10000SGDMRT☆18May 29, 2024Updated last year
- RC car which you can control from PC.☆14Nov 15, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This cover everything you need to know if you want to learn Machine Learning from basics to advance. It covers how to do exploratory data…☆12Sep 21, 2020Updated 5 years ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- Intelligent Help for Efficient Programming☆18Jan 11, 2024Updated 2 years ago
- ☆14May 16, 2023Updated 2 years ago
- Notes for AWS Solutions Architect Associate☆10Aug 27, 2022Updated 3 years ago
- Ready-to-run hub to engage and extend your Google Summer of Code Community☆21Jan 15, 2026Updated 3 months ago
- ☆12Sep 1, 2020Updated 5 years ago
- Full stack web application using ReactJS, NodeJS and MySQL.☆21Mar 10, 2025Updated last year
- Examples to use Azure with LLMs for Chat☆17Jan 8, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated 2 months ago
- Demonstration showing how to deploy Streamlit using Azure App Services☆17Oct 23, 2023Updated 2 years ago
- Easily make and share gifs of your favorite YouTube moments. Built to self host with Python, AI, and Docker. Free and open source.☆17Dec 3, 2024Updated last year
- A chatbot with IBM Voice Gateway which enables direct voice interactions over a telephone with an artificial intelligence (AI) selfservic…☆20Sep 16, 2019Updated 6 years ago
- Applied Computational Thinking with Python Second Edition, Published by Packt☆14Mar 2, 2026Updated last month
- Various examples for using Not Diamond to route model prompts.☆19Jun 17, 2025Updated 10 months ago
- Source Code for the ICML 2020 Paper on Uncertainty & Robustness in Deep Learning☆17Aug 28, 2023Updated 2 years ago