Understanding Large Language Transformer Architecture like a child
☆34Apr 3, 2024Updated 2 years ago
Alternatives and similar repositories for Understanding-Transformers-Step-by-Step-math-example
Users that are interested in Understanding-Transformers-Step-by-Step-math-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆209Aug 23, 2024Updated last year
- ☆12Jan 24, 2025Updated last year
- ☆40Jul 21, 2024Updated last year
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Jul 6, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17May 23, 2025Updated last year
- alternative way to calculating self attention☆18May 25, 2024Updated 2 years ago
- Research code and scripts used in the paper Semantic Role Labeling as Syntactic Dependency Parsing.☆15Jun 12, 2023Updated 3 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago
- swe-jyotisa-lib (beta version)☆19May 4, 2026Updated last month
- A Study in Artificial Intelligence - Simple scripts that explore capabilities provided by neural networks (NN), generative pre-trained tr…☆12Feb 17, 2025Updated last year
- ☆19Aug 7, 2024Updated last year
- A local LLM chatbot using Code Llama☆23Feb 12, 2024Updated 2 years ago
- End-to-end Azure DE project with Australia Health Expenditure dataset. Services used include Azure Data Factory, DataBricks, Data Lake, K…☆13Feb 25, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆32Sep 7, 2025Updated 9 months ago
- Emacs integration between fuz and ivy.☆12Dec 22, 2019Updated 6 years ago
- Creating the DeepSeek V3 model from scratch☆28Mar 28, 2025Updated last year
- In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will gener…☆228Jun 23, 2024Updated last year
- ☆17Dec 15, 2025Updated 6 months ago
- ☆15Apr 21, 2024Updated 2 years ago
- (mirror) Emacs mode to indent, navigate around and act on indentation units: perfect for yaml, python and the like.☆15Jun 6, 2019Updated 7 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Code for the Data Without Labels☆31May 14, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Easy way to enable/disable proxies in Emacs and Elisp.☆15Mar 30, 2022Updated 4 years ago
- A straightforward method for training your LLM, from downloading data to generating text.☆6,525Updated this week
- Tools to easily integrate Anthropic Model Context Protocol(MCP) with Langchain☆17Feb 17, 2025Updated last year
- count lines of code over emacs buffers☆17Jul 28, 2017Updated 8 years ago
- Examples to use Azure with LLMs for Chat☆18Jan 8, 2024Updated 2 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆25Feb 12, 2026Updated 4 months ago
- Demonstration showing how to deploy Streamlit using Azure App Services☆17Oct 23, 2023Updated 2 years ago
- Easily make and share gifs of your favorite YouTube moments. Built to self host with Python, AI, and Docker. Free and open source.☆17Dec 3, 2024Updated last year
- A chatbot with IBM Voice Gateway which enables direct voice interactions over a telephone with an artificial intelligence (AI) selfservic…☆20Sep 16, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Various examples for using Not Diamond to route model prompts.☆19Jun 17, 2025Updated last year
- Examples written in Emacs Lisp☆17Dec 30, 2019Updated 6 years ago
- Port of BaseFlight (with MultiWii 2.3 features) for STM32F4DISCOVERY board + GY-86 (mpu6050 + hmc5883 + ms5611) sensors board☆15Feb 3, 2014Updated 12 years ago
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated 2 years ago
- Perplexity Lite using Langgraph, Tavily, and GPT-4.☆25May 1, 2024Updated 2 years ago
- ☆24Jun 12, 2024Updated 2 years ago
- Source Code for the ICML 2020 Paper on Uncertainty & Robustness in Deep Learning☆17Aug 28, 2023Updated 2 years ago