Understanding Large Language Transformer Architecture like a child
☆33Apr 3, 2024Updated 2 years ago
Alternatives and similar repositories for Understanding-Transformers-Step-by-Step-math-example
Users that are interested in Understanding-Transformers-Step-by-Step-math-example are comparing it to the libraries listed below.
- ☆11May 29, 2024Updated 2 years ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆209May 12, 2024Updated 2 years ago
- Dataset for Conversation Semantic Role Labeling☆11Aug 26, 2021Updated 4 years ago
- ☆39Jul 21, 2024Updated last year
- ☆35Apr 21, 2024Updated 2 years ago
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- Deep semantic role labeling using Tensorflow☆17Sep 30, 2018Updated 7 years ago
- Alternative way of calculating self-attention☆18May 25, 2024Updated 2 years ago
- ☆14Apr 22, 2024Updated 2 years ago
- This project provides an AI-driven test case generator using FastAPI. The application accepts a GitHub repository name and generates test…☆20Jun 7, 2024Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆1,595May 22, 2026Updated last week
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 10 months ago
- All the content of my YouTube channel: https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated last year
- Creating the DeepSeek V3 model from scratch☆28Mar 28, 2025Updated last year
- Emacs integration between fuz and ivy.☆12Dec 22, 2019Updated 6 years ago
- ☆14Nov 16, 2024Updated last year
- Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it c…☆36Jan 19, 2024Updated 2 years ago
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆24Mar 28, 2019Updated 7 years ago
- ☆14Mar 30, 2026Updated last month
- PoC for visualizing Graphs with React, D3 and FastAPI☆20Aug 27, 2024Updated last year
- Share text/file between your computer and phone☆16Nov 4, 2021Updated 4 years ago
- Champion at Brainhack TIL 2023: Team 10000SGDMRT☆18May 29, 2024Updated 2 years ago
- The github repository for the paper at COLING 2025: Retrieval Augmented Instruction Tuning for Open NER with Large Language Models.☆27Jun 26, 2024Updated last year
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- Using the OpenAI Gym library, I implemented two reinforcement learning algorithms in the Frozen Lake environment.☆11Feb 10, 2024Updated 2 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Notes for AWS Solutions Architect Associate☆10Aug 27, 2022Updated 3 years ago
- Ready-to-run hub to engage and extend your Google Summer of Code Community☆21Jan 15, 2026Updated 4 months ago
- ☆12Mar 19, 2026Updated 2 months ago
- Chatbot slot filling based on LLM + LangChain | Multi-turn dialogue slot filling.☆35Sep 1, 2023Updated 2 years ago
- ☆12Sep 1, 2020Updated 5 years ago
- This project shows how to train a language-recognizer from scratch that is able to distinguish between German and English, robustly.☆12Dec 17, 2020Updated 5 years ago
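- Sequence labeling model based on BERT-BLSTM-CRF, supporting Chinese word segmentation, part-of-speech tagging, named entity recognition, and semantic role labeling.☆24Aug 17, 2020Updated 5 years ago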
- Examples to use Azure with LLMs for Chat☆18Jan 8, 2024Updated 2 years ago
- Comet for Data Science, published by Packt☆42Mar 2, 2026Updated 2 months ago
- Dedicated Colab notebooks for experimenting with OCR models (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B, and more) on the free-tier T4 GPU☆24Feb 12, 2026Updated 3 months ago
- Demonstration showing how to deploy Streamlit using Azure App Services☆17Oct 23, 2023Updated 2 years ago
- Automated agent using LangChain and Gmail API to classify and respond to incoming emails based on their content.☆14Oct 12, 2024Updated last year
- A chatbot with IBM Voice Gateway which enables direct voice interactions over a telephone with an artificial intelligence (AI) selfservic…☆20Sep 16, 2019Updated 6 years ago