A repository that attempts to build deep learning attention models from scratch using PyTorch's low-level APIs.
☆74Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for Attention-from-scratch
Users that are interested in Attention-from-scratch are comparing it to the libraries listed below.
- Explains how to compute derivatives in PyTorch, to build an understanding of the step just before working with neural networks.☆18Mar 14, 2023Updated 3 years ago
- JAX implementation of Large Language Models. You can train a GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…☆13Aug 5, 2024Updated last year
- ☆22Dec 19, 2023Updated 2 years ago
- ☆20Mar 28, 2023Updated 3 years ago
- ☆11Jan 14, 2024Updated 2 years ago
- AI agent running in a local environment.☆13Apr 24, 2024Updated last year
- Speech synthesis with your own voice☆17Mar 4, 2019Updated 7 years ago
- Browser-based chat UI for TinySwallow-1.5B that runs without API calls.☆133Dec 1, 2025Updated 4 months ago
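- An innovative library that determines whether a number is even. For odd numbers, it states its case, offering an interpretation that transcends the concept of evenness.☆17Updated this week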
- Blog contents☆19Feb 16, 2023Updated 3 years ago
- ☆14Jan 27, 2024Updated 2 years ago
- ☆19Mar 12, 2026Updated 3 weeks ago
- ☆12Aug 27, 2025Updated 7 months ago
- A set of scikit-learn style transformers for Polars☆30Jun 15, 2025Updated 9 months ago
- Learning Animeface space using Progressive GAN☆17Oct 15, 2018Updated 7 years ago
- Strength isn't about the number of commits! It's about the amount of code!!!!☆22Mar 23, 2026Updated 2 weeks ago
- KdB parser used internally by twinte☆15Jul 9, 2023Updated 2 years ago
- A curated list of awesome stuff related to @honojs☆29Nov 19, 2023Updated 2 years ago
- WIP☆14Mar 6, 2025Updated last year
- ☆11Sep 7, 2024Updated last year
- narabas: Japanese phoneme forced alignment tool☆14Mar 15, 2023Updated 3 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023)☆21Feb 17, 2024Updated 2 years ago
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆98Mar 1, 2024Updated 2 years ago
- A cspell dictionary for avoiding serious dirty jokes (contains no 淫夢 references)☆37Jan 30, 2025Updated last year
- ☆54Sep 27, 2023Updated 2 years ago
- RPC implementation for Nim based on msgpack4nim☆14Jul 28, 2018Updated 7 years ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆28Jan 20, 2025Updated last year
- This is an unofficial repository of CSVs extracted from the Excel files posted on the Prime Minister of Japan website. Auto-updated.☆24Jul 9, 2021Updated 4 years ago
- TCP/IP Stack in Python☆22Jun 15, 2025Updated 9 months ago
- ☆50Apr 10, 2024Updated last year
- Experiment template for ML competitions☆167Jan 20, 2026Updated 2 months ago
- ☆15Aug 26, 2024Updated last year
- Kyoto crowd-flow weather forecast 『アメドス』☆20Oct 11, 2024Updated last year
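- [⚠️ WIP] ALMO is an extended Markdown parser and static site generator. It uses WebAssembly to provide an execution environment that runs entirely in the browser, making it possible to build pages that offer sample-code execution and judge systems without a server.☆16Updated this week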
- A simple whiteboard.☆11Nov 11, 2021Updated 4 years ago
- Swallow project: evaluation scripts for large language models☆24Sep 17, 2025Updated 6 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆30Sep 20, 2025Updated 6 months ago
- AtCoder Janken!!☆12May 8, 2019Updated 6 years ago