ToyLLM: Learning LLM from Scratch
☆25Apr 13, 2026Updated this week
Alternatives and similar repositories for toyllm
Users that are interested in toyllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔥Keywords and URLs Censored on the Chinese Internet☆13Feb 22, 2020Updated 6 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition"☆11May 26, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GEMM☆10Aug 26, 2023Updated 2 years ago
- ☆11Dec 31, 2020Updated 5 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- AI 可以在 50 trun 内实现一个简单的高性能向量数据库吗?☆53Mar 30, 2026Updated 2 weeks ago
- Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags☆10Apr 28, 2018Updated 7 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30Apr 9, 2026Updated last week
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 7 months ago
- Implementation of the first neural natural logic paper on natural language inference☆10Oct 31, 2022Updated 3 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying…☆17Dec 16, 2019Updated 6 years ago
- Prolog interpreter with support for weak unification. Fork of https://bitbucket.org/cfbolz/pyrolog/☆15Jun 23, 2020Updated 5 years ago
- Large-scale Exploration of Neural Relation Classification Architectures☆12Nov 15, 2018Updated 7 years ago
- Code for modeling attention network for distant supervised relation extraction (CoNLL 2019).☆15Feb 28, 2020Updated 6 years ago
- Source code for Findings of EMNLP 2021 paper ``Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning``☆13Nov 9, 2021Updated 4 years ago
- Orange's一个操作系统的实现(随书光盘)☆19Jun 30, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep Learning model to tackle the Fake News Challenge☆13Nov 6, 2018Updated 7 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- 。☆13Jan 15, 2022Updated 4 years ago
- ☆18Nov 22, 2025Updated 4 months ago
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)☆13May 19, 2022Updated 3 years ago
- ☆32Jul 2, 2025Updated 9 months ago
- some hpc project for learning☆26Aug 28, 2024Updated last year
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL☆19Mar 1, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multi-Level Adversarial for Cross-lingual Name Tagging☆12Jun 18, 2020Updated 5 years ago
- All Resources from Stanford CS106B 2021☆25Jul 11, 2025Updated 9 months ago
- The pytorch implementation of relational extraction models with PCNN feature extractor and multi-instance learning☆16Mar 8, 2018Updated 8 years ago
- a reactor network library☆16Aug 21, 2025Updated 7 months ago
- 通过实验对比LLM推理中Prefill和Decoding阶段的吞吐量差异,揭示性能瓶颈,解释PD分离优化技术的原理。包含CUDA和Apple MPS (M系列芯片) 的测试脚本。☆21May 22, 2025Updated 10 months ago
- ☆15Mar 23, 2022Updated 4 years ago
- PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656☆25Jun 12, 2023Updated 2 years ago