ToyLLM: Learning LLM from Scratch
☆25Mar 23, 2026Updated this week
Alternatives and similar repositories for toyllm
Users that are interested in toyllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔥Keywords and URLs Censored on the Chinese Internet☆13Feb 22, 2020Updated 6 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition"☆11May 26, 2019Updated 6 years ago
- ☆11Dec 31, 2020Updated 5 years ago
- Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags☆10Apr 28, 2018Updated 7 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Source code and data for the EDM 2022 paper☆12May 16, 2022Updated 3 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- ☆15Aug 19, 2024Updated last year
- Implementation of the first neural natural logic paper on natural language inference☆10Oct 31, 2022Updated 3 years ago
- A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying…☆17Dec 16, 2019Updated 6 years ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 10 months ago
- Prolog interpreter with support for weak unification. Fork of https://bitbucket.org/cfbolz/pyrolog/☆15Jun 23, 2020Updated 5 years ago
- Large-scale Exploration of Neural Relation Classification Architectures☆12Nov 15, 2018Updated 7 years ago
- Code for modeling attention network for distant supervised relation extraction (CoNLL 2019).☆15Feb 28, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for Findings of EMNLP 2021 paper ``Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning``☆13Nov 9, 2021Updated 4 years ago
- Deep Learning model to tackle the Fake News Challenge☆13Nov 6, 2018Updated 7 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- A list of research resources that I've appreciated.☆12Dec 10, 2019Updated 6 years ago
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors" which is publicated at AAAI …☆15Mar 25, 2020Updated 6 years ago
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)☆13May 19, 2022Updated 3 years ago
- 🎤 开源语音输入工具 | 比 Typeless 更早的免费方案!支持豆包流式ASR、OpenAI GPT-4o Transcribe、本地Whisper。按下快捷键说话,文字自动输入到光标处。☆37Mar 16, 2026Updated last week
- Multi-Level Adversarial for Cross-lingual Name Tagging☆12Jun 18, 2020Updated 5 years ago
- ☆17Feb 25, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- The pytorch implementation of relational extraction models with PCNN feature extractor and multi-instance learning☆16Mar 8, 2018Updated 8 years ago
- 通过实验对比LLM推理中Prefill和Decoding阶段的吞吐量差异,揭示性能瓶颈,解释PD分离优化技术的原理。包含CUDA和Apple MPS (M系列芯片) 的测试脚本。☆19May 22, 2025Updated 10 months ago
- PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656☆25Jun 12, 2023Updated 2 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Apr 11, 2020Updated 5 years ago
- Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System. NAACL 2021. https://arxiv.org/abs…☆20May 28, 2021Updated 4 years ago
- ☆13Dec 19, 2019Updated 6 years ago
- ☆20Nov 11, 2019Updated 6 years ago
- FEVER Workshop Shared-Task☆16Apr 16, 2019Updated 6 years ago
- ☆19Sep 11, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A bert baseline for DocRED☆18Oct 12, 2022Updated 3 years ago
- A repo of useful MLX skills.☆79Jan 25, 2026Updated 2 months ago
- Repository for group 17 on the Statistical Natural Language Processing module at UCL☆23Aug 23, 2021Updated 4 years ago
- Code for "Open Vocabulary Extreme Classification Using Generative Models"☆24Aug 25, 2022Updated 3 years ago
- ☆26Aug 7, 2021Updated 4 years ago
- The RunBugRun dataset of executable bugs☆24Sep 24, 2025Updated 6 months ago
- ☆27Oct 30, 2023Updated 2 years ago