☆18Feb 4, 2025Updated last year
Alternatives and similar repositories for Transformer-Cookbook
Users that are interested in Transformer-Cookbook are comparing it to the libraries listed below
Sorting:
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- micro-gpt in ASM on the Super Nintendo☆49Feb 12, 2026Updated last month
- ☆21Mar 10, 2026Updated last week
- A lightweight framework for the efficient parsing and manipulation of PDDL in lifted form.☆19Dec 17, 2025Updated 3 months ago
- Demo server for TREC LiveQA competition☆11Dec 7, 2016Updated 9 years ago
- A YAML editor for the modern Kirby games☆13Aug 23, 2025Updated 6 months ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- Code for the ILNewsDiff Twitter account☆10May 23, 2023Updated 2 years ago
- Finds snippets in iambic pentameter in English-language text and tries to combine them to a rhyming sonnet.☆13Jan 5, 2023Updated 3 years ago
- ☆14Oct 30, 2024Updated last year
- Stochastic Parameter Decomposition☆68Updated this week
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- ☆13Apr 6, 2022Updated 3 years ago
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- ☆18Mar 13, 2026Updated last week
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- New York Times Word Innovation Types dataset☆21Dec 1, 2020Updated 5 years ago
- ☆44Feb 11, 2026Updated last month
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"☆29Jun 2, 2021Updated 4 years ago
- Library for LLM-driven action model acquisition via natural language☆49Feb 18, 2026Updated last month
- An implementation of the exponential random graph model☆27May 14, 2014Updated 11 years ago
- Python library for Adversarial ML Evaluation☆26Jul 14, 2025Updated 8 months ago
- The Recognizing, Exploring, and Articulating Limitations in Machine Learning research tool (REAL ML) is a set of guided activities to hel…☆52May 6, 2022Updated 3 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31May 11, 2020Updated 5 years ago
- Useful decorators every Data Scientist should know☆29Nov 30, 2022Updated 3 years ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 9 months ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆61Feb 7, 2022Updated 4 years ago
- A national initiative for the creation of infrastructure, research and development of advanced capabilities for the advancement of the fi…☆39Nov 2, 2022Updated 3 years ago
- Neural Unification for Logic Reasoning over Language☆22Nov 15, 2021Updated 4 years ago
- Convert LaTeX to MathML Core☆32Mar 13, 2026Updated last week
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆53Dec 6, 2016Updated 9 years ago
- Yet Another (natural language) Parser☆43May 15, 2019Updated 6 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- Python library for multiparameter persistence☆39Mar 9, 2026Updated last week
- Header-only C library for Binary Neural Network Feedforward Inference (targeting small devices)☆49Jan 10, 2022Updated 4 years ago
- ☆100Dec 8, 2021Updated 4 years ago