Efficiently computing & storing token n-grams from large corpora
☆27 · Oct 6, 2024 · Updated last year
Alternatives and similar repositories for tokengrams
Users interested in tokengrams compare it to the libraries listed below.
- Pile Deduplication Code ☆18 · May 15, 2023 · Updated 2 years ago
- TransformerLens + HuggingFace ☆11 · Nov 4, 2023 · Updated 2 years ago
- ☆17 · Aug 30, 2025 · Updated 6 months ago
- Mapping out the "memory" of neural nets with data attribution ☆49 · Updated this week
- Engine for collecting, uploading, and downloading model activations ☆26 · Apr 2, 2025 · Updated 11 months ago
- ☆20 · Sep 1, 2018 · Updated 7 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark ☆24 · Aug 15, 2025 · Updated 7 months ago
- A library for mechanistic anomaly detection ☆22 · Jan 9, 2025 · Updated last year
- ☆44 · Nov 17, 2024 · Updated last year
- Minimum Description Length probing for neural network representations ☆20 · Jan 28, 2025 · Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year
- Code for the paper "Model Agnostic Interpretability for Multiple Instance Learning". ☆13 · Jan 28, 2022 · Updated 4 years ago
- GeoCLUSTER is a Python-based web application that provides a collection of interactive methods for streamlining the visualization of the … ☆16 · Feb 15, 2026 · Updated last month
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L… ☆22 · Nov 14, 2022 · Updated 3 years ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024) ☆33 · Jun 19, 2024 · Updated last year
- ☆12 · Jan 10, 2023 · Updated 3 years ago
- ☆13 · Dec 15, 2025 · Updated 3 months ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers. ☆12 · Sep 8, 2023 · Updated 2 years ago
- ☆23 · Jan 27, 2025 · Updated last year
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type. ☆24 · Feb 16, 2026 · Updated last month
- Hugging Face Jobs ☆19 · Jul 11, 2025 · Updated 8 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 · Jan 19, 2025 · Updated last year
- Decoder only transformer, built from scratch with PyTorch ☆33 · Oct 22, 2023 · Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 · Nov 29, 2023 · Updated 2 years ago
- PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models ☆44 · Updated this week
- A tool for model sparsification based on torch.fx ☆13 · Jun 3, 2024 · Updated last year
- Weighted multiple-instance learning algorithm ☆18 · Oct 9, 2018 · Updated 7 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024) ☆10 · May 31, 2024 · Updated last year
- A Typst Resume/CV template, inspired by Alessandro Plasmati's Graduate CV LaTeX template ☆22 · Dec 16, 2024 · Updated last year
- ☆78 · Dec 7, 2023 · Updated 2 years ago
- NYU Tandon Machine Learning and Finance Fall 2022 ☆11 · Dec 13, 2022 · Updated 3 years ago
- A Deep Neural Network explanation-by-example library for generating meaningful explanations ☆17 · Nov 11, 2020 · Updated 5 years ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 8 months ago
- Utilities for PyTorch distributed ☆25 · Feb 27, 2025 · Updated last year
- A PyTorch implementation of the Explainable AI work 'Contrastive layerwise relevance propagation (CLRP)' ☆17 · Jun 24, 2022 · Updated 3 years ago
- Copying Garbage Collector ☆14 · May 13, 2020 · Updated 5 years ago
- Segmenting a given document using recursive xy-cut algorithm. ☆12 · Oct 9, 2018 · Updated 7 years ago
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 9 months ago