Rust library for indexing and quickly searching large pretraining corpora
☆31 Oct 30, 2025 Updated 4 months ago
Alternatives and similar repositories for rusty-dawg
Users who are interested in rusty-dawg are comparing it to the libraries listed below.
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆18 Mar 15, 2024 Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆19 Feb 24, 2026 Updated last week
- A file-backed dictionary for Python ☆12 Aug 15, 2022 Updated 3 years ago
- 3D geoms for plotnine (grammar of graphics in Python) ☆12 Aug 5, 2022 Updated 3 years ago
- ☆13 Dec 31, 2023 Updated 2 years ago
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models" ☆29 Jun 18, 2021 Updated 4 years ago
- ☆15 Nov 3, 2025 Updated 4 months ago
- A dataset for training interactive plotting agent ☆14 Dec 8, 2022 Updated 3 years ago
- Fast dataset format and loader ☆24 Jan 2, 2026 Updated 2 months ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models ☆21 Oct 3, 2023 Updated 2 years ago
- This repository contains two datasets with multi-turn adversarial conversations generated by human agents interacting with a dialog model… ☆32 Jul 16, 2024 Updated last year
- Graphically structured diffusion model. ☆21 Jun 16, 2023 Updated 2 years ago
- Gantry is a CLI that streamlines running experiments in Beaker ☆32 Updated this week
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model ☆24 Oct 12, 2024 Updated last year
- Stick-breaking attention ☆62 Jul 1, 2025 Updated 8 months ago
- treemind interprets tree models ☆41 Jul 23, 2025 Updated 7 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆35 May 24, 2024 Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 Aug 30, 2024 Updated last year
- An embedded DSL for creating, composing, and using probability measures. ☆42 Sep 10, 2019 Updated 6 years ago
- ☆33 Sep 27, 2024 Updated last year
- A repository for transformer critique learning and generation ☆89 Dec 7, 2023 Updated 2 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning ☆13 Aug 17, 2023 Updated 2 years ago
- Concurrency library ☆17 Oct 13, 2024 Updated last year
- ☆11 Dec 23, 2024 Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 Oct 14, 2024 Updated last year
- A JAX library for building lattice-based speech transducer models ☆47 Updated this week
- Multi-pass compiler and runtime for probabilistic programming. ☆46 Oct 24, 2025 Updated 4 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment" ☆10 May 5, 2024 Updated last year
- Tool for Evaluating Multilingual WS-353 and SimLex-999 ☆10 Dec 15, 2016 Updated 9 years ago
- This is a read-only mirror of the CRAN R package repository. randomForestSRC — Fast Unified Random Forests for Survival, Regression, an… ☆10 Feb 12, 2026 Updated 3 weeks ago
- ☆12 Nov 22, 2024 Updated last year
- An active inference model of Lacanian psychoanalysis ☆15 Jun 7, 2025 Updated 9 months ago
- Develop C++/CUDA extensions with PyTorch like Python scripts ☆10 Jan 7, 2026 Updated last month
- Implements Global Word Vectors. ☆11 Feb 8, 2020 Updated 6 years ago
- ☆16 Nov 2, 2025 Updated 4 months ago
- ☆10 Aug 15, 2019 Updated 6 years ago
- OneStop: A 360-Participant Eye Tracking Dataset with Different Reading Regimes ☆16 Dec 5, 2025 Updated 3 months ago
- A library for Partially Homomorphic Encryption in Python ☆12 May 30, 2017 Updated 8 years ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst… ☆13 Dec 8, 2024 Updated last year