Shaping capabilities with token-level pretraining data filtering
☆93Jan 28, 2026Updated 2 months ago
Alternatives and similar repositories for token-filtering
Users that are interested in token-filtering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library to create and manage configuration files, especially for machine learning projects.☆78Mar 14, 2022Updated 4 years ago
- Does patch ordering affect context-limited vision transformers?☆17Oct 10, 2025Updated 6 months ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 10 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Mar 30, 2026Updated last week
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Oct 26, 2023Updated 2 years ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 3 months ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 10 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…☆17Apr 13, 2025Updated 11 months ago
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 5 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆64Mar 26, 2024Updated 2 years ago
- ☆22Dec 3, 2021Updated 4 years ago
- ☆14Jun 24, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- MoE training for Me and You and maybe other people☆380Mar 15, 2026Updated 3 weeks ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- A full-stack online music app, developed using MERN stack (React, Express.js, MongoDB) and Electron. Libraries including Tailwind CSS, Re…☆10Jul 2, 2024Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Mar 17, 2026Updated 3 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆13Jul 14, 2024Updated last year
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ☆40Jan 14, 2025Updated last year
- ☆22Apr 28, 2025Updated 11 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official PyTorch Implementation for Learning a Generative Meta-Model of LLM Activations☆78Mar 18, 2026Updated 3 weeks ago
- Inference API for many LLMs and other useful tools for empirical research☆113Mar 23, 2026Updated 2 weeks ago
- Database for International Physics Olympiads☆10Sep 22, 2025Updated 6 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Implementation of <Model Merging with Functional Dual Anchors>☆47Nov 23, 2025Updated 4 months ago
- Cross Atlas Remapping via Optimal Transport☆12Dec 14, 2023Updated 2 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Understand what physics/algorithms do transformers learn internally when trained on planetary motion☆39Feb 9, 2026Updated 2 months ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…☆22Mar 20, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆16Jul 7, 2025Updated 9 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆61Sep 15, 2025Updated 6 months ago
- tuimorphic choose-your-own-adventure story game☆18Mar 3, 2026Updated last month
- ☆13Aug 20, 2021Updated 4 years ago
- ☆10Jun 12, 2021Updated 4 years ago
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU.☆73Oct 17, 2025Updated 5 months ago