Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆122 · Oct 6, 2025 · Updated 6 months ago
Alternatives and similar repositories for TAID
Users that are interested in TAID are comparing it to the libraries listed below.
- Python-based chat demo for TinySwallow-1.5B that works completely offline ☆58 · Jan 29, 2025 · Updated last year
- Browser-based chat UI for TinySwallow-1.5B that runs without API calls. ☆135 · Dec 1, 2025 · Updated 5 months ago
- CycleQD is a framework for parameter space model merging. ☆49 · Feb 1, 2025 · Updated last year
- ☆18 · Aug 3, 2025 · Updated 9 months ago
- Preferred Generation Benchmark ☆94 · Mar 6, 2026 · Updated 2 months ago
- Flexible evaluation tool for language models ☆59 · Updated this week
- Awesome List of Sources of Japanese Censored Words ☆19 · Sep 11, 2022 · Updated 3 years ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022) ☆112 · May 14, 2025 · Updated 11 months ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023) ☆21 · Jun 17, 2025 · Updated 10 months ago
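- [⚠️ WIP] ALMO is an extended Markdown parser and static site generator. Using WebAssembly, it provides an execution environment that runs entirely in the browser, making it possible to build pages that offer sample-code execution environments and judge systems without a server. ☆16 · Apr 14, 2026 · Updated 3 weeks ago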
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆357 · Oct 22, 2024 · Updated last year
- This repository has implementations of data augmentation for NLP for Japanese. ☆64 · Feb 16, 2023 · Updated 3 years ago
- ☆16 · Aug 14, 2023 · Updated 2 years ago
- Japanese CLIP model ☆13 · Sep 15, 2025 · Updated 7 months ago
- ☆11 · Oct 2, 2024 · Updated last year
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence ☆45 · Aug 8, 2025 · Updated 8 months ago
- JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat… ☆13 · Aug 5, 2024 · Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF) ☆15 · Nov 26, 2020 · Updated 5 years ago
- Scrapbox viewer for readers ☆24 · Oct 24, 2023 · Updated 2 years ago
- Funer is a rule-based Named Entity Recognition tool. ☆22 · Apr 21, 2022 · Updated 4 years ago
- LaTeX document class for the proceedings of ANLP ☆21 · Oct 28, 2025 · Updated 6 months ago
- ☆152 · Apr 28, 2026 · Updated last week
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing. ☆126 · Apr 10, 2026 · Updated 3 weeks ago
- Use custom tokenizers in spacy-transformers ☆16 · Aug 9, 2022 · Updated 3 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆24 · Mar 19, 2023 · Updated 3 years ago
- ☆29 · Apr 28, 2026 · Updated last week
- Checkpointable dataset utilities for foundation model training ☆32 · Jan 29, 2024 · Updated 2 years ago
- Extracting Entities with Limited Evidence ☆16 · Dec 26, 2022 · Updated 3 years ago
- Zenn contents ☆11 · Feb 28, 2026 · Updated 2 months ago
- ☆22 · Oct 10, 2025 · Updated 6 months ago
- ☆47 · Mar 30, 2026 · Updated last month
- ☆19 · Mar 12, 2026 · Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆171 · Aug 25, 2025 · Updated 8 months ago
- Japanese BERT Pretrained Model ☆23 · Nov 13, 2021 · Updated 4 years ago
- Word Rotator's Distance ☆18 · Sep 5, 2021 · Updated 4 years ago
- PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer. ☆89 · Nov 3, 2023 · Updated 2 years ago
- ☆16 · Mar 23, 2025 · Updated last year
- ☆19 · Apr 21, 2026 · Updated 2 weeks ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,419 · Nov 29, 2024 · Updated last year