Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆121Oct 6, 2025Updated 5 months ago
Alternatives and similar repositories for TAID
Users that are interested in TAID are comparing it to the libraries listed below.
- Python-based chat demo for TinySwallow-1.5B that works completely offline☆58Jan 29, 2025Updated last year
- Browser-based chat UI for TinySwallow-1.5B that runs without API calls.☆131Dec 1, 2025Updated 3 months ago
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆17Jul 1, 2021Updated 4 years ago
- Performs tasks together with GPT.☆13Apr 4, 2023Updated 2 years ago
- ☆18Aug 3, 2025Updated 7 months ago
- Preferred Generation Benchmark☆92Mar 6, 2026Updated 2 weeks ago
- Flexible evaluation tool for language models☆58Mar 18, 2026Updated last week
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆111May 14, 2025Updated 10 months ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 9 months ago
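- [⚠️ WIP] ALMO is an extended Markdown parser and static site generator. It provides an execution environment that runs entirely in the browser using WebAssembly, making it possible to build pages that offer server-free sample code execution and judge systems.☆16Feb 28, 2026Updated 3 weeks ago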
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆352Oct 22, 2024Updated last year
- This repository has implementations of data augmentation for NLP for Japanese.☆64Feb 16, 2023Updated 3 years ago
- ☆16Aug 14, 2023Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- JAX implementation of Large Language Models. You can train a GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…☆13Aug 5, 2024Updated last year
- A Chrome extension that helps you translate Kaggle notebooks with a translation engine such as Google Translate.☆35Mar 26, 2025Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Scrapbox viewer for readers☆24Oct 24, 2023Updated 2 years ago
- Funer is a rule-based Named Entity Recognition tool.☆22Apr 21, 2022Updated 3 years ago
- ☆150Updated this week
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 4 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆124Nov 13, 2025Updated 4 months ago
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆24Mar 19, 2023Updated 3 years ago
- ☆29Mar 12, 2026Updated last week
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- Extracting Entities with Limited Evidence☆16Dec 26, 2022Updated 3 years ago
- Zenn contents☆11Feb 28, 2026Updated 3 weeks ago
- ☆46Updated this week
- ☆19Mar 12, 2026Updated last week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆172Aug 25, 2025Updated 7 months ago
- Japanese BERT Pretrained Model☆23Nov 13, 2021Updated 4 years ago
- Word Rotator's Distance☆19Sep 5, 2021Updated 4 years ago
- PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Nov 3, 2023Updated 2 years ago
- ☆19May 23, 2024Updated last year
- ☆16Mar 23, 2025Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,405Nov 29, 2024Updated last year