Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆122Oct 6, 2025Updated 6 months ago
Alternatives and similar repositories for TAID
Users that are interested in TAID are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python-based chat demo for TinySwallow-1.5B that works completely offline☆58Jan 29, 2025Updated last year
- Browser-based chat UI for TinySwallow-1.5B that runs without API calls.☆133Dec 1, 2025Updated 4 months ago
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆17Jul 1, 2021Updated 4 years ago
- Performs tasks together with GPT.☆13Apr 4, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18Aug 3, 2025Updated 8 months ago
- DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is BERT-base that built in-house at LINE.☆46Mar 22, 2023Updated 3 years ago
- Preferred Generation Benchmark☆93Mar 6, 2026Updated last month
- Flexible evaluation tool for language models☆59Apr 6, 2026Updated last week
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆112May 14, 2025Updated 11 months ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆21Jun 17, 2025Updated 9 months ago
- [⚠️ WIP] ALMOは拡張Markdownパーサ・静的サイトジェネレータです。WebAssemblyを使ってブラウザ上で完結する実行環境を提供し、サーバを必要としないサンプルコードの実行環境やジャッジシステムを提供するページの構築を可能にします。☆16Updated this week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆354Oct 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository has implementations of data augmentation for NLP for Japanese.☆64Feb 16, 2023Updated 3 years ago
- ☆16Aug 14, 2023Updated 2 years ago
- 日本語CLIPモデル☆13Sep 15, 2025Updated 7 months ago
- ☆11Oct 2, 2024Updated last year
- JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…☆13Aug 5, 2024Updated last year
- A Chrome extension that helps you translate Kaggle notebook with translate engine like Google Translate.☆35Mar 26, 2025Updated last year
- Scrapbox viewer for readers☆24Oct 24, 2023Updated 2 years ago
- ☆150Mar 30, 2026Updated 2 weeks ago
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆126Updated this week
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆24Mar 19, 2023Updated 3 years ago
- ☆29Updated this week
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- Extracting Entities with Limited Evidence☆16Dec 26, 2022Updated 3 years ago
- Zenn contents☆11Feb 28, 2026Updated last month
- ☆46Mar 30, 2026Updated 2 weeks ago
- ☆19Mar 12, 2026Updated last month
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Japanese BERT Pretrained Model☆23Nov 13, 2021Updated 4 years ago
- ☆10Feb 18, 2025Updated last year
- Word Rotator's Distance☆19Sep 5, 2021Updated 4 years ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Nov 3, 2023Updated 2 years ago
- ☆19May 23, 2024Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,414Nov 29, 2024Updated last year
- A Programming Language implemented in JavaScript☆18Feb 11, 2026Updated 2 months ago