Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and vision-language capabilities
☆32Feb 7, 2025Updated last year
Alternatives and similar repositories for encoder-decoder-slm
Users that are interested in encoder-decoder-slm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- decontamination☆29Mar 4, 2026Updated last month
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated last year
- Jane Street universe☆18Sep 14, 2020Updated 5 years ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆56Sep 25, 2025Updated 6 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- docker:dind with NVIDIA GPU support via NVIDIA container toolkit☆13Apr 1, 2026Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- A Streamlit app to add structured tags to a dataset card☆22Jun 30, 2022Updated 3 years ago
- [EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge☆21Feb 12, 2023Updated 3 years ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆63Mar 4, 2025Updated last year
- ☆12Nov 30, 2022Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- Set-Equivariant Deep Learning Models☆22Dec 23, 2021Updated 4 years ago
- ☆12Mar 24, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 6 months ago
- This tool helps you easily deploy ASR models on NPUs on AMD's Ryzen AI 300 series laptops☆23Mar 16, 2026Updated 3 weeks ago
- ☆16Apr 2, 2026Updated last week
- Open-source version of the TDspora synthetic data generation algorithm.☆18Updated this week
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 5 months ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- bindings to gnuplot (fork of https://bitbucket.org/ogu/gnuplot-ocaml/)☆13May 6, 2024Updated last year
- Examples of using Galileo for better ML data quality!!☆13Feb 5, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆17Mar 7, 2025Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- ☆12Oct 6, 2024Updated last year
- Robust Self-augmentation for NER with Meta-reweighting☆29Nov 8, 2022Updated 3 years ago
- Scripts to generate and analyze afdb clusters☆11Sep 15, 2023Updated 2 years ago
- ☆15Jan 11, 2019Updated 7 years ago
- TabMini: A Benchmark Suite for Evaluating and Analyzing the Data Efficiency of Tabular Classifiers☆10Mar 31, 2025Updated last year
- Bogazici University - CMPE150 (Introduction to Computing C) lab notes☆11Dec 20, 2019Updated 6 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Plotting for ocaml based on matplotlib.pyplot☆33Jun 19, 2022Updated 3 years ago
- Fast vectorized bitarrays for OCaml☆16Jul 11, 2023Updated 2 years ago
- A python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.☆15Oct 26, 2022Updated 3 years ago
- BigSMILES☆10Jun 16, 2024Updated last year
- Get the URL from a web shortcut file☆14Aug 14, 2021Updated 4 years ago
- ☆29Dec 23, 2019Updated 6 years ago