Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
☆15Oct 4, 2024Updated last year
Alternatives and similar repositories for trident-nllb-llm2vec
Users that are interested in trident-nllb-llm2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 3 months ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks☆15Feb 23, 2023Updated 3 years ago
- Skybox previewer and generator using BlockadeLabs☆15May 13, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o…☆12Jul 6, 2016Updated 9 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- A reordering tool for machine translation.☆15May 3, 2019Updated 7 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"☆21May 8, 2017Updated 8 years ago
- ☆20Dec 16, 2024Updated last year
- ChatGptHub: Gpt Chatbot Library with LangChain Support☆15Apr 18, 2023Updated 3 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Nov 5, 2022Updated 3 years ago
- [ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting☆17Jun 7, 2022Updated 3 years ago
- Simple s3 parallel downloader☆16Jun 5, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆23May 22, 2024Updated last year
- ☆13Dec 23, 2021Updated 4 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- A simple way to parse a string using type annotations☆13Jul 28, 2022Updated 3 years ago
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- Rust binding to crfsuite☆25Jan 31, 2026Updated 3 months ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list.☆13Jan 30, 2021Updated 5 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- Crawler based on a modified browser to detect online tracking.☆11Jul 19, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 3 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- Generate a cute welcome message for yourself each day☆22Mar 30, 2023Updated 3 years ago
- ☆27Feb 18, 2025Updated last year
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- Building and Using A Seed Corpus for the Human Language Project☆11Feb 9, 2018Updated 8 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references fro…☆15Sep 14, 2025Updated 7 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Aug 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 从 HAR 文件 下载整个网站资源☆15Jan 16, 2017Updated 9 years ago
- A repository for resources relating to NLP in the Balochi language☆19Jun 3, 2023Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆68Feb 8, 2023Updated 3 years ago
- Per-collection OCR leaderboards using VLM-as-judge☆59Mar 23, 2026Updated last month
- Collection of color palettes for Python☆15Apr 25, 2022Updated 4 years ago
- ☆26Nov 13, 2022Updated 3 years ago
- React wrapper for daisyUI☆10Mar 12, 2022Updated 4 years ago