Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
☆15 · Oct 4, 2024 · Updated last year
Alternatives and similar repositories for trident-nllb-llm2vec
Users that are interested in trident-nllb-llm2vec are comparing it to the libraries listed below.
- GlotCC Dataset and Pipeline -- NeurIPS 2024 ☆20 · Apr 6, 2025 · Updated 11 months ago
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- ☆12 · Mar 7, 2022 · Updated 4 years ago
- Skybox previewer and generator using BlockadeLabs ☆15 · May 13, 2023 · Updated 2 years ago
- Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o… ☆12 · Jul 6, 2016 · Updated 9 years ago
- A library for data streaming and augmentation ☆21 · May 5, 2025 · Updated 10 months ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining ☆18 · Nov 26, 2023 · Updated 2 years ago
- A reordering tool for machine translation. ☆15 · May 3, 2019 · Updated 6 years ago
- Yandex Mystem performs morphological analysis of Russian text ☆28 · Feb 15, 2018 · Updated 8 years ago
- Simplifying Content-Based Neural News Recommendation: On User Modeling and Training Objectives ☆15 · Mar 21, 2025 · Updated last year
- ChatGptHub: Gpt Chatbot Library with LangChain Support ☆15 · Apr 18, 2023 · Updated 2 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆15 · Jun 27, 2020 · Updated 5 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022. ☆15 · Nov 5, 2022 · Updated 3 years ago
- Simple S3 parallel downloader ☆15 · Jun 5, 2025 · Updated 9 months ago
- Reddit Media Downloader is a Python application designed to simplify the process of downloading images and GIFs from Reddit. It allows us… ☆16 · May 15, 2025 · Updated 10 months ago
- A simple way to parse a string using type annotations ☆13 · Jul 28, 2022 · Updated 3 years ago
- Rust binding to crfsuite ☆25 · Jan 31, 2026 · Updated last month
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list. ☆13 · Jan 30, 2021 · Updated 5 years ago
- ☆22 · Oct 26, 2020 · Updated 5 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 · Jan 1, 2025 · Updated last year
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Mar 6, 2023 · Updated 3 years ago
- Utilities to gather software metrics from tools (SONAR, etc) and store them into ElasticSearch for later display using Kibana. ☆11 · Dec 31, 2017 · Updated 8 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- Code repository accompanying the CHI 2021 Paper titled "Adapting User Interfaces with Model-based Reinforcement Learning" ☆16 · Oct 18, 2021 · Updated 4 years ago
- MCP Server to make searching openrouter easy ☆19 · Feb 28, 2026 · Updated 3 weeks ago
- Curated list of awesome datasets for various table understanding tasks ☆18 · Sep 5, 2025 · Updated 6 months ago
- SQL and Bash scripts to import the official Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references fro… ☆15 · Sep 14, 2025 · Updated 6 months ago
- Download an entire website's resources from a HAR file ☆15 · Jan 16, 2017 · Updated 9 years ago
- A fast implementation of BM25 ☆10 · Sep 15, 2022 · Updated 3 years ago
- A repository for resources relating to NLP in the Balochi language ☆19 · Jun 3, 2023 · Updated 2 years ago
- You know, an awesome list of search engines. ☆30 · Jul 18, 2025 · Updated 8 months ago
- Towards Few-Shot Fact-Checking via Perplexity ☆14 · Jun 11, 2021 · Updated 4 years ago
- Collection of color palettes for Python ☆15 · Apr 25, 2022 · Updated 3 years ago
- React wrapper for daisyUI ☆10 · Mar 12, 2022 · Updated 4 years ago
- Text language detection based on trigrams. ☆16 · Oct 2, 2023 · Updated 2 years ago
- The original weights of some Caffe models, ported to PyTorch. ☆11 · Jan 18, 2022 · Updated 4 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way ☆18 · Nov 4, 2025 · Updated 4 months ago
- The asm.js benchmark ☆48 · Mar 30, 2018 · Updated 7 years ago
- ☆25 · Mar 3, 2026 · Updated 3 weeks ago