fdschmidt93/trident-nllb-llm2vec

fdschmidt93 / trident-nllb-llm2vec

Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"

☆15

Alternatives and similar repositories for trident-nllb-llm2vec

Users who are interested in trident-nllb-llm2vec are comparing it to the repositories listed below.

cisnlp / GlotCC
[NeurIPS 2024] 🕸 GlotCC Dataset and Pipeline
☆21 · Apr 6, 2025 · Updated last year
nrl-ai / chessai
Chinese Chess Advanced Analytics
☆14 · Dec 1, 2023 · Updated 2 years ago
stefan-it / ukrainian-electra
Ukrainian ELECTRA model
☆12 · Mar 11, 2023 · Updated 3 years ago
deep-spin / sparse-communication
☆12 · Mar 7, 2022 · Updated 4 years ago
Shelly111111 / SinaFinanceKnowledge
Builds a seq2seq model with PaddleNLP to implement text2sparql generation, parsing part of the data from Sina Finance.
☆11 · Jul 16, 2022 · Updated 4 years ago
marian-nmt / sotastream
A library for data streaming and augmentation
☆22 · May 5, 2025 · Updated last year
trestad / mitigating-reversal-curse
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14 · Aug 2, 2024 · Updated last year
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 · Jul 27, 2021 · Updated 5 years ago
AnesBenmerzoug / langsfer
A library for language transfer methods and algorithms.
☆16 · Feb 6, 2026 · Updated 5 months ago
Skeletonboi / ocr-nlp-flyer
Ranked Top 6 Hackathon Submission. Extracts product and promotional data from flyer images using OpenCV image segmentation and PyTesserac…
☆18 · Jan 29, 2020 · Updated 6 years ago
hucsmn / suffix_array
Suffix array construction and searching algorithms for in-memory binary data.
☆13 · Sep 10, 2022 · Updated 3 years ago
karthikncode / MorphoChain
A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.
☆13 · Oct 10, 2023 · Updated 2 years ago
jasonmayes / Retraining-TensorFlow-Classifier-Using-Video
Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o…
☆12 · Jul 6, 2016 · Updated 10 years ago
kadarakos / hieratt
Experimenting with Hierarchical Attention Networks from https://arxiv.org/abs/1606.02393 in Keras
☆13 · Oct 12, 2016 · Updated 9 years ago
UBC-NLP / afrolid
AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.
☆39 · Feb 5, 2026 · Updated 5 months ago
yannikbenz / zeroe
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
☆15 · Feb 23, 2023 · Updated 3 years ago
cisnlp / ofa
[NAACL 2024] A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18 · Nov 26, 2023 · Updated 2 years ago
azpoliak / eco
Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)
☆15 · Apr 6, 2017 · Updated 9 years ago
fitnr / unwiki
Python module to remove wiki markup text.
☆10 · Jan 15, 2016 · Updated 10 years ago
andreeaiana / simplifying_nnr
Simplifying Content-Based Neural News Recommendation: On User Modeling and Training Objectives
☆16 · Mar 21, 2025 · Updated last year
neubig / lader
A reordering tool for machine translation.
☆15 · May 3, 2019 · Updated 7 years ago
amazon-science / tree-of-traversals
☆17 · Jul 19, 2024 · Updated 2 years ago
akikoe / nmtrnng
C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"
☆21 · May 8, 2017 · Updated 9 years ago
MicrosoftTranslator / ToShipOrNotToShip
☆19 · Dec 16, 2024 · Updated last year
IllustratedMan-code / telescope-conda.nvim
☆15 · Jan 9, 2023 · Updated 3 years ago
amar-chheda / InterviewWarmupLocal
AI-powered local interview prep tool. Practice answering custom questions with speech recognition and get AI feedback based on your resum…
☆18 · Sep 18, 2024 · Updated last year
sanderland / script_tok
Code for the papers "BPE stays on SCRIPT" and "Which Pieces Does Unigram Tokenization Really Need?", and for MinGram
☆18 · Updated this week
genlm / genlm-backend
High-performance backend for language model probabilistic programs
☆17 · Updated this week
archiki / ASR-Accent-Analysis
Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15 · Jun 27, 2020 · Updated 6 years ago
cimeister / tokenizer-intrinsic-evals
TokEval: intrinsic quality metrics for tokenizers across natural language, code, and math
☆46 · Jul 4, 2026 · Updated 3 weeks ago
AI-Research-BD / Keyword-MLP
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15 · Nov 5, 2022 · Updated 3 years ago
huangyz0918 / kws-continual-learning
[ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting
☆17 · Jun 7, 2022 · Updated 4 years ago
gregor-ge / Babel-ImageNet
☆23 · May 22, 2024 · Updated 2 years ago
krangelie / bias-in-german-nlg
Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
☆16 · Sep 25, 2024 · Updated last year
devaansh100 / CLIPTrans
Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…
☆20 · Jun 3, 2024 · Updated 2 years ago
tjdevries / youtube_lua
Youtube Series: <todo>
☆25 · Apr 18, 2021 · Updated 5 years ago
techiew / Misc-Projects
Various hobby projects.
☆14 · Feb 14, 2021 · Updated 5 years ago
RUCKBReasoning / DSM
☆17 · Jan 5, 2023 · Updated 3 years ago
chenaoxd / dtopwords
☆13 · Dec 23, 2021 · Updated 4 years ago