Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
β15Oct 4, 2024Updated last year
Alternatives and similar repositories for trident-nllb-llm2vec
Users that are interested in trident-nllb-llm2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] πΈ GlotCC Dataset and Piplineβ20Apr 6, 2025Updated last year
- A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.β13Oct 10, 2023Updated 2 years ago
- Skybox previewer and generator using BlockadeLabsβ15May 13, 2023Updated 3 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"β21May 8, 2017Updated 9 years ago
- ChatGptHub: Gpt Chatbot Library with LangChain Supportβ15Apr 18, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple s3 parallel downloaderβ16Jun 5, 2025Updated last year
- βοΈ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) modelsβ39May 2, 2026Updated last month
- A simple way to parse a string using type annotationsβ13Jul 28, 2022Updated 3 years ago
- A Multilingual Dataset For Cross-lingual News Recommendationβ22Mar 27, 2024Updated 2 years ago
- Scaling Sparse Fine-Tuning to Large Language Modelsβ19Jan 31, 2024Updated 2 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list.β13Jan 30, 2021Updated 5 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"β22Feb 14, 2024Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMsβ14Jan 1, 2025Updated last year
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languagesβ11Feb 6, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- KnowMAN: Weakly Supervised Multinomial Adversarial Networksβ12Nov 9, 2021Updated 4 years ago
- Code repository accompanying the CHI 2021 Paper titled "Adapting User Interfaces with Model-based Reinforcement Learning"β17Oct 18, 2021Updated 4 years ago
- [WWW 2026] πΈ GlotWeb: Web Indexing for Minority Languagesβ17Apr 14, 2026Updated 2 months ago
- MCP Server to make searching openrouter easyβ22Feb 28, 2026Updated 3 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learningβ30Jan 25, 2023Updated 3 years ago
- Curated list of awesome datasets for various table understanding tasksβ19Sep 5, 2025Updated 9 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Aug 15, 2023Updated 2 years ago
- δ» HAR ζδ»Ά δΈθ½½ζ΄δΈͺη½η«θ΅ζΊβ15Jan 16, 2017Updated 9 years ago
- a fast implementation of BM25β10Sep 15, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- π€ Disaggregators: Curated data labelers for in-depth analysis.β69Feb 8, 2023Updated 3 years ago
- You know, an awesome list of search engines.β31Jul 18, 2025Updated 10 months ago
- Collection of color palettes for Pythonβ15Apr 25, 2022Updated 4 years ago
- Text language detection basic on trigrams.β16Oct 2, 2023Updated 2 years ago
- The original weights of some Caffe models, ported to PyTorch.β11Jan 18, 2022Updated 4 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific wayβ18Nov 4, 2025Updated 7 months ago
- β25Mar 3, 2026Updated 3 months ago
- The asm.js benchmarkβ48Mar 30, 2018Updated 8 years ago
- This is an efficient implementation of Proximal Policy Optimization in C++ LibTorch adapted from the wonderful Python implementation by: β¦β13May 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging (AACL'22)β11Aug 25, 2023Updated 2 years ago
- β11Dec 3, 2022Updated 3 years ago
- OxLM: Oxford Neural Language Modelling Toolkitβ39Nov 6, 2015Updated 10 years ago
- Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.irβ12Oct 6, 2023Updated 2 years ago
- β12Jun 25, 2018Updated 7 years ago
- AI for designers course linksβ16Oct 27, 2020Updated 5 years ago
- Synced from GitLab - Deploy your own Serverless Telegram bot integration to quickly interface with Bing's AI (a.k.a Sydney) using theirβ¦β12May 25, 2023Updated 3 years ago