Language-agnostic BERT Sentence Embedding (LaBSE)
☆155Sep 10, 2020Updated 5 years ago
Alternatives and similar repositories for labse
Users that are interested in labse are comparing it to the libraries listed below.
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆24Aug 21, 2024Updated last year
- In this project, we implemented a topic detection system on Twitter. This system reads tweets from a data stream and assigns them to one …☆26Nov 7, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago
- Multi-Task instruction-tuned LLaMA☆14May 5, 2023Updated 3 years ago
- ☆19Nov 7, 2022Updated 3 years ago
- A neural word aligner based on multilingual BERT☆375Mar 10, 2022Updated 4 years ago
- Explores Chinese language models with sub-character level visual information☆16Oct 5, 2018Updated 7 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 3 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- Topic clustering library built on Transformer embeddings and cosine similarity metrics. Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Currently only for the reading comprehension track☆13Mar 31, 2021Updated 5 years ago
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Mar 5, 2022Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆106May 20, 2022Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Feb 21, 2022Updated 4 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆37Jul 15, 2021Updated 4 years ago
- KBQA demo☆24Jul 3, 2018Updated 7 years ago
- ☆15Mar 3, 2022Updated 4 years ago
- Improved Sentence Alignment in Linear Time and Space☆194Mar 6, 2023Updated 3 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- The score code of FastBERT (ACL2020)☆608Oct 29, 2021Updated 4 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Codes for "EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases"☆13Nov 12, 2021Updated 4 years ago
- Chinese Text Generation using LSTM☆11Aug 7, 2017Updated 8 years ago
- Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019☆21Jan 11, 2020Updated 6 years ago
- Task-oriented dialog system toolkits☆86Mar 24, 2023Updated 3 years ago
- Pure Python implementations of the language models for information retrieval surveyed here: https://dl.acm.org/doi/10.1145/383952.384019.☆13Dec 11, 2019Updated 6 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆153Jun 6, 2022Updated 3 years ago
- Bicleaner fork that uses neural networks☆40Feb 23, 2026Updated 2 months ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- A Hindi-English Dataset for Text Normalization☆17Jan 3, 2022Updated 4 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Jan 11, 2021Updated 5 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Feb 14, 2018Updated 8 years ago
- Data augmentation for NLP☆4,658Jun 24, 2024Updated last year