rai-project / go-fasttextLinks
☆14Updated 7 years ago
Alternatives and similar repositories for go-fasttext
Users that are interested in go-fasttext are comparing it to the libraries listed below
Sorting:
- Golang binding for facebook fastText☆13Updated 8 years ago
- Go Bindings for BERT NLP Models☆104Updated 5 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- go-corenlp is a Golang wrapper for Stanford CoreNLP.☆30Updated 5 years ago
- ☆11Updated 4 years ago
- Corpus preprocessing☆97Updated last year
- ☆42Updated 7 years ago
- A multilingual command line sentence tokenizer in Golang☆452Updated last year
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Word Embeddings in Go!☆497Updated 2 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated last year
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆261Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 5 months ago
- Text tokenization and sentence segmentation (segtok v2)☆205Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 5 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Bicleaner fork that uses neural networks☆40Updated last month
- Live survey of off-the-shelf language identification tools for python☆26Updated 3 years ago
- The NLPStatTest project☆12Updated 3 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆103Updated 3 years ago
- ☆35Updated 3 years ago
- PALI: Language identification for Perso-Arabic Scripts☆9Updated 2 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 6 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆87Updated last month
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆24Updated 2 years ago