rai-project / go-fasttext
☆14 Updated 7 years ago
Alternatives and similar repositories for go-fasttext
Users who are interested in go-fasttext are comparing it to the libraries listed below.
- Golang binding for Facebook fastText ☆13 Updated 8 years ago
- Source code for the Apple reproduction ☆32 Updated 4 years ago
- go-corenlp is a Golang wrapper for Stanford CoreNLP. ☆30 Updated 6 years ago
- Go Bindings for BERT NLP Models ☆104 Updated 6 years ago
- Python package for lexicon; Trie and DAWG implementation. ☆55 Updated 8 months ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain ☆60 Updated 6 years ago
- LASER multilingual sentence embeddings as a pip package ☆224 Updated 2 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆86 Updated 4 years ago
- Corpus preprocessing ☆98 Updated last year
- A multilingual command line sentence tokenizer in Golang ☆454 Updated last year
- A tiny BERT for low-resource monolingual models ☆31 Updated 10 months ago
- Tool to fix bitexts and tag near-duplicates for removal ☆31 Updated 6 months ago
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks. ☆76 Updated last year
- Source code of bison ☆26 Updated 5 years ago
- ☆42 Updated 7 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆252 Updated 2 years ago
- ☆12 Updated 9 years ago
- xfspell: the Transformer Spell Checker ☆190 Updated 5 years ago
- Live survey of off-the-shelf language identification tools for Python ☆26 Updated 3 years ago
- Runnable morphological analysis tools from the UniMorph project ☆16 Updated 6 years ago
- Code and data used to build a training dataset for dragnet models ☆10 Updated 4 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- A Python truecasing utility that restores case information for texts ☆89 Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 Updated 3 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc… ☆24 Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 Updated 4 years ago
- Morfessor EM+Prune ☆10 Updated 5 years ago
- The NLPStatTest project ☆12 Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago