rai-project / go-fasttext
☆14Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for go-fasttext
- Golang binding for facebook fastText☆13Updated 7 years ago
- ☆26Updated last month
- Morfessor EM+Prune☆10Updated 4 years ago
- Python implementation of Levenshtein distance and Levenshtein automata matching☆27Updated 5 years ago
- Go Bindings for BERT NLP Models☆99Updated 5 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆24Updated last year
- Tools for managing datasets for governance and training.☆78Updated 3 weeks ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Updated 3 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆11Updated 4 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 3 months ago
- Python package for lexicon; Trie and DAWG implementation.☆55Updated 5 months ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 3 years ago
- liblinear bindings for Go☆45Updated 6 years ago
- Corpus preprocessing☆95Updated 8 months ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 5 years ago
- A multilingual command line sentence tokenizer in Golang☆439Updated 8 months ago
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- Facebook fastText database in SQLite with Go API☆32Updated 4 years ago
- go-corenlp is a Golang wrapper for Stanford CoreNLP.☆30Updated 5 years ago
- ☆36Updated 3 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆34Updated 4 months ago
- Spoken Language Translation System☆14Updated 5 years ago
- The NLPStatTest project☆11Updated 2 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆67Updated 6 months ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆45Updated 6 months ago
- ☆54Updated last year
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆13Updated 4 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆101Updated 2 years ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆205Updated 3 years ago