microsoft / factored-segmenterLinks
Unsupervised factor-based text tokenizer for natural-language processing applications
☆17Updated 5 years ago
Alternatives and similar repositories for factored-segmenter
Users that are interested in factored-segmenter are comparing it to the libraries listed below
Sorting:
- Machine is a natural language processing library for .NET that is focused on providing tools for processing resource-poor languages.☆29Updated this week
- Port of PragmaticSegmenter for sentence boundary detection☆39Updated 4 years ago
- ☆17Updated last year
- C# Sequence to Sequence Learning with Attention using LSTM neural Networks☆26Updated 10 months ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆164Updated 2 years ago
- ☆11Updated last year
- Numpy style scientific programming package☆18Updated 6 years ago
- Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block…☆28Updated 5 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Open source project for BERT Tokenizers in C#.☆92Updated 2 years ago
- Converts docx to html☆14Updated this week
- Research EXpression Language☆24Updated last year
- .Net Implementation for google word2vec tools.☆37Updated 3 years ago
- Audio WAV file tools for C# read and write, 8 and 16 bits, mono and stereo.☆11Updated 10 years ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Updated 3 years ago
- .NET Standard interface for the WebRTC voice activity detection (VAD) component.☆54Updated 3 years ago
- C# library for decoding Fsmn Vad model , used in speech activity detection☆23Updated 4 months ago
- Fast and memory-efficient library for WordPiece tokenization as it is used by BERT.☆51Updated last month
- Palaso Library: A set of .Net libraries useful for developers of Language Software.☆44Updated this week
- A light weight core package for Reactive Marbles to introduce MVVM abstractions☆15Updated this week
- Simple application to full-text searching in file system☆18Updated 5 years ago
- A tool for detecting identifiable information in data sources (CSV, DICOM, Relational Database and MongoDB)☆13Updated 3 weeks ago
- ☆19Updated 2 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 7 months ago
- Pure C# port of the Pocketsphinx keyword spotter☆13Updated 5 years ago
- It is a WIP C# .Net Framework implementation of the original markitdown Python library.☆18Updated 9 months ago
- NLTK library wrapper for .NET☆49Updated 9 months ago
- Convolutional Neural Networks☆17Updated 10 years ago
- Text clustering algorithm, implemented in .NET☆21Updated 2 years ago