microsoft / factored-segmenterLinks
Unsupervised factor-based text tokenizer for natural-language processing applications
☆17Updated 4 years ago
Alternatives and similar repositories for factored-segmenter
Users that are interested in factored-segmenter are comparing it to the libraries listed below
Sorting:
- Port of PragmaticSegmenter for sentence boundary detection☆35Updated 3 years ago
- Machine is a natural language processing library for .NET that is focused on providing tools for processing resource-poor languages.☆27Updated this week
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- Numpy style scientific programming package☆18Updated 5 years ago
- Research EXpression Language☆24Updated 11 months ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆162Updated last year
- Fast and memory-efficient library for WordPiece tokenization as it is used by BERT.☆49Updated last week
- .Net Implementation for google word2vec tools.☆37Updated 2 years ago
- Rewrite .NET assemblies to allow referencing of two different versions☆26Updated 2 years ago
- Open source project for BERT Tokenizers in C#.☆86Updated 2 years ago
- Palaso Library: A set of .Net libraries useful for developers of Language Software.☆43Updated this week
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆33Updated 3 years ago
- It is a WIP C# .Net Framework implementation of the original markitdown Python library.☆15Updated 4 months ago
- ☆18Updated last year
- C# Sequence to Sequence Learning with Attention using LSTM neural Networks☆26Updated 4 months ago
- A set of variations on ObjectPool implementations with differing underlying collections.☆18Updated 5 months ago
- Easily add searching to your apps☆14Updated last week
- Word2Vec.Net-CSharp☆18Updated 6 years ago
- Parse sql using antlr.☆12Updated 2 years ago
- MessageCommunicator is a library + testing Gui for message-based communication. The library can be integrated in .NET apps (current .NET …☆14Updated 6 months ago
- Text clustering algorithm, implemented in .NET☆22Updated 2 years ago
- Run and train Transformer based Large Language Models (LLMS) natively in .NET using TorchSharp☆24Updated 8 months ago
- A collection of algorithms I've implemented and found generally useful☆16Updated last year
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆121Updated 4 years ago
- WPF/MVVM Xml Viewer for large Xml files (Alpha Version)☆17Updated 6 years ago
- Converts docx to html☆13Updated last week
- ☆37Updated 7 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆34Updated 2 months ago
- Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, …☆1Updated 2 months ago
- .NET researching application for rendering, recording, playback, analyzing and compression of audio data.☆10Updated 8 years ago