microsoft / factored-segmenter
Unsupervised factor-based text tokenizer for natural-language processing applications
☆17Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for factored-segmenter
- Machine is a natural language processing library for .NET that is focused on providing tools for processing resource-poor languages.☆26Updated this week
- .NET researching application for rendering, recording, playback, analyzing and compression of audio data.☆10Updated 7 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆33Updated 3 years ago
- Palaso Library: A set of .Net libraries useful for developers of Language Software.☆44Updated this week
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆32Updated 3 years ago
- C# Sequence to Sequence Learning with Attention using LSTM neural Networks☆25Updated 6 years ago
- call rwkv v4/v5/v6 raven/world/finch 1B5-14B ggml using csharp cpu/gpu (support INT4,8,Float16,32)☆31Updated last month
- Numpy style scientific programming package☆18Updated 5 years ago
- Reed-Solomon Erasure Coding in C#/.NET☆26Updated last year
- .Net Implementation for google word2vec tools.☆37Updated last year
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆154Updated last year
- ☆11Updated 3 years ago
- AIMLBot (Program#) is a small, fast, standards-compliant yet easily customizable .NET dll implementation of an AIML (Artificial Intellige…☆14Updated 7 years ago
- A testing framework for Visual Studio extensions☆14Updated last week
- Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block…☆23Updated 4 years ago
- Word2Vec.Net-CSharp☆18Updated 5 years ago
- Open source project for BERT Tokenizers in C#.☆83Updated last year
- Run and train Transformer based Large Language Models (LLMS) natively in .NET using TorchSharp☆16Updated this week
- ☆17Updated 2 weeks ago
- .NET Standard interface for the WebRTC voice activity detection (VAD) component.☆43Updated 2 years ago
- Text clustering algorithm, implemented in .NET☆23Updated last year
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 5 years ago
- A Visual Studio extension for showing GitHub Actions workflow information☆18Updated 5 months ago
- Natural Language Processing Engine built with ML.NET☆25Updated last year
- ☆11Updated last year
- .NET Reflection Made Fast and Simple ⛺☆27Updated 3 years ago
- NLTK library wrapper for .NET☆46Updated 3 years ago