NMZivkovic / BertTokenizers
Open source project for BERT Tokenizers in C#.
☆85Updated last year
Alternatives and similar repositories for BertTokenizers:
Users that are interested in BertTokenizers are comparing it to the libraries listed below
- BERT Model for dotnet ML☆97Updated last year
- Fast and memory-efficient library for WordPiece tokenization as it is used by BERT.☆47Updated last week
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆74Updated 6 months ago
- C# library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs☆80Updated last week
- Lightweight In-memory Text Vector Database to embed in any .NET Applications☆73Updated last week
- ☆25Updated 4 months ago
- This project implements token calculation for OpenAI's gpt-4 and gpt-3.5-turbo model, specifically using `cl100k_base` encoding.☆73Updated last week
- Pinecone.NET is a fully-fledged C# library for the Pinecone vector database.☆58Updated 6 months ago
- Port of PragmaticSegmenter for sentence boundary detection☆35Updated 3 years ago
- .NET implementation of the LangChain project.☆70Updated last year
- Natural Language Processing in .NET Core☆116Updated 2 years ago
- State-of-the-art face detection and face recognition for .NET.☆84Updated last week
- An unofficial C#/.NET SDK for accessing the Anthropic Claude API☆122Updated this week
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆33Updated 3 years ago
- BLlamaSharp.ChatGpt.Blazor is a Blazor-based LLamaSharp Chat GPT application☆63Updated last year
- pgvector support for .NET (C#, F#, and Visual Basic)☆179Updated this week
- Sound classification using ML.NET and D-CNN's☆27Updated 5 years ago
- International Components for Unicode for .NET☆31Updated 4 months ago
- Automated .NET SDKs for your APIs☆45Updated last week
- ONNX format parsing and manipulation in C#.☆28Updated 2 months ago
- A lightweight full text indexer for .NET☆188Updated 3 weeks ago
- Qdrant .Net SDK☆123Updated 3 weeks ago
- Token calculation for OpenAI models, using `o200k_base` `cl100k_base` `p50k_base` encoding.☆117Updated 2 weeks ago
- SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapm…☆134Updated 6 months ago
- A miniature large language model (LLM) that generates shakespeare like text written in C#. Project meant to help dotnet developers get in…☆33Updated last year
- .NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙☆57Updated last year
- A simple light-weight library that wraps the Open AI API.☆95Updated this week
- A .NET Core implementation of Amazon's S3 API with focus on simplicity, security and performance☆54Updated 2 weeks ago
- net standard 2.1 Library for running Sentence Transformers All-MiniLM-L6-v2 from C#.☆18Updated 3 months ago
- NER (Named Entity Recognition) implementation using a BERT/DistilBERT-based ONNX model for Token Classification in ML.NET☆21Updated last month