imoneoi / mistral-tokenizerLinks
☆21Updated last year
Alternatives and similar repositories for mistral-tokenizer
Users that are interested in mistral-tokenizer are comparing it to the libraries listed below
Sorting:
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- JS tokenizer for LLaMA 1 and 2☆357Updated last year
- ☆135Updated last year
- ☆38Updated last year
- ☆111Updated last year
- ☆87Updated 2 weeks ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆140Updated last year
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.☆74Updated 7 months ago
- An HTTP serving framework by Banana☆101Updated last year
- Vercel and web-llm template to run wasm models directly in the browser.☆160Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- ☆79Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated 2 years ago
- Code interpreter support for o1☆32Updated 11 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js☆293Updated 5 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆50Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆27Updated last year
- The code we currently use to fine-tune models.☆115Updated last year
- Build AI Agents with Your Existing Python Code!☆64Updated 10 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆194Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆103Updated 2 years ago
- Record and stream WAV audio data in the browser across all platforms☆88Updated 9 months ago
- run embeddings in MLX☆91Updated 11 months ago
- ☆40Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆123Updated last year
- LLaVA server (llama.cpp).☆182Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- ☆116Updated 8 months ago