FlexiTokens
☆19 · Dec 27, 2025 · Updated 3 months ago
Alternatives and similar repositories for flexitokens
Users that are interested in flexitokens are comparing it to the libraries listed below.
- ☆21 · Apr 3, 2026 · Updated last week
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- ANE accelerated embedding models! ☆20 · Dec 11, 2024 · Updated last year
- User-friendly viewer for Parquet files ☆11 · Mar 7, 2026 · Updated last month
- Create a Robust CDN for your Django Project Static Files in this section. This repo is the reference code for the Django + S3 + Cloudfron… ☆11 · Sep 8, 2021 · Updated 4 years ago
- Python Module implementing SRP ☆12 · Jul 29, 2022 · Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development ☆19 · Mar 17, 2026 · Updated 3 weeks ago
- Comparison of existing spell checking tools ☆11 · Mar 28, 2023 · Updated 3 years ago
- Use the React CDN as well as Babel to make a Standalone React app without running `npx create-react-app` ☆12 · Mar 22, 2019 · Updated 7 years ago
- ☆44 · Feb 11, 2026 · Updated 2 months ago
- Reference code for the AWS S3 section in the Dive into AWS Course. ☆15 · Dec 8, 2022 · Updated 3 years ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet… ☆18 · Aug 28, 2024 · Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?" ☆18 · May 15, 2025 · Updated 10 months ago
- Use Django with Docker and Deploy to Heroku. ☆14 · Sep 21, 2019 · Updated 6 years ago
- benchmarks for LLM tokenizers ☆18 · Mar 25, 2026 · Updated 2 weeks ago
- DImensionality REduction in JAX ☆26 · Nov 21, 2025 · Updated 4 months ago
- An MCP tool server that provides stateful, TUI-compatible terminal sessions. ☆14 · Feb 3, 2025 · Updated last year
- Declaratively set your DNS records with dnsmill, powered by libdns. ☆12 · Nov 26, 2025 · Updated 4 months ago
- Automatically exported from code.google.com/p/transducersaurus ☆11 · Apr 1, 2015 · Updated 11 years ago
- Pre-train Static Word Embeddings ☆98 · Mar 27, 2026 · Updated 2 weeks ago
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- A frontend for your PDS ☆23 · Oct 20, 2025 · Updated 5 months ago
- ☆69 · Mar 17, 2022 · Updated 4 years ago
- An implementation of the Pair Adjacent Violators algorithm for isotonic regression in Rust ☆12 · Mar 25, 2026 · Updated 2 weeks ago
- An R package for analyzing linguistic alignment between partners in conversation transcripts ☆14 · Jan 30, 2026 · Updated 2 months ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc. ☆15 · Feb 25, 2025 · Updated last year
- copied from http://readable.sourceforge.net ☆10 · Mar 13, 2016 · Updated 10 years ago
- get grabby with file trees ☆13 · Mar 27, 2024 · Updated 2 years ago
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation" ☆15 · Jul 4, 2025 · Updated 9 months ago
- ☆12 · Nov 17, 2018 · Updated 7 years ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ … ☆69 · Jan 7, 2026 · Updated 3 months ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆74 · Feb 7, 2026 · Updated 2 months ago
- 🚀🤗 A collection of templates for Hugging Face Spaces ☆35 · Oct 9, 2023 · Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆28 · Nov 30, 2024 · Updated last year
- zero shot NER fine tuning ☆14 · Mar 17, 2025 · Updated last year
- CONFSEC's ComputeNode component of the OpenPCC standard ☆18 · Dec 15, 2025 · Updated 3 months ago
- Plug-and-play document AI with zero-shot models. ☆125 · Feb 16, 2026 · Updated last month
- Notebooks and other course materials for Emory QTM 340 (Fall 2022) ☆12 · Dec 13, 2022 · Updated 3 years ago
- Tools for training causal language models for Finnish ☆27 · Jan 14, 2026 · Updated 2 months ago