FlexiTokens
☆23Dec 27, 2025Updated 6 months ago
Alternatives and similar repositories for flexitokens
Users that are interested in flexitokens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Apr 3, 2026Updated 2 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Create a Robust CDN for your Django Project Static Files in this section. This repo is the reference code for the Django + S3 + Cloudfron…☆11Sep 8, 2021Updated 4 years ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆70Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development☆19Jun 23, 2026Updated last week
- User-friendly viewer for Parquet files☆15May 8, 2026Updated last month
- Comparison of existing spell checking tools☆11Mar 28, 2023Updated 3 years ago
- Use the React CDN as well as Babel to make a Standalone React app without running `npx create-react-app`☆12Mar 22, 2019Updated 7 years ago
- ☆45Feb 11, 2026Updated 4 months ago
- Reference code for the AWS S3 section in the Dive into AWS Course.☆15Dec 8, 2022Updated 3 years ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- Use Django with Docker and Deploy to Heroku.☆14Sep 21, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- benchmarks for LLM tokenizers☆18Mar 25, 2026Updated 3 months ago
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 7 months ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆23May 15, 2025Updated last year
- Run a program sandboxed in an ephemeral jj workspace using a Nix devshell☆33Apr 24, 2026Updated 2 months ago
- An MCP tool server that provides stateful, TUI-compatible terminal sessions.☆15Feb 3, 2025Updated last year
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Declaratively set your DNS records with dnsmill, powered by libdns.☆12Nov 26, 2025Updated 7 months ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Pre-train Static Word Embeddings☆106Jun 9, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A frontend for your PDS☆25Oct 20, 2025Updated 8 months ago
- ☆69Mar 17, 2022Updated 4 years ago
- An implementation of the Pair Adjacent Violators algorithm for isotonic regression in Rust☆13Mar 25, 2026Updated 3 months ago
- Koa.js framework setup to run within Next.js API routes.☆12May 23, 2026Updated last month
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated 3 months ago
- An R package for analyzing linguistic alignment between partners in conversation transcripts☆17Apr 29, 2026Updated 2 months ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.☆15Feb 25, 2025Updated last year
- copied from http://readable.sourceforge.net☆10Mar 13, 2016Updated 10 years ago
- get grabby with file trees☆13Mar 27, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation"☆16Jul 4, 2025Updated 11 months ago
- ☆12Nov 17, 2018Updated 7 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- zero shot NER fine tuning☆14Mar 17, 2025Updated last year
- CONFSEC's ComputeNode component of the OpenPCC standard☆19Dec 15, 2025Updated 6 months ago
- Plug-and-play document AI with zero-shot models.☆126May 11, 2026Updated last month