dariush-bahrami/character-tokenizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dariush-bahrami/character-tokenizer)

dariush-bahrami / character-tokenizer

A character tokenizer for Hugging Face Transformers

☆32

Alternatives and similar repositories for character-tokenizer

Users that are interested in character-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neelnanda-io / Neuroscope
View on GitHub
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆15Feb 13, 2023Updated 3 years ago
fengzhanglab / Joung_TFAtlas_Manuscript
View on GitHub
☆15Dec 21, 2022Updated 3 years ago
xiye17 / StructuredRegex
View on GitHub
Data and Code for StructuredRegex.
☆14Nov 16, 2023Updated 2 years ago
MAGICS-LAB / GERM
View on GitHub
[ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.
☆19Jun 19, 2025Updated last year
xiye17 / SketchRegex
View on GitHub
Sketch Driven Regular Expression Generation.
☆17Apr 26, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
IdoAmos / not-from-scratch
View on GitHub
☆33Oct 22, 2024Updated last year
Marghrid / FOREST
View on GitHub
Regular expression for form validations synthesizer
☆15Apr 17, 2025Updated last year
bernardo-de-almeida / DeepSTARR_embryo
View on GitHub
Deep learning models to predict enhancers in different Drosophila embryo tissues
☆20Dec 10, 2023Updated 2 years ago
PAIR-code / tiny-transformers
View on GitHub
☆22Updated this week
nbroad1881 / strideformer
View on GitHub
Using short models to classify long texts
☆21Mar 8, 2023Updated 3 years ago
TheDenk / hwb
View on GitHub
Hand Written Blots augmentation
☆12Aug 28, 2025Updated 11 months ago
pokaxpoka / RoGNoisyLabel
View on GitHub
Description Code for the paper "Robust Inference via Generative Classifiers for Handling Noisy Labels".
☆33Sep 18, 2019Updated 6 years ago
vadimtimakin / 2nd-place-solution-Digital-Peter
View on GitHub
The 2nd place Solution for Digital Peter competition.
☆10Jan 7, 2022Updated 4 years ago
dange-academic / real_network_datasets
View on GitHub
This project collects real network datasets used in published papers in the field of complex networks, and provides a simple Python code …
☆39Oct 14, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
affjljoo3581 / KW-Computer-Vision-AI-1st-Solution
View on GitHub
광운대학교 컴퓨터 비전 AI 경진대회 1등 솔루션입니다.
☆15Oct 5, 2022Updated 3 years ago
affjljoo3581 / G2Net-Detecting-Continuous-Gravitational-Waves
View on GitHub
🥈12th place solution on G2Net Detecting Continuous Gravitational Waves🥈
☆14Jan 4, 2023Updated 3 years ago
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
goodfire-ai / scribe-task-suite
View on GitHub
A suite of interpretability tasks to evaluate agents using Scribe for notebook access
☆18Oct 2, 2025Updated 9 months ago
SchwinnL / circuit-breakers-eval
View on GitHub
Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting
☆18Apr 15, 2025Updated last year
edwardzjl / pybox
View on GitHub
☆10Feb 11, 2025Updated last year
nicolay-r / awesome-sentiment-attitude-extraction
View on GitHub
A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…
☆19Mar 23, 2026Updated 4 months ago
oxylabs / httpx-vs-requests-vs-aiohttp
View on GitHub
See how HTTPX, Requests, and AIOHTTP libraries compare for sending network requests and find out which one may fit your case better.
☆22Sep 25, 2025Updated 10 months ago
AlignmentResearch / obfuscation-atlas
View on GitHub
The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
☆15Jul 22, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
affjljoo3581 / Google-American-Sign-Language-Fingerspelling-Recognition
View on GitHub
🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️
☆16Sep 19, 2023Updated 2 years ago
neelnanda-io / Grokking
View on GitHub
A Mechanistic Interpretability Analysis of Grokking
☆29Sep 26, 2022Updated 3 years ago
kldarek / chaii
View on GitHub
Chaii - Hindi and Tamil Question Answering Kaggle Competition Solution
☆14Nov 23, 2021Updated 4 years ago
arandilopez / laravel-feed-parser
View on GitHub
Laravel and Lumen package for parse feeds
☆12Sep 15, 2016Updated 9 years ago
utopia-group / regel
View on GitHub
REGEL: Regular Expression Generation from Examples and Language
☆36Jul 11, 2022Updated 4 years ago
TropComplique / set-transformer
View on GitHub
A neural network architecture for prediction on sets
☆24May 30, 2022Updated 4 years ago
lolpa1n / digital-peter-ocrv
View on GitHub
1st place (public LB) solution of AIJ2020 Sberbank competition (Digital Peter)
☆18Nov 22, 2020Updated 5 years ago
AbdualimovTP / datret
View on GitHub
Tensorflow implementation for structured tabular data
☆11Jan 21, 2023Updated 3 years ago
andrewcharlesjones / pcpca
View on GitHub
Probabilistic contrastive principal component analysis (PCPCA)
☆24Nov 7, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
psinger / kaggle-curriculum-solution
View on GitHub
☆17Mar 24, 2023Updated 3 years ago
jarvislabsai / fastapi-sd-template
View on GitHub
☆10Oct 24, 2022Updated 3 years ago
nkrkv / pyinsales
View on GitHub
InSales e-commerce platform API bindings
☆14Jul 13, 2024Updated 2 years ago
di37 / gemma3-270M-tinystories-pytorch
View on GitHub
A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…
☆49Sep 7, 2025Updated 10 months ago
Developmint / npm-stats-api
View on GitHub
Fetch stats for your NPM packages
☆16Dec 11, 2020Updated 5 years ago
IlyaGusev / codearkt
View on GitHub
Implementation of the CodeAct agentic framework with Docker containers for security, MCP servers for tool integrations, and multi-agent s…
☆40Oct 22, 2025Updated 9 months ago
VikhrModels / ru_llm_arena
View on GitHub
Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language
☆47Mar 20, 2025Updated last year