Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters and massive LLMs.
☆77 · Jan 8, 2026 · Updated 4 months ago
Alternatives and similar repositories for hashformers
Users that are interested in hashformers are comparing it to the libraries listed below.
- ☆10 · Jan 31, 2022 · Updated 4 years ago
- HashtagMaster: Segmentation tool for hashtags ☆12 · Oct 27, 2020 · Updated 5 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their… ☆72 · Jul 28, 2025 · Updated 9 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 · May 24, 2024 · Updated 2 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset. ☆13 · Aug 15, 2021 · Updated 4 years ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆17 · May 1, 2023 · Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 3 months ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆106 · Jan 24, 2024 · Updated 2 years ago
- Few-shot Named Entity Recognition ☆119 · Mar 30, 2022 · Updated 4 years ago
- A Python library for calculating a large variety of metrics from text ☆363 · May 5, 2026 · Updated 3 weeks ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML. ☆20 · Oct 26, 2021 · Updated 4 years ago
- Materials for PyCon 2016 in Portland, Oregon ☆10 · Aug 30, 2015 · Updated 10 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆157 · Dec 20, 2023 · Updated 2 years ago
- Transformer based Trigram Blocking implementation in Tensorflow ☆11 · Feb 26, 2020 · Updated 6 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Sep 19, 2025 · Updated 8 months ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow). ☆20 · Mar 7, 2022 · Updated 4 years ago
- 🤝 Trade any tensors over the network ☆31 · Sep 27, 2023 · Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆112 · Apr 16, 2024 · Updated 2 years ago
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation". ☆11 · Jun 24, 2022 · Updated 3 years ago
- Contains additional materials for two keras.io blog posts. ☆18 · Sep 12, 2021 · Updated 4 years ago
- Concept Modeling: Topic Modeling on Images and Text ☆223 · Nov 4, 2024 · Updated last year
- SpanMarker for Named Entity Recognition ☆473 · Apr 10, 2026 · Updated last month
- ☆69 · May 1, 2025 · Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021] ☆14 · Mar 29, 2021 · Updated 5 years ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends. ☆39 · May 8, 2026 · Updated 2 weeks ago
- Source code and data for Like a Good Nearest Neighbor ☆30 · Jan 12, 2025 · Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆221 · Jan 20, 2025 · Updated last year
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization. ☆18 · Mar 24, 2023 · Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆60 · May 11, 2023 · Updated 3 years ago
- Command Line Interface for Hugging Face Inference Endpoints ☆65 · Apr 10, 2024 · Updated 2 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 · Sep 24, 2023 · Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 · Dec 21, 2021 · Updated 4 years ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection… ☆49 · Jan 5, 2026 · Updated 4 months ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆28 · Oct 17, 2023 · Updated 2 years ago
- Implements RNNPool and SoftPool for CNNs. ☆14 · Jan 29, 2021 · Updated 5 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers. ☆20 · Apr 28, 2021 · Updated 5 years ago
- ☆16 · Jun 17, 2024 · Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer… ☆396 · May 11, 2023 · Updated 3 years ago
- This is a Python program that performs text summarization with a pronoun replacement method. This method initially identifies pronouns i… ☆10 · Dec 5, 2018 · Updated 7 years ago