Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters and massive LLMs.
☆77Jan 8, 2026Updated last month
Alternatives and similar repositories for hashformers
Users who are interested in hashformers are comparing it to the libraries listed below
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- HashtagMaster: Segmentation tool for hashtags☆12Oct 27, 2020Updated 5 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆105Jan 24, 2024Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated last month
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 7 months ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 3 years ago
- Contains additional materials for two keras.io blog posts.☆17Sep 12, 2021Updated 4 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆21Oct 26, 2021Updated 4 years ago
- Concept Modeling: Topic Modeling on Images and Text☆220Nov 4, 2024Updated last year
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers.☆20Apr 28, 2021Updated 4 years ago
- Few-shot Named Entity Recognition☆121Mar 30, 2022Updated 3 years ago
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- ☆69May 1, 2025Updated 10 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆36Oct 16, 2025Updated 4 months ago
- A UI automation engine☆11Aug 14, 2025Updated 6 months ago
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation".☆11Jun 24, 2022Updated 3 years ago
- 🤝 Trade any tensors over the network☆31Sep 27, 2023Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆360Jan 30, 2026Updated last month
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆110Apr 16, 2024Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆38May 26, 2025Updated 9 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.☆14Mar 9, 2022Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- A Simulator for Traffic Intersection based on Crossroads technique☆10Dec 4, 2019Updated 6 years ago
- Materials for PyCon 2016 in Portland, Oregon☆10Aug 30, 2015Updated 10 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Feb 26, 2020Updated 6 years ago
- LinkedIn Web Scraper☆10Mar 3, 2021Updated 5 years ago
- SpanMarker for Named Entity Recognition☆464Jan 8, 2025Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- Edo Liberty's class notes from the course Algorithms in Data Mining given at Tel Aviv University in academic years 2011-2013☆26May 20, 2022Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- A simple tkinter GUI for illustrating DFS and BFS.☆12Jun 26, 2020Updated 5 years ago
- DL4CV book☆10Sep 18, 2018Updated 7 years ago
- This repository contains the codebase mentioned and used in trains' blogs☆11Jul 25, 2025Updated 7 months ago
- Includes additional materials for the following keras.io blog post.☆12Jun 23, 2021Updated 4 years ago
- Repository of data and code to use the models described in the paper "Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia…☆11Nov 21, 2022Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago