willf/segment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/willf/segment)

willf / segment

A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']

☆79

Alternatives and similar repositories for segment

Users that are interested in segment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

reddragon / is-dirty
View on GitHub
A very naive classifier to figure out if a sentence contains dirty words
☆33Jul 7, 2015Updated 11 years ago
jmhessel / BasicSpearmint
View on GitHub
A simple tool for small scale experiments using bayesian optimization
☆35Aug 14, 2018Updated 7 years ago
filannim / ManTIME
View on GitHub
Cross-domain temporal information extractors: temporal expressions, events and temporal links.
☆21Oct 29, 2015Updated 10 years ago
abhishek-kumar / NNForMLL
View on GitHub
Neural Network Models for Multi-label learning
☆16Oct 13, 2020Updated 5 years ago
swabhs / joint-lstm-parser
View on GitHub
Transition-based joint syntactic dependency parser and semantic role labeler using a stack LSTM RNN architecture.
☆61Apr 5, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
parisk / chrome-rest
View on GitHub
REST API for controlling Google Chrome
☆13Sep 23, 2015Updated 10 years ago
jlouback / nlp-viterbi
View on GitHub
☆12Jan 10, 2016Updated 10 years ago
semanticize / st
View on GitHub
Semanticizest: dump parser and client
☆20May 11, 2016Updated 10 years ago
o19s / lazy-semantic-indexing
View on GitHub
Elasticsearch Latent Semantic Indexing experimentation
☆32Oct 18, 2019Updated 6 years ago
pdasigi / neural-semantic-encoders
View on GitHub
Reimplementation of Munkhdalai et al's Neural Semantic Encoders (https://arxiv.org/pdf/1607.04315v2.pdf)
☆59Oct 28, 2016Updated 9 years ago
ai-ku / wkmeans
View on GitHub
k-means algorithm with (optional) instance weights.
☆15Mar 7, 2015Updated 11 years ago
rohit-jain / parzer
View on GitHub
Statistical Dependency Parser using SVM as proposed by Yamada et al
☆29Feb 10, 2016Updated 10 years ago
jimbelton / wikidata
View on GitHub
Tools for working with wikidata (structured data from wikipedia)
☆13Apr 26, 2016Updated 10 years ago
bcho / donkey
View on GitHub
A simple cron-like library for executing scheduled jobs.
☆21Jun 27, 2015Updated 11 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
marklit / airline-passenger-counts
View on GitHub
Commercial Airline Passenger Counts between airports (for latest year reported, usually 2013)
☆31Aug 13, 2015Updated 10 years ago
ellisonbg / vizarray
View on GitHub
A Python package for visualizing 1d and 2d NumPy arrays
☆18Dec 31, 2015Updated 10 years ago
hickeroar / simplebayes
View on GitHub
A memory-based, optional-persistence naive Bayesian text classification package and web API for Python.
☆36Feb 24, 2026Updated 4 months ago
max-ionov / russian-anaphora
View on GitHub
System for automatic pronominal resolution for Russian
☆13Apr 3, 2020Updated 6 years ago
lezhang7 / TreeMix
View on GitHub
[NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
☆10Jul 15, 2023Updated 3 years ago
j2kun / segment
View on GitHub
Python code and data for the post "Word Segmentation, or Makingsenseofthis"
☆16Oct 24, 2022Updated 3 years ago
mcoavoux / mtg
View on GitHub
Statistical discontinuous constituent parsing
☆11Feb 15, 2018Updated 8 years ago
oir / deep-recursive
View on GitHub
Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)
☆28Feb 6, 2015Updated 11 years ago
forcedotcom / distributions
View on GitHub
Low-level primitives for collapsed Gibbs sampling in python and C++
☆33Mar 27, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lintool / IR-Reproducibility
View on GitHub
Open-Source Information Retrieval Reproducibility Challenge
☆51Jan 11, 2016Updated 10 years ago
tanyaschlusser / office-nfl-pool
View on GitHub
A fun introduction to Pandas andScikit-Learn using nfl data
☆45Jan 12, 2016Updated 10 years ago
biesnecker / cleveland
View on GitHub
Simple asyncio-based actors.
☆38Jun 22, 2024Updated 2 years ago
siemanko / a3c
View on GitHub
Asynchronous Advantage Actor Critic
☆20Aug 15, 2016Updated 9 years ago
aeye-lab / EMNLP-SyntheticScanpaths-NLU-PretrainedLM
View on GitHub
☆11May 24, 2024Updated 2 years ago
superisaac / pycetr
View on GitHub
Python implementation of CETR: Content Extraction via Tag Ratios
☆13Jan 18, 2012Updated 14 years ago
jaberg / skdata
View on GitHub
Data sets for machine learning in Python
☆478Jul 27, 2017Updated 8 years ago
semanticize / semanticizest
View on GitHub
Standalone Semanticizer
☆32Mar 4, 2015Updated 11 years ago
alexeygrigorev / classifying-crisis-reports-dsc
View on GitHub
The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge
☆20Sep 9, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ardydedase / apiwrapper
View on GitHub
API Wrapper in Python with polling and request callbacks.
☆40Jan 25, 2026Updated 5 months ago
sronnqvist / topicMap
View on GitHub
Exploratory topic modeling with distributional semantics and interactive visualization
☆18Jan 11, 2017Updated 9 years ago
abietti / stochs
View on GitHub
stochs: fast stochastic solvers for machine learning in C++ and Cython
☆27Oct 13, 2022Updated 3 years ago
matejbalog / gumbel-relatives
View on GitHub
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17Jun 14, 2017Updated 9 years ago
tankle / LDA
View on GitHub
Latent Dirichlet Allocation
☆13Jun 17, 2015Updated 11 years ago
NLeSC / spudisc-emotion-classification
View on GitHub
☆16Jul 29, 2015Updated 10 years ago
gallupliu / keras-quora-question-pairs
View on GitHub
A Keras model that addresses the Quora Question Pairs dyadic prediction task.
☆14Feb 18, 2017Updated 9 years ago