trec-kba/many-stop-words

stop word lists in several languages

☆21

Alternatives and similar repositories for many-stop-words

Users that are interested in many-stop-words are comparing it to the libraries listed below.

ddhira123 / Stop-Words-List
The stop words list for all languages around the world made by the contributors around the world! Start your contributions now!
☆14 · Jun 9, 2025 · Updated last year
paulproteus / dirtbike
Dirtbike turns system-installed Python packages ("distributions") into Python wheels
☆13 · Apr 12, 2020 · Updated 6 years ago
nikita-smetanin / fuzzy-search-tools
Tools for fuzzy string search in text and dictionaries written in Java
☆10 · Dec 24, 2015 · Updated 10 years ago
s4weng / word2phrase
Words -> Phrases; NLP
☆11 · Apr 8, 2016 · Updated 10 years ago
akopich / dplsa
Distributed implementation of Robust PLSA using Spark
☆12 · Apr 29, 2021 · Updated 5 years ago
kristopherkyle / corpus_toolkit
A simple toolkit for conducting analyses using corpus methods
☆28 · Nov 11, 2021 · Updated 4 years ago
syhw / speech_embeddings
Using embedding-based loss functions for phonetics/speech recognition.
☆17 · Nov 24, 2014 · Updated 11 years ago
WSOL12 / Pumpfun-Sniper-Bot
A fast and automated Solana PumpFun sniper bot that detects new token launches, monitors market conditions, and executes buy/sell trades …
☆17 · Feb 14, 2026 · Updated 5 months ago
dustalov / watset
Watset: Automatic Induction of Synsets from a Graph of Synonyms
☆16 · Jul 7, 2019 · Updated 7 years ago
vaskonov / burvec
Word Embeddings for Low Resource Languages: The Case of Buryat
☆10 · Mar 12, 2025 · Updated last year
EmergentOrder / template-scala-topic-model-LDA
A PredictionIO engine template using Latent Dirichlet Allocation to learn a topic model from raw text
☆12 · May 4, 2016 · Updated 10 years ago
rakanalh / raiden.rs
The unofficial Raiden client (Ethereum L2 scaling solution) implementation in Rust
☆12 · Apr 11, 2024 · Updated 2 years ago
ajschumacher / nypd
archive NYPD crime data PDFs
☆14 · Dec 12, 2017 · Updated 8 years ago
anolivetree / goncurrent
Golang like channels and select for Java
☆14 · Aug 4, 2015 · Updated 10 years ago
penn-nlp / mmid
Words and their images in 98 languages
☆14 · Mar 1, 2019 · Updated 7 years ago
swannodette / cljs-master
ClojureScript Master Class
☆17 · Sep 14, 2019 · Updated 6 years ago
davidcampos / covid19-corpus
COVID-19 corpus with annotated biomedical entities.
☆11 · Jun 2, 2021 · Updated 5 years ago
kevinastone / django-descriptors
Demonstration of using Python Descriptors to Enhance Django Models and Fields
☆17 · Jul 2, 2015 · Updated 11 years ago
datanews / mean-streets
Data on 268 New York City traffic deaths in 2014.
☆10 · Feb 19, 2015 · Updated 11 years ago
zouzias / spark-lucenerdd-examples
Examples of spark-lucenerdd
☆15 · Oct 6, 2023 · Updated 2 years ago
java10000 / semantic_similarity_based_on_ANN
Research on computing Chinese semantic similarity with artificial neural networks
☆11 · Apr 1, 2013 · Updated 13 years ago
alexanderpanchenko / sim-eval
A tool for evaluation of semantic similarity measures.
☆22 · Feb 3, 2013 · Updated 13 years ago
undertherain / vsmlib
Python library for vector space models
☆13 · Jun 14, 2018 · Updated 8 years ago
felixrieseberg / electron-comrade
Run Electron apps with different versions or builds of Electron
☆19 · Jan 24, 2019 · Updated 7 years ago
jmccrae / lemon.patterns
Design patterns for the ontology-lexicon interface using lemon and OWL
☆21 · Jul 27, 2018 · Updated 8 years ago
kamujun / elmo_experiments
Experiments of ELMo that deep contextualized word representation in Keras with Tensorflow Hub.
☆14 · Jun 12, 2018 · Updated 8 years ago
OpenNewsLabs / centipede
Service-based pipelines for document processing
☆17 · Nov 9, 2014 · Updated 11 years ago
cherryyingzizhang / TiO
TiO is an AirBnB like android app demo developed from a hackathon. I developed it with another Android developer, a backend, and a UI des…
☆11 · May 30, 2016 · Updated 10 years ago
Sotera / mitie-trainer
Model Training tool for MITIE
☆79 · Jul 7, 2015 · Updated 11 years ago
alexcrichton / wasm-sodium
PoC of libsodium being used in Rust on wasm32-unknown-unknown
☆25 · May 22, 2018 · Updated 8 years ago
orenmel / synth-clinical-notes
☆15 · Apr 28, 2020 · Updated 6 years ago
DS3Lab / cognival
CogniVal: cognitive word embedding evaluation
☆16 · Dec 8, 2022 · Updated 3 years ago
datadesk / noaa-wildfires
Download wildfires data from NOAA satellites
☆15 · Updated this week
bgrimstad / censo
CENSO is a framework for global optimization of nonconvex, spline-constrained MINLP problems
☆14 · Apr 28, 2019 · Updated 7 years ago
x-hansong / Crystal
A crawler built on scrapy+selenium+phantomjs that scrapes academic talk listings from multiple schools
☆10 · Sep 3, 2015 · Updated 10 years ago
talos / mta-service-status-archive
A git-powered archive of http://web.mta.info/status/serviceStatus.txt
☆12 · Jul 21, 2018 · Updated 8 years ago
summa-platform / summa-oss
Meta-repository for the open-source version of the SUMMA Platform
☆16 · Mar 25, 2024 · Updated 2 years ago
vgrabovets / multi_rake
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
☆272 · Jul 20, 2023 · Updated 3 years ago
hyperstudio / parserbot
Web-based synthesis of nifty NLP and entity extraction services
☆13 · Oct 25, 2019 · Updated 6 years ago