igorbrigadir/stopwords

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/igorbrigadir/stopwords)

igorbrigadir / stopwords

Default English stopword lists from many different sources

☆312

Alternatives and similar repositories for stopwords

Users that are interested in stopwords are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AmenRa / indxr
View on GitHub
A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.
☆23Nov 9, 2025Updated 8 months ago
lintool / IR-Reproducibility
View on GitHub
Open-Source Information Retrieval Reproducibility Challenge
☆51Jan 11, 2016Updated 10 years ago
hscells / pybool_ir
View on GitHub
Toolkit for domain-specific information retrieval experimentation
☆19May 18, 2026Updated 2 months ago
rankbiased / rbstar
View on GitHub
Rank-Biased Precision, Overlap, Recall, and Alignment
☆12Jun 15, 2026Updated last month
diazf / indri
View on GitHub
A clone of indri-5.12 with minor customizations.
☆25Sep 23, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
irgroup / repro_eval
View on GitHub
A Python Interface to Reproducibility Measures of System-Oriented IR Experiments
☆11Dec 2, 2025Updated 7 months ago
ten-blue-links / fxt
View on GitHub
A large scale feature extraction tool for text-based machine learning
☆32Sep 6, 2022Updated 3 years ago
hamed-zamani / snrm
View on GitHub
Standalone Neural Ranking Model (SNRM)
☆76Dec 26, 2018Updated 7 years ago
jjfiv / fastrank
View on GitHub
My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".
☆52Mar 3, 2025Updated last year
evhart / crees
View on GitHub
Crisis Event Extraction Service (CREES)
☆15Feb 4, 2019Updated 7 years ago
CogComp / perspectrum
View on GitHub
Perspectrum: a dataset of claims, perspectives and evidence documents
☆35Jan 16, 2020Updated 6 years ago
Yevgnen / pybrat
View on GitHub
Parser for brat rapid annotation tool.
☆15May 2, 2023Updated 3 years ago
jonocarroll / starryeyes
View on GitHub
"Oh my God! — it's full of stars!"
☆12Apr 20, 2018Updated 8 years ago
TREMA-UNH / trec-car-tools
View on GitHub
Tools for working with the TREC CAR dataset.
☆38Jul 12, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lucene4ir / lucene4ir
View on GitHub
Lucene for Information Retrieval
☆51Jan 1, 2023Updated 3 years ago
rmit-ir / polyfuse
View on GitHub
Fusion for TREC run files with popular fusion techniques
☆21Aug 26, 2022Updated 3 years ago
faneshion / DRMM
View on GitHub
CIKM 2016 paper
☆28Nov 29, 2019Updated 6 years ago
logui-framework / client
View on GitHub
A framework-agnostic client-side JavaScript library for logging user interactions on webpages.
☆19Feb 3, 2022Updated 4 years ago
teanalab / SWDM
View on GitHub
SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
☆36Aug 2, 2017Updated 8 years ago
composes-toolkit / dissect
View on GitHub
☆59Jul 14, 2015Updated 11 years ago
andrewyates / profane
View on GitHub
A library for creating complex experimental pipelines
☆12Jul 25, 2022Updated 3 years ago
MangoTheCat / franc
View on GitHub
Detect the Language of Text
☆53Jan 15, 2016Updated 10 years ago
webis-de / lightning-ir
View on GitHub
One-stop shop for running and fine-tuning transformer-based language models for retrieval
☆65Jul 9, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cvangysel / SERT
View on GitHub
Semantic Entity Retrieval Toolkit
☆111Jul 26, 2017Updated 8 years ago
eliasdabbas / radvertools
View on GitHub
Productivity and analysis tools for online marketing
☆10Aug 31, 2017Updated 8 years ago
trec-core / 2017
View on GitHub
TREC Core track
☆11Jul 5, 2017Updated 9 years ago
emory-irlab / pyterrier_genrank
View on GitHub
Generative Reranker PyTerrier
☆18Dec 1, 2025Updated 7 months ago
laurenfklein / QTM340-Fall21
View on GitHub
Notebooks and other course materials for Emory QTM 340 (Fall 2021)
☆23Jan 16, 2023Updated 3 years ago
smsubrahmannian / Topic-Modeling
View on GitHub
☆20Apr 22, 2018Updated 8 years ago
wlandau / drakeplanner
View on GitHub
A web app to create new drake projects
☆20Feb 6, 2021Updated 5 years ago
uhh-lt / targer
View on GitHub
A web application tagging and retrieval of arguments in text
☆30May 1, 2023Updated 3 years ago
khui / copacrr
View on GitHub
The code for COPACRR Neural IR model.
☆37Feb 6, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
seanmacavaney / autoqrels
View on GitHub
☆15Feb 20, 2025Updated last year
teanalab / FieldedSDM
View on GitHub
Fielded Sequential Dependence Model (code and runs)
☆32Dec 23, 2015Updated 10 years ago
randy3k / rango
View on GitHub
Calling R from Go and a better cli for the R console (WIP, nothing is working now)
☆13Aug 19, 2020Updated 5 years ago
TaddyLab / maptpx
View on GitHub
map estimation of topic models
☆19May 27, 2020Updated 6 years ago
terrierteam / pyterrier_t5
View on GitHub
☆17Apr 30, 2026Updated 2 months ago
BenjaminDHorne / Language-Features-for-News
View on GitHub
Language features used in the NELA Toolkit and other news studies
☆13Oct 14, 2020Updated 5 years ago
coolbutuseless / flagon
View on GitHub
Flags of the World
☆15Apr 3, 2020Updated 6 years ago