stop word lists in several languages
☆21 · Mar 25, 2017 · Updated 9 years ago
Alternatives and similar repositories for many-stop-words
Users that are interested in many-stop-words are comparing it to the libraries listed below.
- Advices to look for malicious software on your devices ☆17 · May 6, 2020 · Updated 6 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 · Jan 31, 2017 · Updated 9 years ago
- ☆18 · Jan 21, 2021 · Updated 5 years ago
- Workshop bringing together individuals interested in developing curriculum, workflows, and tools to strengthen reproducibility in researc… ☆33 · Jul 12, 2015 · Updated 10 years ago
- Toolchain to retrieve and parse privacy policies from websites as described in our paper "Unifying Privacy Policy Detection" by Henry Hos… ☆17 · Apr 8, 2025 · Updated last year
- [obsolete] Python interface to Morfeusz ☆10 · Jul 3, 2017 · Updated 8 years ago
- Data Privacy Vocabulary ☆18 · Jun 29, 2022 · Updated 3 years ago
- Encryption for Journalists - Hacks/Hackers NYC ☆40 · Oct 3, 2013 · Updated 12 years ago
- Universal Forensic Indexer and Analyzer ☆10 · Jan 8, 2017 · Updated 9 years ago
- A neural network based StoryTeller that outputs a short story from an input image ☆13 · Dec 15, 2018 · Updated 7 years ago
- Using embedding-based loss functions for phonetics/speech recognition. ☆17 · Nov 24, 2014 · Updated 11 years ago
- Tools for fuzzy string search in text and dictionaries written in Java ☆10 · Dec 24, 2015 · Updated 10 years ago
- This project describes the D4M 2.0 Schema used in many Accumulo systems. ☆21 · Oct 3, 2020 · Updated 5 years ago
- Sample code for a Zapier engineering blog post ☆14 · Nov 23, 2013 · Updated 12 years ago
- ☆15 · Jun 9, 2023 · Updated 2 years ago
- PANiC - PAraphrasing Noun-Compounds ☆15 · Apr 6, 2018 · Updated 8 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆24 · Oct 26, 2022 · Updated 3 years ago
- RUSSE: Russian Semantic Evaluation. ☆15 · Mar 1, 2022 · Updated 4 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources ☆12 · Apr 12, 2018 · Updated 8 years ago
- basically all words, in a compressed form ☆17 · Jan 9, 2023 · Updated 3 years ago
- Data and code for the experiments in the Outlier Detection task proposed by Camacho-Collados et al. ☆13 · Aug 28, 2018 · Updated 7 years ago
- COVID-19 corpus with annotated biomedical entities. ☆11 · Jun 2, 2021 · Updated 4 years ago
- Watset: Automatic Induction of Synsets from a Graph of Synonyms ☆16 · Jul 7, 2019 · Updated 6 years ago
- The unofficial Raiden client (Ethereum L2 scaling solution) implementation in Rust ☆12 · Apr 11, 2024 · Updated 2 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat ☆10 · Mar 12, 2025 · Updated last year
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 · Dec 29, 2012 · Updated 13 years ago
- Python Client Library for Apache Accumulo ☆26 · Aug 1, 2020 · Updated 5 years ago
- Dataset collected from popular Russian collective blog Habrahabr.ru ☆13 · Oct 24, 2016 · Updated 9 years ago
- Extended Wikilinks dataset description ☆15 · Apr 1, 2018 · Updated 8 years ago
- A tqdm bar progress that works with MongoDB instead of console. ☆11 · Feb 21, 2022 · Updated 4 years ago
- A PredictionIO engine template using Latent Dirichlet Allocation to learn a topic model from raw text ☆12 · May 4, 2016 · Updated 10 years ago
- Windows library for hooking functions across processes, injecting DLLs into other applications, and more. (Somewhat similar to MS Detours… ☆12 · Apr 2, 2013 · Updated 13 years ago
- An Outlier Detection Project for High Dimensional Data ☆15 · Nov 25, 2017 · Updated 8 years ago
- Provides commonly requested features that are too controversial for the Xtend core library ☆32 · Jun 7, 2016 · Updated 9 years ago
- ☆18 · Apr 25, 2018 · Updated 8 years ago
- Library for bootstrapping statistics ☆22 · Nov 25, 2017 · Updated 8 years ago
- Java/python library and validator for the AIDA Interchange Format (AIF). Originally based on isi-vista/gaia-interchange. ☆21 · Jun 14, 2023 · Updated 2 years ago
- Examples of spark-lucenerdd ☆15 · Oct 6, 2023 · Updated 2 years ago
- This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word… ☆19 · Jul 6, 2023 · Updated 2 years ago