trec-kba / many-stop-words
stop word lists in several languages
☆21Mar 25, 2017Updated 8 years ago
Alternatives and similar repositories for many-stop-words
Users that are interested in many-stop-words are comparing it to the libraries listed below
- Workshop bringing together individuals interested in developing curriculum, workflows, and tools to strengthen reproducibility in researc…☆33Jul 12, 2015Updated 10 years ago
- ☆18Jan 21, 2021Updated 5 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Jan 31, 2017Updated 9 years ago
- Encryption for Journalists - Hacks/Hackers NYC☆40Oct 3, 2013Updated 12 years ago
- Wiktionary Parser☆28Feb 10, 2017Updated 9 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese☆10Dec 9, 2013Updated 12 years ago
- This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…☆33Aug 6, 2015Updated 10 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆57Nov 19, 2012Updated 13 years ago
- Speech ANDroid Apps☆20Jan 22, 2014Updated 12 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Dec 22, 2025Updated last month
- Grecka is a Python script to convert Greek to Greeklish based on ELOT 743☆12Aug 4, 2018Updated 7 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data source mentions that keyword.☆24Aug 25, 2013Updated 12 years ago
- A colour-coded radar chart to keep track of technologies in use, whether they are being evaluated, adopted or phased out.☆14Jan 6, 2021Updated 5 years ago
- Research on Chinese semantic similarity computation based on artificial neural networks☆11Apr 1, 2013Updated 12 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- Madek main web interface☆21Updated this week
- ☆15Jun 9, 2023Updated 2 years ago
- Roadmap for Lantern development☆12Mar 2, 2018Updated 7 years ago
- A web-based interactive 3D molecular viewer with Augmented Reality & Holographic Display.☆12May 9, 2019Updated 6 years ago
- TiO is an AirBnB like android app demo developed from a hackathon. I developed it with another Android developer, a backend, and a UI des…☆11May 30, 2016Updated 9 years ago
- Sourcecode & CAD drawings of NimbRo-OP☆27Oct 30, 2012Updated 13 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling☆10Mar 27, 2014Updated 11 years ago
- Get notified instantly when your users of interest speak about something.☆10Mar 24, 2020Updated 5 years ago
- Node interface which parses sentences into grammatical structures☆12May 31, 2017Updated 8 years ago
- Proof of concept implementation of a cyber threat intelligence and incident handling platform☆11Feb 10, 2023Updated 3 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Jul 11, 2013Updated 12 years ago
- An application that open source projects can use to ensure they include relevant documentation (and not secrets or PII!)☆10Mar 29, 2021Updated 4 years ago
- Super efficient TCP connection between remote processes☆12Apr 7, 2016Updated 9 years ago
- Python parser for the Archie Markup Language (ArchieML)☆12Nov 7, 2021Updated 4 years ago
- Malware - Machine Learning☆11Mar 24, 2018Updated 7 years ago
- A simple web framework based on asyncio.☆25Sep 25, 2016Updated 9 years ago
- A distributed machine learning algorithm for new word discovery.☆15Jul 21, 2014Updated 11 years ago
- The secure, transparent, auditable, reliable electronic voting system☆14Oct 6, 2016Updated 9 years ago
- A WeChat official account, developed in Java, that integrates LBS queries, music, and reading☆10May 1, 2017Updated 8 years ago
- Tools for fuzzy string search in text and dictionaries written in Java☆10Dec 24, 2015Updated 10 years ago
- A curated list of awesome cyber civil society actors, projects, etc.☆10Jul 16, 2020Updated 5 years ago
- ☆15Feb 14, 2012Updated 14 years ago
- Data Driven Journalism Handbook☆23Sep 23, 2012Updated 13 years ago
- COVID-19 corpus with annotated biomedical entities.☆11Jun 2, 2021Updated 4 years ago