chimpler / blog-spark-naive-bayes-reuters
A simple example of how to use Naive Bayes on Spark with the popular Reuters-21578 dataset
☆23 · Jul 20, 2014 · Updated 11 years ago
Alternatives and similar repositories for blog-spark-naive-bayes-reuters
Users who are interested in blog-spark-naive-bayes-reuters are comparing it to the repositories listed below
- taken from https://github.com/caesar0301/awesome-public-datasets ☆14 · Jun 12, 2017 · Updated 8 years ago
- ☆12 · Nov 22, 2024 · Updated last year
- MS Marco Entity Annotations Disambiguation ☆13 · May 19, 2023 · Updated 2 years ago
- [TPAMI-2018] A C++ framework for training/testing Support Vector Machine with Gaussian Sample Uncertainty (SVM-GSU). ☆13 · Feb 20, 2018 · Updated 7 years ago
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web ☆11 · Mar 8, 2021 · Updated 4 years ago
- Sample Python code for working with the HBase REST interface ☆24 · Jul 25, 2013 · Updated 12 years ago
- Document classification with Apache Spark on an American Classic ☆10 · Sep 25, 2015 · Updated 10 years ago
- Write JDBC ResultSet to Parquet File ☆11 · Apr 14, 2025 · Updated 10 months ago
- Meniscus - The Python Event Logging Service ☆65 · May 17, 2015 · Updated 10 years ago
- Build resume and cover letter from your portfolio for each job description with LLMs ☆11 · Feb 23, 2025 · Updated 11 months ago
- A Clojure library for use case driven development ☆11 · Dec 25, 2017 · Updated 8 years ago
- ☆10 · Aug 15, 2017 · Updated 8 years ago
- Winning data science solution for Energy Hack NL 2018. Sonnet: forecasting station load caused by solar panels. ☆11 · May 28, 2018 · Updated 7 years ago
- A project which does the ColBERT pruning based on the LP or L1 norm ☆19 · Jun 11, 2025 · Updated 8 months ago
- Experimental IMAP gateway to notmuch ☆13 · Nov 21, 2023 · Updated 2 years ago
- A platform for collecting, analyzing, and visualizing social media data. ☆13 · Dec 27, 2020 · Updated 5 years ago
- ☆14 · Oct 18, 2024 · Updated last year
- Compile Markdown to React component ☆13 · Aug 12, 2023 · Updated 2 years ago
- Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019) ☆12 · May 25, 2019 · Updated 6 years ago
- ☆12 · Sep 13, 2025 · Updated 5 months ago
- Dataset for the ACL 2015 paper: Learning to Explain Entity Relationships in Knowledge Graphs ☆11 · Oct 22, 2015 · Updated 10 years ago
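- The "Daguan Cup" Long-Text Intelligent Processing Challenge. Daguan Data provides a set of long-text data with category labels and invites participants to apply their own ingenuity, together with current state-of-the-art NLP and AI techniques, to analyze the internal structure and semantics of the text in depth and build text classification models that classify accurately. ☆10 · Jul 20, 2018 · Updated 7 years ago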
- browse linkedin profiles without a registered account ☆14 · Oct 20, 2016 · Updated 9 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆33 · Nov 21, 2025 · Updated 2 months ago
- OS X project that captures audio and immediately plays back using AudioQueues ☆11 · Aug 10, 2013 · Updated 12 years ago
- ☆16 · Oct 6, 2023 · Updated 2 years ago
- ☆11 · Nov 2, 2022 · Updated 3 years ago
- Use Watson Knowledge Studio and Watson Discovery to analyze shipping and procurement information ☆16 · Sep 17, 2025 · Updated 5 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages". ☆13 · Sep 17, 2021 · Updated 4 years ago
- An ansible playbook to deploy a ready-to-use nextcloud w/ collabora based on https://brendan.abolivier.bzh/your-own-google-drive-docs/ fr… ☆12 · Sep 20, 2019 · Updated 6 years ago
- code for paper "Feature-Budgeted Random Forest" ICML 2015 ☆11 · May 10, 2017 · Updated 8 years ago
- Utility for cui2vec in Go ☆13 · Feb 25, 2023 · Updated 2 years ago
- Bayesian personalized feature interaction selection ☆13 · Aug 25, 2021 · Updated 4 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome… ☆12 · Jan 1, 2023 · Updated 3 years ago
- A repo to allow validation of performance results in the knor paper and provide a fast, scalable k-means implementation. ☆15 · Mar 31, 2020 · Updated 5 years ago
- Code & data for Fast data processing with Spark V2 ☆14 · Feb 1, 2015 · Updated 11 years ago
- Distributed Virtual waiting room ☆11 · Feb 3, 2026 · Updated 2 weeks ago
- ☆16 · Oct 28, 2024 · Updated last year
- ☆12 · Aug 15, 2022 · Updated 3 years ago