A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear are currently embedded.
☆48Sep 24, 2021Updated 4 years ago
Alternatives and similar repositories for TextClassification
Users that are interested in TextClassification are comparing it to the libraries listed below
Sorting:
- The SRL-based Open IE extractor. A principal component of Open IE 4.0.☆19Oct 31, 2017Updated 8 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Jan 9, 2023Updated 3 years ago
- A general purpose graph library☆11Jun 21, 2018Updated 7 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆284Apr 25, 2018Updated 7 years ago
- Use Avro to store all your values in HBase instead of regular columns☆75Dec 1, 2017Updated 8 years ago
- Item based retrieval engine with Bayesian Sets.☆20Jun 25, 2013Updated 12 years ago
- ☆70Aug 9, 2021Updated 4 years ago
- Seed acquisition tool to bootstrap focused crawlers☆23Apr 24, 2017Updated 8 years ago
- Java GC log parser for Oracle JDK☆13Feb 9, 2015Updated 11 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Mar 9, 2016Updated 10 years ago
- Page Clipper Bookmarklet☆21Nov 14, 2015Updated 10 years ago
- HTML parser and tag balancer.☆19Mar 12, 2026Updated last week
- FCTT代码仓库☆10May 22, 2018Updated 7 years ago
- Fast way to create bookmarklets, inject jQuery on the fly also☆20Dec 12, 2017Updated 8 years ago
- Parquet IO for Tablesaw☆12Mar 2, 2026Updated 2 weeks ago
- Collaborative Synchronized Corpus Annotation Tool☆11Dec 31, 2018Updated 7 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- Convert tag files (ctags, gccxml, etc) to databases (sqlite, mysql, etc)☆13Mar 30, 2015Updated 10 years ago
- no bullshit ascii diagramming☆10Feb 28, 2021Updated 5 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.☆27Apr 23, 2014Updated 11 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- The BES framework, which forms the basis for the Hyrax server☆16Mar 13, 2026Updated last week
- Mind map tool to add nodes by ENTER and TAB☆11Jun 7, 2021Updated 4 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus☆44May 12, 2025Updated 10 months ago
- A WYSIWYG Editor for ASCII Diagrams☆19Mar 10, 2024Updated 2 years ago
- Apache Nutch fork tunned for web services and data discovery.☆10May 18, 2015Updated 10 years ago
- Repository for revision of PREMIS OWL ontology group☆13May 12, 2022Updated 3 years ago
- The CMR Metadata Review tool is used to curate NASA EOSDIS collection and granule level metadata in CMR for correctness, completeness and…☆25Sep 4, 2025Updated 6 months ago
- Create CovJSON files from common scientific data formats☆14Apr 24, 2018Updated 7 years ago
- Ice is a rapid information extraction customizer☆15Apr 26, 2021Updated 4 years ago
- Pattern-of-Behavior Search Tool☆11Jun 20, 2022Updated 3 years ago
- ifcParserLib is a set of reusable Java components that implement functionality for IFC file parsing.☆10Oct 14, 2020Updated 5 years ago
- Mirror of Apache Edgent (Incubating) Samples☆15Feb 14, 2018Updated 8 years ago
- RESTful wrapper for the Joshua machine translation decoder☆14Oct 25, 2016Updated 9 years ago
- Table Sorter☆21Feb 28, 2017Updated 9 years ago
- An OpenStreetMap Visualization Toolkit for Python☆30Dec 18, 2017Updated 8 years ago
- realtime search/indexing system☆59May 27, 2014Updated 11 years ago
- Package provides java implementation of the latent dirichlet allocation (LDA) for topic modelling☆10May 18, 2017Updated 8 years ago
- A Java library for Stochastic Gradient Descent (SGD)☆22Nov 1, 2021Updated 4 years ago