MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.
☆58Feb 2, 2018Updated 8 years ago
Alternatives and similar repositories for MinorThird
Users that are interested in MinorThird are comparing it to the libraries listed below.
- R-toolbox-py☆25Aug 3, 2015Updated 10 years ago
- A system for disambiguating toponyms (placenames) given textual context and creating visualizations of the locations referenced in a give…☆19Jul 24, 2013Updated 12 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Apr 5, 2017Updated 8 years ago
- ☆13Jun 14, 2016Updated 9 years ago
- The accompanying code and data for the Springer 2017 publication "What's missing in geographical parsing?" in Language Resources and Eval…☆18Oct 17, 2019Updated 6 years ago
- The distributed statistical machine translation infrastructure consisting of load balancing, text pre/post-processing and translation ser…☆12Nov 29, 2018Updated 7 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Sep 16, 2014Updated 11 years ago
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago
- Distinguishing between anime and hentai☆16Jan 29, 2017Updated 9 years ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Sep 4, 2016Updated 9 years ago
- LDA workshop presented by Fast Forward Labs☆15Jun 26, 2019Updated 6 years ago
- Implicit relation extractor using a natural language model.☆24May 25, 2018Updated 7 years ago
- Easily identify and label sentence intervals using various taggers.☆16Feb 1, 2017Updated 9 years ago
- Ranking Entity Types using the Web of Data☆30Nov 22, 2016Updated 9 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Jan 9, 2023Updated 3 years ago
- Wikipedia-based keyword extraction tool in Java☆21May 11, 2015Updated 10 years ago
- Automatically exported from code.google.com/p/relation-extraction-corpus☆57Dec 14, 2015Updated 10 years ago
- Cyberinfrastructure Shell (CIShell) is an open source, community-driven framework/application for the integration and utilization of data…☆31Nov 28, 2018Updated 7 years ago
- TextFlows is an open-source online platform for composition, execution, and sharing of interactive text mining and natural language proce…☆19Dec 1, 2017Updated 8 years ago
- A core recommendation engine that uses Spark's MLlib and HBase for modeling and Hive for data cleaning; tested on Spark on YARN☆30Mar 9, 2017Updated 9 years ago
- XPath extension for extraction from interactive web sites. NOTE: This code is currently out of sync. A more recent, but precompiled versi…☆27Feb 27, 2013Updated 13 years ago
- A tool for calculating semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Feb 13, 2016Updated 10 years ago
- Semantic Parser with Execution☆13Dec 8, 2017Updated 8 years ago
- A (massive) DNS tool (reverse lookup for now)☆12Jul 6, 2022Updated 3 years ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark☆41Apr 10, 2020Updated 5 years ago
- Simplified implementations of deep learning related works☆13Oct 6, 2016Updated 9 years ago
- A Python module that provides content extraction and summarization of a web page, even if the web page is broken.☆18Apr 14, 2023Updated 2 years ago
- This is a Java library which can be used to crawl the content of some web properties (www.salesforce.com, blogs.salesforce.com for exa…☆25May 15, 2025Updated 10 months ago
- Ranking of fine-tuned HF models as base models.☆36Sep 17, 2025Updated 6 months ago
- OpenAL JNA Wrappers for Java☆12Nov 23, 2015Updated 10 years ago
- Linked Data tools for SMEs☆16Oct 3, 2016Updated 9 years ago
- Sample AWS Batch project to read CSV files☆11Oct 22, 2017Updated 8 years ago
- Demo of random projections at BerlinBuzzwords 2015☆22Feb 25, 2020Updated 6 years ago
- Amsterdam Content Analysis Toolkit☆46Jul 6, 2022Updated 3 years ago
- easily create APIs from GeoJSON files☆14Dec 5, 2017Updated 8 years ago
- Explains how to use pycrfsuite through examples☆10Nov 7, 2016Updated 9 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o…☆22Apr 3, 2018Updated 7 years ago
- Online Random Bit Regression with FTRL-Proximal in Python☆75Nov 24, 2015Updated 10 years ago