allenai / pipeline
Library for building reproducible data pipelines to support experimentation
☆20Updated 9 years ago
Alternatives and similar repositories for pipeline:
Users that are interested in pipeline are comparing it to the libraries listed below
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- A little text processing library for Scala.☆28Updated 9 years ago
- A library of machine learning algorithms implemented using principles of functional programming.☆23Updated 8 years ago
- Using deep learning to POS tag sentences using scala + DL4J☆37Updated 10 years ago
- Saul : Declarative Learning-Based Programming☆64Updated 5 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Scala port of the word2vec toolkit.☆11Updated 8 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆42Updated 8 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 8 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- A Scala wrapper for the Stanford NER (named entity recognition) tool.☆12Updated 9 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- (Weighted) Finite State Transducers for Scala NLP☆21Updated 10 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- A Neural network implementation with Scala☆20Updated 8 years ago
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- Gust is a set of GPU extensions for Breeze.☆33Updated 10 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- A scala library for IBM ILOG CPLEX☆19Updated 5 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Example code to explore for using DL4J in Scala.☆19Updated 9 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago