microsoft / SynthaCorpusLinks
Tools for generating synthetic document corpora
☆13Updated 2 years ago
Alternatives and similar repositories for SynthaCorpus
Users that are interested in SynthaCorpus are comparing it to the libraries listed below
Sorting:
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated 2 years ago
- This repo contains a version of the LLVM test suite that is being modified to use Checked C. The modified programs will be used to ben…☆13Updated 2 years ago
- scripts to ease working with binary numbers☆16Updated 7 years ago
- The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop.☆97Updated 10 years ago
- Mobile web application shell for educational content and interactive skill practices using a chat-like interface.☆11Updated 2 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi…☆14Updated 11 years ago
- A flexible data structure for low-rank (≤ 5), sparse tensors supporting slices by any dimension and Einstein summation (einsum).☆14Updated 7 months ago
- ☆12Updated 7 years ago
- Exploration Library in C++☆15Updated 2 years ago
- A Python implementation of a Python bytecode runner☆16Updated 6 years ago
- Tool to create and edit Pikov pixel art Markov chain animations.☆17Updated 4 years ago
- Extended docker build tool.☆14Updated 2 years ago
- Samples of ML models learning from source code☆19Updated 2 years ago
- Towards Neural Phrase-based Machine Translation☆12Updated 2 years ago
- This repository provides code for SVD and Importance sampling-based algorithms for large scale topic modeling.☆14Updated 4 years ago
- ☆18Updated 4 years ago
- OGDL for C☆17Updated 7 years ago
- Backup of config files used by jenkins.freebsd.org☆8Updated 9 years ago
- Mirror of official clang git repository located at http://llvm.org/git/clang. Updated every five minutes.☆12Updated 2 years ago
- Build projects required for SCXCore (Operations Manager) agent☆14Updated last month
- An experimental patchset management tool.☆12Updated 4 years ago
- An evolutionary multi-start algorithm for the Steiner Tree Problem in graphs with a fast local search.☆13Updated 2 years ago
- MetaSync☆20Updated 9 years ago
- A benchmark that simulates the 'incast' network traffic pattern.☆13Updated 5 years ago
- High performance JSON manipulation library☆13Updated last week
- Source code for North American Eclipse 2017 Megamovie project☆20Updated last year
- The Scalable Hyperlink Store☆15Updated 11 years ago
- For interacting with nutch via Python☆29Updated 3 months ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 9 years ago
- A conda-smithy repository for scikit-learn.☆7Updated 8 years ago