microsoft / SynthaCorpus
Tools for generating synthetic document corpora
☆13 Updated last year
Alternatives and similar repositories for SynthaCorpus:
Users who are interested in SynthaCorpus are comparing it to the libraries listed below.
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification. ☆34 Updated last year
- The Scalable Hyperlink Store ☆15 Updated 11 years ago
- A flexible data structure for low-rank (≤ 5), sparse tensors supporting slices by any dimension and Einstein summation (einsum). ☆14 Updated 4 months ago
- This repo contains a version of the LLVM test suite that is being modified to use Checked C. The modified programs will be used to ben… ☆13 Updated 2 years ago
- Anytime Ranking for Impact-Ordered Indexes ☆12 Updated 8 years ago
- A library for collecting features and performing inference of machine learning evaluations based on those features, useful especially in … ☆12 Updated 4 years ago
- The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop. ☆97 Updated 10 years ago
- Exploration Library in Java ☆12 Updated last year
- Build an algorithm to predict friendships, then actually use it to meet people ☆95 Updated 11 years ago
- Samples of ML models learning from source code ☆19 Updated 2 years ago
- ☆25 Updated 6 years ago
- Mobile web application shell for educational content and interactive skill practices using a chat-like interface. ☆11 Updated last year
- M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages ☆10 Updated 8 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi… ☆14 Updated 11 years ago
- Some variations on Lemire's Fast Random Integer Generation in an Interval ☆15 Updated 5 years ago
- Grep Front end code ☆14 Updated last year
- Scripts to ease working with binary numbers ☆16 Updated 7 years ago
- A framework for building reranking models. ☆28 Updated 9 years ago
- Scripts to parse arxiv documents for NLP tasks ☆17 Updated last year
- FoLiA library for C++ ☆16 Updated last month
- Deterministic Acyclic Finite State Automaton implementation for morphological analysis ☆18 Updated 4 years ago
- A Python implementation of a Python bytecode runner ☆16 Updated 5 years ago
- git://git.savannah.gnu.org/patch.git ☆12 Updated 6 months ago
- ☆12 Updated 7 years ago
- Exploration Library in C++ ☆15 Updated last year
- Python functions for popular relevance metrics (ndcg, err, etc) ☆16 Updated last year
- Towards Neural Phrase-based Machine Translation ☆12 Updated last year
- ☆21 Updated 4 years ago
- High performance JSON manipulation library ☆13 Updated 2 months ago
- Apache STeVe -- a set of voting tools ☆14 Updated last month