microsoft / SynthaCorpus
Tools for generating synthetic document corpora
☆13 Updated last year
Alternatives and similar repositories for SynthaCorpus:
Users who are interested in SynthaCorpus are comparing it to the libraries listed below.
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification. ☆33 Updated last year
- R library for common information retrieval metrics ☆13 Updated last year
- scripts to ease working with binary numbers ☆16 Updated 7 years ago
- Towards Neural Phrase-based Machine Translation ☆12 Updated last year
- Mobile web application shell for educational content and interactive skill practices using a chat-like interface. ☆11 Updated last year
- This repo contains a version of the LLVM test suite that is being modified to use Checked C. The modified programs will be used to ben… ☆13 Updated 2 years ago
- Brotli wheels ☆10 Updated 3 months ago
- A library for collecting features and performing inference of machine learning evaluations based on those features, useful especially in … ☆12 Updated 4 years ago
- The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop. ☆97 Updated 10 years ago
- Exploration Library in C++ ☆15 Updated last year
- A flexible data structure for low-rank (≤ 5), sparse tensors supporting slices by any dimension and Einstein summation (einsum). ☆14 Updated last month
- Samples of ML models learning from source code ☆18 Updated 2 years ago
- CyBERTron-LM is a project which collects some pre-trained Transformer-based models. ☆12 Updated last year
- Anytime Ranking for Impact-Ordered Indexes ☆12 Updated 8 years ago
- Benchmarking markdown libraries ☆11 Updated last year
- website for MS Marco ☆27 Updated 2 months ago
- Scripts to parse arxiv documents for NLP tasks ☆17 Updated last year
- M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages ☆10 Updated 7 years ago
- [Deprecated]: Exploration library ☆16 Updated last year
- Node.js based proxy to make a solr instance read-only. ☆27 Updated 8 years ago
- A Python implementation of a Python bytecode runner ☆16 Updated 5 years ago
- An Excel formula parser ☆12 Updated 5 years ago
- Apache STeVe -- a set of voting tools ☆14 Updated 3 weeks ago
- The AutoBrewML Framework accelerates the delivery of production-ready ML models with ease and efficiency. ☆24 Updated last year
- Batch IS NOT Heavy: Learning Word Representations From All Samples ☆10 Updated 6 years ago
- Tools to evaluate accuracies of various (research papers') metadata extraction libraries ☆11 Updated 9 years ago
- Manage seeds across multiple Python RNGs. ☆12 Updated 3 months ago