microsoft / Computational-Use-of-Data-Agreement
Computational Use of Data Agreement - Removing Barriers to Data Innovation
☆19 Updated last year
Related projects
Alternatives and complementary repositories for Computational-Use-of-Data-Agreement
- ☆48 Updated 5 years ago
- Open Use of Data Agreement - Removing Barriers to Data Innovation ☆17 Updated 3 years ago
- Democratizing NLP! ☆105 Updated 11 months ago
- website for MS Marco ☆27 Updated 3 weeks ago
- Recipes for training OpenNMT systems ☆14 Updated 7 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers` ☆18 Updated 2 years ago
- Deep learning spelling patterns with a recurrent neural network ☆12 Updated 7 years ago
- Learning Based Java (LBJava) ☆13 Updated 2 years ago
- English text corrector by using deep neural networks in Pytorch ☆47 Updated 7 years ago
- Python package to compute metrics on an NLU intent parsing pipeline ☆13 Updated 4 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- Doing things with embeddings ☆64 Updated 2 years ago
- ☆21 Updated 7 years ago
- A monolingual parallel corpus for sentence simplification ☆11 Updated 8 years ago
- Launch NMT tasks on the cloud ☆13 Updated last year
- Various utilities for processing the data. ☆207 Updated this week
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆65 Updated this week
- ☆21 Updated 2 years ago
- Manage seeds across multiple Python RNGs. ☆12 Updated last month
- c++ mosestokenizer ☆16 Updated 8 months ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp… ☆16 Updated 2 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences ☆20 Updated 4 years ago
- jiant-dev ☆28 Updated 3 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆86 Updated 3 years ago
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages. ☆47 Updated 3 years ago
- Relational NLP: Convert text into relational facts. ☆9 Updated 4 years ago
- bin files ☆13 Updated 2 months ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆42 Updated 4 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆65 Updated 4 years ago
- Repository for the ACL 2020 virtual conference website (work in progress) ☆38 Updated 2 years ago