amazon-science / datatuner
Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper
☆92Updated 3 years ago
Alternatives and similar repositories for datatuner
Users who are interested in datatuner are comparing it to the libraries listed below.
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆234Updated 2 years ago
- Machine Learning for Information Retrieval☆86Updated 4 months ago
- Explainable Zero-Shot Topic Extraction☆63Updated last year
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆81Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆243Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- Database Reasoning Over Text project for ACL paper☆350Updated 3 years ago
- Automatically labeling training data☆107Updated 6 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆67Updated 4 years ago
- An extensible framework for building visualization and annotation tools to enable better interaction with NLP and Artificial Intelligence…☆49Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated this week
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆68Updated 3 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset☆127Updated 4 years ago
- Intelligence Task Ontology (ITO)☆74Updated 3 years ago
- Question-answers, collected from Google☆128Updated 4 years ago
- ☆43Updated 2 years ago
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data☆103Updated 4 years ago
- A multi-stage neural search engine for the COVID-19 Open Research Dataset☆137Updated 2 years ago
- Expose a Top2Vec model with a REST API.☆92Updated 2 years ago
- Creating class-based TF-IDF matrices☆90Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆22Updated 5 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 5 years ago
- ☆19Updated 5 years ago
- How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.☆134Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago