Skylion007 / OpenWebTextCorpus
☆21Updated last year
Alternatives and similar repositories for OpenWebTextCorpus:
Users who are interested in OpenWebTextCorpus are comparing it to the libraries listed below
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 10 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Updated 3 years ago
- Neural Network for Automatic Negation Detection☆20Updated 8 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆72Updated last year
- Concept2vec Metrics for Evaluating Quality of Embeddings for Ontological Concepts☆14Updated 6 years ago
- ☆33Updated 3 years ago
- A web application for tagging and retrieval of arguments in text☆28Updated last year
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab…☆15Updated 8 years ago
- ADS Project☆14Updated 9 years ago
- word vector library☆34Updated 5 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 8 months ago
- ☆14Updated 4 years ago
- ☆70Updated 2 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- Dictionaries of names, surnames, acronyms and their extensions, stop-words, etc., which I gathered for different experiments.☆28Updated 8 years ago
- Entity Linking for the masses☆56Updated 9 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- Analyzes news stories for event schemas and templates.☆17Updated 9 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- Deep neural parser for database query☆18Updated 2 years ago
- Neural Vector Space Models☆49Updated 6 years ago
- ☆40Updated 7 years ago