☆23Mar 31, 2024Updated last year
Alternatives and similar repositories for OpenWebTextCorpus
Users that are interested in OpenWebTextCorpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Feb 3, 2025Updated last year
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- IDE backend for Julia☆27Dec 8, 2015Updated 10 years ago
- Objective C/Swift compatible implementation of autocorrect for iOS.☆11Aug 31, 2017Updated 8 years ago
- 2019 Toronto Datathon https://www.tdothealthhack.com☆11Oct 4, 2019Updated 6 years ago
- Python + OpenCV script to detect playing cards in an image. It uses template matching.☆13Jan 24, 2017Updated 9 years ago
- Ginkgo SARS-CoV-2 synthesis efforts: overviews and data☆13Jun 7, 2020Updated 5 years ago
- Web-based database system for flow cell management (incl. REST API)☆16Mar 7, 2024Updated 2 years ago
- Resources for grounding protein families and complexes from text and describing their hierarchical relationships.☆18Jul 10, 2024Updated last year
- Modeling the allocation of resources to markets based on the restraints of objective functions☆14Mar 15, 2016Updated 10 years ago
- ☆12Apr 15, 2022Updated 3 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- A HTTP2 support library for Julia☆11Jan 9, 2019Updated 7 years ago
- A multi-label classification plugin for AllenNLP.☆11Jan 13, 2023Updated 3 years ago
- The ZEIT ONLINE content API☆29Jul 21, 2021Updated 4 years ago
- Python package to crawl the publicly available forms filed with the Securities and Exchange Commission (SEC) under the new Electronic Dat…☆16Aug 2, 2013Updated 12 years ago
- ☆13Oct 13, 2022Updated 3 years ago
- Extracting gene sets from published pathway figures☆15May 23, 2023Updated 2 years ago
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆21Dec 15, 2022Updated 3 years ago
- An optimal space run-length Burrows-Wheeler transform full-text index☆27Oct 28, 2021Updated 4 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- Free Dutch voice dataset☆12Jan 28, 2021Updated 5 years ago
- Simple, Fast, Scalable , production grade dashboard application . Right solution for team☆14Jul 26, 2024Updated last year
- hgvslib provides functions to parse and compare the equivalency of variant strings described according to Human Genome Variation Society …☆18Dec 26, 2022Updated 3 years ago
- Wikipedia Live Monitor☆22Dec 21, 2024Updated last year
- [EMNLP 2021] Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning☆17Jun 28, 2025Updated 8 months ago
- A holding place for Base graphics☆13Mar 1, 2026Updated 3 weeks ago
- ☆23Apr 24, 2013Updated 12 years ago
- A web application for playing 20 Questions to crowdsource common sense. 🤖☆16Sep 29, 2022Updated 3 years ago
- Experimental gazelle extension for scala☆15Jan 30, 2026Updated last month
- This is not a offical repo clone.☆18Jun 19, 2015Updated 10 years ago
- A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne…☆34Apr 18, 2017Updated 8 years ago
- (Machine) Learning to Do More with Less☆14Jun 11, 2018Updated 7 years ago
- Visualizing Intergenerational Wealth Mobility and Racial Inequality☆10Mar 21, 2019Updated 7 years ago
- CoNLL-U format library for Python☆15Apr 7, 2015Updated 10 years ago
- SciAI is an extension for the text editors like Google Docs to create structured semantic biomedical papers directly at the moment of wri…☆22Jul 11, 2023Updated 2 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction on TPU☆17Feb 24, 2019Updated 7 years ago
- ☆69Oct 5, 2022Updated 3 years ago
- Unsupervised domain adaptation method for relation extraction☆18Jul 16, 2018Updated 7 years ago