The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"
☆21Nov 10, 2020Updated 5 years ago
Alternatives and similar repositories for pretraining-learning-curves
Users that are interested in pretraining-learning-curves are comparing it to the libraries listed below
Sorting:
- This repository contains code for the paper "Are Pretrained Language Models Symbolic Reasoners over Knowledge?"☆13Mar 23, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU☆17Jun 27, 2021Updated 4 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- VSCode extension for working with Architecture As A Code in the C4 model. Includes syntax highlighting, diagram preview, and tools for wo…☆32Feb 25, 2026Updated last week
- Gentle and praatio scripts for easy forced alignment☆18Oct 27, 2022Updated 3 years ago
- A toolbox for inference of mixture models☆16May 29, 2023Updated 2 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Learn models that are robust to spurious correlations in the dataset.☆26Dec 31, 2019Updated 6 years ago
- Parallel data preprocessing for NLP and ML.☆34Nov 1, 2024Updated last year
- Matrix tools for building and inspecting latent spaces☆27Aug 19, 2018Updated 7 years ago
- COre Variable Feature Extraction Feature Extractor☆30Dec 8, 2022Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- ☆17Feb 21, 2026Updated 2 weeks ago
- ☆12Sep 22, 2015Updated 10 years ago
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated 11 months ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- The current release version of QN-ACTR cognitive architecture and models☆11Sep 25, 2019Updated 6 years ago
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.☆77Jul 9, 2021Updated 4 years ago
- ☆39Jan 9, 2023Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- BiasFinder | IEEE TSE | Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems☆11Jan 18, 2022Updated 4 years ago
- Home sharing app for Hillary Clinton supporters☆10Jan 6, 2017Updated 9 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Jan 8, 2016Updated 10 years ago
- Collection of my implementations of computational models of cognition☆11Nov 20, 2023Updated 2 years ago
- Repository for the Machine Learning Failure Mode and Effects Analysis (ML FMEA) Template. The ML FMEA is detailed within the SAE World C…☆11Apr 3, 2025Updated 11 months ago
- ☆14Mar 21, 2024Updated last year
- Multitaper R package available on CRAN☆10Jul 17, 2024Updated last year
- AutoBench: Benchmarking Automation for Intelligent Document Processing (IDP) with confidence☆11Mar 18, 2025Updated 11 months ago
- Free programming language books☆10Jun 4, 2020Updated 5 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Reinforcement Learning Recommender System suggesting relevant scientific services to appropriate researchers☆11Aug 29, 2024Updated last year
- ☆46Apr 13, 2022Updated 3 years ago
- A collection of English tweets annotated in Universal Dependencies.☆39Oct 20, 2021Updated 4 years ago
- Helper scripts and notes that were used while porting various nlp models☆49Mar 22, 2022Updated 3 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Dec 9, 2020Updated 5 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago