A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
☆41 · Aug 18, 2022 · Updated 3 years ago
Alternatives and similar repositories for text_characterization_toolkit
Users who are interested in text_characterization_toolkit are comparing it to the libraries listed below.
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT. ☆15 · Dec 25, 2019 · Updated 6 years ago
- ☆28 · Nov 28, 2021 · Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 · Aug 19, 2022 · Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 · Jan 12, 2023 · Updated 3 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection ☆15 · Aug 20, 2021 · Updated 4 years ago
- A simple Tensorflow implementation of https://arxiv.org/abs/1906.04985 ☆13 · May 16, 2019 · Updated 6 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction ☆13 · Apr 21, 2020 · Updated 5 years ago
- A dataset for realistic evaluation of noisy label methods ☆14 · Dec 3, 2023 · Updated 2 years ago
- Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning" ☆15 · Feb 21, 2024 · Updated 2 years ago
- ☆64 · Feb 2, 2023 · Updated 3 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts" ☆19 · May 29, 2023 · Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations… ☆21 · Dec 21, 2022 · Updated 3 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725 ☆25 · Feb 13, 2026 · Updated 2 weeks ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 · May 10, 2022 · Updated 3 years ago
- ☆24 · Jun 12, 2023 · Updated 2 years ago
- Code for paper "Prompt-Based Metric Learning for Few-shot NER". ☆23 · Nov 14, 2023 · Updated 2 years ago
- The code for lifelong few-shot language learning ☆55 · Feb 17, 2022 · Updated 4 years ago
- The Art and Science of Empirical Computer Science (Fall 2022) ☆21 · Sep 1, 2023 · Updated 2 years ago
- Simple Attentive Reader Code ☆29 · Jun 10, 2016 · Updated 9 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 · Oct 3, 2021 · Updated 4 years ago
- A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo… ☆32 · Oct 20, 2023 · Updated 2 years ago
- Information Extraction Dataset Zoo. ☆30 · Apr 9, 2022 · Updated 3 years ago
- Creation of a Fantasy Premier League data pipeline for analysis of both team & player performance. Technologies include, dbt, Prefect, Te… ☆11 · Apr 13, 2023 · Updated 2 years ago
- Comprehensive evaluation framework for Open Information Extraction. ☆40 · Jun 21, 2022 · Updated 3 years ago
- Open Use of Data Agreement - Removing Barriers to Data Innovation ☆18 · Aug 11, 2021 · Updated 4 years ago
- Python library for Myra ☆10 · Jan 21, 2019 · Updated 7 years ago
- Repository of IPBench ☆19 · Jan 4, 2026 · Updated last month
- [ACL'20] Highway Transformer: A Gated Transformer. ☆33 · Dec 5, 2021 · Updated 4 years ago
- Replication code for "With Little Power Comes Great Responsibility" ☆39 · Oct 15, 2020 · Updated 5 years ago
- Crawlers for the Taiga Corpus and Taiga Parser project, downloading resources from open sources ☆14 · Apr 9, 2019 · Updated 6 years ago
- Topic modelling ☆13 · Aug 16, 2019 · Updated 6 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms" ☆23 · Oct 23, 2025 · Updated 4 months ago
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- scrape web content into readable markdown for llms and human readers ☆10 · Feb 19, 2024 · Updated 2 years ago
- Replication package for "Fine-grained prediction of food crises from news streams" ☆10 · Jun 27, 2023 · Updated 2 years ago
- C# implementation of Peter Norvig’s spelling corrector ☆10 · Feb 24, 2023 · Updated 3 years ago
- Narration Studio, your all in one TTS Solution! ☆27 · Feb 19, 2026 · Updated last week
- GBM implementation on Legate ☆14 · Jan 28, 2026 · Updated last month
- Workshop materials for scraping Twitter with Python ☆13 · May 25, 2016 · Updated 9 years ago