A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
☆41 · Aug 18, 2022 · Updated 3 years ago
Alternatives and similar repositories for text_characterization_toolkit
Users that are interested in text_characterization_toolkit are comparing it to the libraries listed below.
- YATO: Yet Another deep learning based Text analysis Open toolkit ☆47 · Oct 11, 2023 · Updated 2 years ago
- A simple Tensorflow implementation of https://arxiv.org/abs/1906.04985 ☆13 · May 16, 2019 · Updated 6 years ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT. ☆15 · Dec 25, 2019 · Updated 6 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 · Aug 19, 2022 · Updated 3 years ago
- ☆64 · Feb 2, 2023 · Updated 3 years ago
- A dataset for realistic evaluation of noisy label methods ☆14 · Dec 3, 2023 · Updated 2 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction ☆13 · Apr 21, 2020 · Updated 5 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725 ☆25 · Updated this week
- TAC KBP Event Argument Extraction and Linking Shared Task ☆24 · Oct 30, 2017 · Updated 8 years ago
- ☆28 · Aug 27, 2025 · Updated 6 months ago
- Abstract Meaning Representation (AMR) Hackathon ☆28 · Oct 8, 2018 · Updated 7 years ago
- ☆19 · Oct 14, 2021 · Updated 4 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation. ☆11 · May 26, 2022 · Updated 3 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words" ☆18 · Aug 17, 2021 · Updated 4 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels" ☆30 · Nov 9, 2020 · Updated 5 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 · Jun 30, 2025 · Updated 8 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts" ☆19 · May 29, 2023 · Updated 2 years ago
- ☆24 · Jun 12, 2023 · Updated 2 years ago
- The Art and Science of Empirical Computer Science (Fall 2022) ☆21 · Sep 1, 2023 · Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection ☆15 · Aug 20, 2021 · Updated 4 years ago
- [NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training" ☆18 · Sep 21, 2022 · Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 · May 10, 2022 · Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆91 · Feb 12, 2026 · Updated last month
- A lightweight, regex-based lexer framework for Python ☆12 · May 16, 2017 · Updated 8 years ago
- DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table) ☆11 · Sep 28, 2016 · Updated 9 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 · Mar 30, 2023 · Updated 2 years ago
- The codes to the paper "Discourse Representation Structure Parsing" ☆41 · Aug 24, 2019 · Updated 6 years ago
- ☆13 · May 15, 2021 · Updated 4 years ago
- Grounded SCAN data set. ☆70 · Jan 10, 2022 · Updated 4 years ago
- LLM play 20questions with itself ☆13 · Mar 31, 2023 · Updated 2 years ago
- ☆15 · Sep 28, 2020 · Updated 5 years ago
- A Seq2Seq with attention and copy mechanism for sentence summarization ☆13 · Mar 11, 2019 · Updated 7 years ago
- C# implementation of Peter Norvig’s spelling corrector ☆10 · Feb 24, 2023 · Updated 3 years ago
- Open Use of Data Agreement - Removing Barriers to Data Innovation ☆18 · Aug 11, 2021 · Updated 4 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations… ☆21 · Dec 21, 2022 · Updated 3 years ago
- Python package for Geometric / Clifford Algebra with Pytorch. ☆14 · Jan 25, 2026 · Updated last month
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆55 · May 5, 2023 · Updated 2 years ago
- A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo… ☆32 · Oct 20, 2023 · Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆138 · Apr 30, 2024 · Updated last year