A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
☆41Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for text_characterization_toolkit
Users who are interested in text_characterization_toolkit are comparing it to the libraries listed below.
- YATO: Yet Another deep learning based Text analysis Open toolkit☆47Oct 11, 2023Updated 2 years ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Dec 25, 2019Updated 6 years ago
- ☆28Nov 28, 2021Updated 4 years ago
- PALI: Language identification for Perso-Arabic Scripts☆11Jul 11, 2023Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…☆17Mar 8, 2021Updated 5 years ago
- ☆64Feb 2, 2023Updated 3 years ago
- A dataset for realistic evaluation of noisy label methods☆15Dec 3, 2023Updated 2 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction☆13Apr 21, 2020Updated 6 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725☆33May 28, 2026Updated 2 weeks ago
- TAC KBP Event Argument Extraction and Linking Shared Task☆24Oct 30, 2017Updated 8 years ago
- Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"☆15Feb 21, 2024Updated 2 years ago
- ☆29Aug 27, 2025Updated 9 months ago
- Abstract Meaning Representation (AMR) Hackathon☆28Oct 8, 2018Updated 7 years ago
- ☆19Oct 14, 2021Updated 4 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 4 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆29Nov 9, 2020Updated 5 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 11 months ago
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆18May 29, 2023Updated 3 years ago
- ☆24Jun 12, 2023Updated 3 years ago
- The Art and Science of Empirical Computer Science (Fall 2022)☆21Sep 1, 2023Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago
- Workshop materials for scraping Twitter with Python☆13May 25, 2016Updated 10 years ago
- ALBERT trained on Mongolian text corpus☆19Jan 10, 2021Updated 5 years ago
- Code for "Rissanen Data Analysis: Examining Dataset Characteristics via Description Length" by Ethan Perez, Douwe Kiela, and Kyunghyun Ch…☆37Jun 10, 2021Updated 5 years ago
- Simple CORPORA list crawler☆11Dec 2, 2016Updated 9 years ago
- [NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"☆18Sep 21, 2022Updated 3 years ago
- Information Extraction Dataset Zoo.☆30Apr 9, 2022Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Jun 3, 2026Updated last week
- ☆23Apr 14, 2025Updated last year
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆27Jan 4, 2022Updated 4 years ago
- DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table)☆11Sep 28, 2016Updated 9 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Code for the paper "Discourse Representation Structure Parsing"☆41Aug 24, 2019Updated 6 years ago
- Grounded SCAN data set.☆70Jan 10, 2022Updated 4 years ago
- ☆13May 15, 2021Updated 5 years ago
- ☆16Sep 28, 2020Updated 5 years ago