A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
☆41Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for text_characterization_toolkit
Users that are interested in text_characterization_toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- YATO: Yet Another deep learning based Text analysis Open toolkit☆47Oct 11, 2023Updated 2 years ago
- A simple Tensorflow implementation of https://arxiv.org/abs/1906.04985☆13May 16, 2019Updated 6 years ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Dec 25, 2019Updated 6 years ago
- ☆28Nov 28, 2021Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction☆13Apr 21, 2020Updated 6 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725☆31Apr 17, 2026Updated 2 weeks ago
- TAC KBP Event Argument Extraction and Linking Shared Task☆24Oct 30, 2017Updated 8 years ago
- Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"☆15Feb 21, 2024Updated 2 years ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated 3 months ago
- ☆28Aug 27, 2025Updated 8 months ago
- Abstract Meaning Representation (AMR) Hackathon☆28Oct 8, 2018Updated 7 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- Code for pre-training BabyLM baseline models.☆16Jun 19, 2023Updated 2 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Nov 9, 2020Updated 5 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 10 months ago
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- The Art and Science of Empirical Computer Science (Fall 2022)☆21Sep 1, 2023Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Workshop materials for scraping Twitter with Python☆13May 25, 2016Updated 9 years ago
- ALBERT trained on Mongolian text corpus☆19Jan 10, 2021Updated 5 years ago
- Code for "Rissanen Data Analysis: Examining Dataset Characteristics via Description Length" by Ethan Perez, Douwe Kiela, and Kyungyhun Ch…☆37Jun 10, 2021Updated 4 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85May 10, 2022Updated 3 years ago
- Information Extraction Dataset Zoo.☆30Apr 9, 2022Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Mar 30, 2026Updated last month
- ☆22Apr 14, 2025Updated last year
- A lightweight, regex-based lexer framework for Python☆12May 16, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- The codes to the paper "Discourse Representation Structure Parsing"☆41Aug 24, 2019Updated 6 years ago
- A Docker image for the CLIFF geolocation software.☆10Jun 12, 2018Updated 7 years ago
- ☆13May 15, 2021Updated 4 years ago
- Server side API for QANTA quiz bowl system☆10Jan 31, 2019Updated 7 years ago
- LLM play 20questions with itself☆13Mar 31, 2023Updated 3 years ago
- A Seq2Seq with attention and copy mechanism for sentence summarization☆13Mar 11, 2019Updated 7 years ago