[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
☆149 · Dec 3, 2024 · Updated last year
Alternatives and similar repositories for lftk
Users who are interested in lftk are comparing it to the libraries listed below
- [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment ☆132 · Mar 7, 2023 · Updated 3 years ago
- Repository for the CommonLit Ease of Readability Corpus ☆24 · Apr 17, 2024 · Updated last year
- ☆14 · Apr 10, 2024 · Updated last year
- This is a simple Python package for calculating a variety of lexical diversity indices ☆81 · Sep 15, 2023 · Updated 2 years ago
- A simple tool which migrates all your notion notes to be obsidian ready ☆14 · Apr 11, 2023 · Updated 2 years ago
- Code used for the paper "Linguistic Features for Readability Assessment" (Deutsch, Jasbi, and Shieber 2020) ☆25 · Jul 19, 2021 · Updated 4 years ago
- A Chinese text readability grading model based on multi-level linguistic feature fusion ☆12 · Feb 27, 2024 · Updated 2 years ago
- Experiments with representation engineering ☆14 · Feb 28, 2024 · Updated 2 years ago
- An easy-to-use library to extract indices from texts. ☆30 · Sep 7, 2021 · Updated 4 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn… ☆13 · Apr 8, 2022 · Updated 3 years ago
- A self-updating GitHub profile 🐯 ☆15 · Feb 27, 2026 · Updated last week
- Dataset containing scroll interactions of 598 participants reading advanced and elementary texts from the OneStopEnglish corpus ☆16 · Dec 27, 2021 · Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 · Feb 7, 2023 · Updated 3 years ago
- Extract text from your DOCX documents. ☆11 · Feb 10, 2024 · Updated 2 years ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity ☆31 · Nov 4, 2023 · Updated 2 years ago
- Copy or open the Obsidian Publish URL of a note. You can also open its Git commit history on GitHub. ☆22 · Oct 24, 2023 · Updated 2 years ago
- A simple toolkit for conducting analyses using corpus methods ☆27 · Nov 11, 2021 · Updated 4 years ago
- A lil project to sync Twitter bookmarks with an Obsidian vault. ☆22 · Feb 4, 2023 · Updated 3 years ago
- The repository contains the dataset and the code of the paper: Document-Level Text Simplification: Dataset, Metric and Model. ☆26 · Jun 2, 2023 · Updated 2 years ago
- Streamline is a stream-of-consciousness writer for Obsidian ☆35 · Apr 20, 2023 · Updated 2 years ago
- Semantic QA with a markdown database: Query any markdown file using vector embedding, Pinecone vector database and GPT (langchain). A wea… ☆32 · May 27, 2023 · Updated 2 years ago
- Get the latest feed of GitHub Stars out there! 🌟 ⭐ ✨ ☆32 · Oct 4, 2021 · Updated 4 years ago
- Corpus of Annotations for Misspellings ☆28 · Jul 31, 2023 · Updated 2 years ago
- A sample Java gRPC client for the Salesforce Pub/Sub API ☆12 · Oct 9, 2024 · Updated last year
- spaCy extension for Visual Studio Code ☆32 · Mar 10, 2025 · Updated 11 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- This repository contains all resources from the paper "Introducing MBIB - the first Media Bias Identification Benchmark Task and Dataset … ☆34 · Feb 26, 2024 · Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆69 · Sep 14, 2021 · Updated 4 years ago
- This is an Obsidian plugin to allow easily sideloading other plugins. ☆33 · Aug 11, 2023 · Updated 2 years ago
- Exploratory Data Analysis of Time Series Data and Forecasting using Naïve Approach, Moving Average Method, Simple Exponential Smoothenin… ☆12 · Jul 2, 2018 · Updated 7 years ago
- Basic operations prototype/syntax for developers ☆12 · Mar 12, 2023 · Updated 2 years ago
- This is a demo project that compares two web scraping frameworks, Playwright and Selenium, using the new pipelining tool Dagster ☆15 · Sep 9, 2021 · Updated 4 years ago
- A Python package to automate downloads of Salesforce Weekly Data Exports ☆10 · Jan 26, 2021 · Updated 5 years ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily ☆15 · Jan 6, 2025 · Updated last year
- Fivetran's Salesforce source dbt package ☆13 · Oct 1, 2025 · Updated 5 months ago
- Generative and Parametric design code: featuring Processing / Python / Javascript / HTML / CSS ☆14 · Nov 4, 2020 · Updated 5 years ago
- ☆12 · Apr 26, 2020 · Updated 5 years ago
- 🦄 Terminull is a GitBook plugin that allows you to create a modern terminal for your GitBook pages in order to document your commands and it… ☆12 · Sep 4, 2022 · Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆40 · Nov 13, 2025 · Updated 3 months ago