tarekziade / mwcat
MediaWiki Categories Model
☆13 · Updated last year
Alternatives and similar repositories for mwcat
Users who are interested in mwcat are comparing it to the libraries listed below.
- image-to-text model for PDF.js ☆49 · Updated 9 months ago
- The NLP Bias Identification Toolkit ☆39 · Updated 2 years ago
- Explain and validate SQL queries as you type them into Datasette ☆12 · Updated last year
- Extract networks of entities from journalistic reporting ☆49 · Updated 2 years ago
- Datasette enrichment for analyzing row data using OpenAI's GPT models ☆21 · Updated last year
- 🕊️ Radically lightweight command-line interfaces ☆108 · Updated 3 months ago
- MoodCat😼 classifies the mood of English sentences. ☆14 · Updated 3 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable ☆13 · Updated 2 years ago
- It's a cooler way to store simple linear models. ☆27 · Updated last year
- Platform for journalists to search, analyse, categorise and share unstructured data ☆56 · Updated last week
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track… ☆119 · Updated 9 months ago
- Datasette plugin for uploading CSV files and converting them to database tables ☆27 · Updated last month
- Your buddy in the (L)LM space. ☆64 · Updated last year
- CLI that queries multiple language models in parallel using prompts from a CSV file ☆28 · Updated 3 months ago
- Python package for extractive NLP using the OpenAI API ☆17 · Updated last year
- Datasette pre-configured with useful plugins. Experimental alpha. ☆29 · Updated 2 months ago
- Taupe takes a downloaded Twitter archive ZIP file, extracts the URLs corresponding to tweets, retweets, replies, quote tweets, and liked … ☆33 · Updated 2 years ago
- Datasette plugin for searching all searchable tables at once ☆28 · Updated last month
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆154 · Updated last week
- 🔤 Measure edit distance based on keyboard layout ☆63 · Updated 2 months ago
- Datasette plugin to create interactive dashboards ☆168 · Updated last week
- A collection of prompts for use with the LLM CLI tool ☆17 · Updated 2 years ago
- Continual pretraining of foundation LLM using ⚡ Lightning Fabric ☆37 · Updated last year
- A helper library full of URL-related heuristics. ☆73 · Updated 3 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals … ☆15 · Updated last year
- A whirlwind tour of Common Crawl's data using Python ☆31 · Updated last month
- LLM plugin for embeddings using sentence-transformers ☆73 · Updated 8 months ago
- Just another sentiment wrapper. ☆18 · Updated 4 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org ☆42 · Updated last week
- Blazing fast topic modelling for short texts. ☆34 · Updated 2 months ago