tarekziade / mwcat
MediaWiki Categories Model
☆12 · Updated last year
Alternatives and similar repositories for mwcat
Users who are interested in mwcat are comparing it to the libraries listed below.
- image-to-text model for PDF.js ☆47 · Updated 7 months ago
- The NLP Bias Identification Toolkit ☆39 · Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆20 · Updated 3 months ago
- A repository of instructions in French to fine-tune LLMs ☆17 · Updated 2 years ago
- 🌸 Train floret vectors ☆18 · Updated 2 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable ☆13 · Updated 2 years ago
- A helper library full of URL-related heuristics. ☆73 · Updated last month
- Datasette enrichment for analyzing row data using OpenAI's GPT models ☆21 · Updated last year
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track… ☆119 · Updated 7 months ago
- 🕊️ Radically lightweight command-line interfaces ☆109 · Updated 2 months ago
- Extract networks of entities from journalistic reporting ☆48 · Updated 2 years ago
- spaCy entry points for Curated Transformers ☆32 · Updated 5 months ago
- It's a cooler way to store simple linear models. ☆27 · Updated last year
- ☆67 · Updated last year
- Datasette plugin for uploading CSV files and converting them to database tables ☆27 · Updated last year
- A Streamlit application to visualize sentence embeddings ☆18 · Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 · Updated last year
- spaCy extension for Visual Studio Code ☆31 · Updated 7 months ago
- MoodCat😼 classifies the mood of English sentences. ☆14 · Updated 3 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆149 · Updated last week
- ☆71 · Updated 9 months ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph ☆40 · Updated last year
- Generate random passphrases ☆34 · Updated 3 months ago
- Open source text annotation software created by the French supreme court 'Cour de cassation' ☆23 · Updated last month
- Alternative robots parser module for Python ☆20 · Updated 2 months ago
- Python based Wikidata framework for easy dataframe extraction ☆45 · Updated last year
- Python module for OpenPGP written in Rust. ☆54 · Updated 7 months ago
- Explain and validate SQL queries as you type them into Datasette ☆12 · Updated last year
- Scripts to maintain German law git repository ☆120 · Updated last year
- A markdown wiki and dashboarding system for Datasette ☆21 · Updated 4 years ago