tarekziade / mwcat
MediaWiki Categories Model
☆12 Updated last year
Alternatives and similar repositories for mwcat
Users who are interested in mwcat compare it to the libraries listed below.
- The NLP Bias Identification Toolkit ☆36 Updated last year
- A whirlwind tour of Common Crawl's data using Python ☆17 Updated 4 months ago
- 🌸 Train floret vectors ☆18 Updated 2 years ago
- Continual pretraining of foundation LLM using ⚡ Lightning Fabric ☆34 Updated 5 months ago
- Adding Marimo to Datasette ☆20 Updated last month
- Hosting examples of interactive datamapplot output ☆21 Updated 7 months ago
- Create embeddings for LLM using the Nomic API ☆23 Updated 5 months ago
- LLM plugin for embeddings using sentence-transformers ☆60 Updated 3 weeks ago
- A repository of instructions in French to fine-tune LLMs ☆17 Updated last year
- Turn your git commit history into a scientific log ☆46 Updated 3 months ago
- Knowledge pills on Neural Search ☆26 Updated 2 years ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps ☆56 Updated last year
- Extract networks of entities from journalistic reporting ☆48 Updated last year
- CLI that queries multiple language models in parallel using prompts from a CSV file ☆26 Updated last week
- The LLM plugins directory ☆42 Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies ☆17 Updated last month
- Git scrapers for scraping the fediverse ☆16 Updated this week
- Joulehunter helps you find what part of your code is consuming considerable amounts of energy. ☆11 Updated 2 years ago
- Small python package to measure OCR quality and other related metrics. ☆21 Updated last year
- Load GitHub repository contents as LLM fragments ☆37 Updated this week
- tsellm: LLMs in SQLite and DuckDB ☆23 Updated 3 weeks ago
- Plugin for LLM adding a Markov chain generating model ☆19 Updated 10 months ago
- Datasette plugin for searching all searchable tables at once ☆24 Updated 8 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated last year
- Datasette plugin providing instructions for exporting data to Jupyter or Observable ☆12 Updated last year
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- A markdown wiki and dashboarding system for Datasette ☆21 Updated 3 years ago
- image-to-text model for PDF.js ☆36 Updated 2 months ago
- A helper library full of URL-related heuristics. ☆69 Updated last month
- Fetches security vulnerabilities and creates pip-constraints based on them. ☆12 Updated 3 months ago