w3c / ai-web-impactLinks
An analysis of the systemic impact, on the Web, of AI systems, and in particular ones based on Machine Learning models, and the role that Web standardization may play in managing that impact
☆24Updated this week
Alternatives and similar repositories for ai-web-impact
Users that are interested in ai-web-impact are comparing it to the libraries listed below
Sorting:
- image-to-text model for PDF.js☆38Updated 3 months ago
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions☆38Updated 2 months ago
- Blueprint to Build Your Own Timeline Algorithm☆58Updated 3 weeks ago
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆38Updated last month
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆19Updated last year
- A tool for detecting viruses and NSFW material in WARC files☆15Updated 10 months ago
- CLI that queries multiple language models in parallel using prompts from a CSV file☆26Updated last month
- Datasette plugin for uploading CSV files and converting them to database tables☆26Updated last year
- Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others☆29Updated last month
- Website repository for Govdirectory - a crowdsourced and fact-checked directory of official governmental online accounts and services.☆54Updated last week
- Open-source technology for creating full-stack knowledge applications for communities of all types.☆48Updated this week
- Generate embeddings for images and text using CLIP with LLM☆69Updated last year
- ☆27Updated 4 years ago
- ☆20Updated last month
- Git scrapers for scraping the fediverse☆17Updated this week
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Extract metadata from a video to an sqlite database☆20Updated last year
- A collection of rust algorithms and data structures☆67Updated last week
- Track changes to GraphQL APIs by git scraping their schemas☆28Updated 2 months ago
- Data from the Bloomberg News analysis on streamers and podcasters on YouTube☆23Updated 5 months ago
- Access llamafile localhost models via LLM☆20Updated last year
- A lightweight Python utility that aggregates and exports comprehensive system information to JSON, specifically designed for feeding syst…☆13Updated 2 months ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆45Updated this week
- Quels élus de la République (députés, ministres, maires) utilisent toujours x.com ?☆13Updated 2 weeks ago
- A Python tool to search for and remove duplicated files in messy datasets☆16Updated 6 months ago
- A polite and user-friendly downloader for Common Crawl data☆48Updated last month
- Metadata management and dissemination system for Open Access books☆53Updated this week
- Create embeddings for LLM using the Nomic API☆23Updated 7 months ago
- spaCy extension for Visual Studio Code☆32Updated 3 months ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆39Updated this week