simonw / nicar-2025-scrapingLinks
Cutting-edge web scraping techniques workshop at NICAR 2025
☆374Updated 10 months ago
Alternatives and similar repositories for nicar-2025-scraping
Users that are interested in nicar-2025-scraping are comparing it to the libraries listed below
Sorting:
- Data from the Bloomberg News analysis on streamers and podcasters on YouTube☆25Updated last year
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆744Updated 3 months ago
- CLI tool for stripping tags from HTML☆355Updated 11 months ago
- Template repository for setting up a new git scraper☆123Updated 3 months ago
- https://verdad.app☆85Updated this week
- Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.☆606Updated 10 months ago
- Tools for LIL's data preservation project☆126Updated 4 months ago
- CleverBee - The Open Source Deep Researcher Tool☆310Updated this week
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆517Updated 4 months ago
- Examples and guides for using the VLM Run API☆305Updated 3 weeks ago
- Multimodal RAG to search and interact locally with technical documents of any kind☆284Updated last week
- UV kernel for Jupyter☆460Updated 8 months ago
- A Twitter, Mastodon, and BlueSky bot that shares new interactive, graphic, and data vis stories from newsrooms around the world☆58Updated this week
- Good books, good vibes☆431Updated 2 years ago
- A scientific instrument for investigating latent spaces☆749Updated 2 months ago
- A simple Python 3.13 dev container☆40Updated 8 months ago
- ☆21Updated last month
- clean & curate your data with LLMs.☆489Updated last year
- LLM plugin to access Google's Gemini family of models☆421Updated last month
- LLM plugin providing access to models running on an Ollama server☆351Updated last month
- A terminal based book tracking tool☆210Updated last month
- Tools to build your own "taskmaster"☆161Updated 5 months ago
- Code used to create text embeddings of all Magic: The Gathering cards.☆59Updated 11 months ago
- Mapping the French Culinary Universe☆50Updated 10 months ago
- Spegel - Reflect the web through AI☆333Updated last week
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework☆341Updated last year
- OpenAI's Structured Outputs with Logprobs☆201Updated 8 months ago
- This is a framework that implements various parallel reasoning strategies from the literature☆275Updated last month
- Visualise your CSV files in seconds without sending your data anywhere☆516Updated last week
- Import unstructured data (text and images) into structured tables☆165Updated last month