simonw / nicar-2025-scrapingLinks
Cutting-edge web scraping techniques workshop at NICAR 2025
☆356Updated 4 months ago
Alternatives and similar repositories for nicar-2025-scraping
Users that are interested in nicar-2025-scraping are comparing it to the libraries listed below
Sorting:
- Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.☆601Updated 4 months ago
- Template repository for setting up a new git scraper☆106Updated 4 months ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆609Updated last week
- CLI tool for stripping tags from HTML☆337Updated 4 months ago
- Free travel times between U.S. Census geographies☆155Updated 4 months ago
- Examples and guides for using the VLM Run API☆283Updated this week
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆444Updated 3 weeks ago
- A playbook for effectively prompting post-trained LLMs☆884Updated 5 months ago
- Tools for LIL's data preservation project☆125Updated 4 months ago
- Mapping the French Culinary Universe☆48Updated 4 months ago
- CleverBee - The Open Source Deep Researcher Tool☆301Updated last month
- Data from the Bloomberg News analysis on streamers and podcasters on YouTube☆23Updated 5 months ago
- OpenAI's Structured Outputs with Logprobs☆176Updated last month
- Turn docstrings into LLM-functions☆496Updated 3 months ago
- Import unstructured data (text and images) into structured tables☆153Updated 3 months ago
- A scientific instrument for investigating latent spaces☆712Updated 2 months ago
- Fully neural approach for text chunking☆367Updated 2 months ago
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.☆257Updated 5 months ago
- A project providing a Graphic Walker Pane for use with HoloViz Panel.☆323Updated 3 months ago
- UV kernel for Jupyter☆442Updated last month
- ai for jq☆243Updated 9 months ago
- LLM plugin providing access to models running on an Ollama server☆322Updated 2 weeks ago
- https://verdad.app☆82Updated 6 months ago
- Yes Mcp server in bash☆466Updated last month
- Visualise your CSV files in seconds without sending your data anywhere☆511Updated last month
- Tools to build your own "taskmaster"☆108Updated this week
- A command-line book tracking tool☆205Updated last month
- Lightweight Nearest Neighbors with Flexible Backends☆294Updated this week
- ☆488Updated last month
- A hub for various industry-specific schemas to be used with VLMs.☆525Updated last month