simonw / nicar-2025-scrapingLinks
Cutting-edge web scraping techniques workshop at NICAR 2025
☆363Updated 4 months ago
Alternatives and similar repositories for nicar-2025-scraping
Users that are interested in nicar-2025-scraping are comparing it to the libraries listed below
Sorting:
- Data from the Bloomberg News analysis on streamers and podcasters on YouTube☆23Updated 6 months ago
- Template repository for setting up a new git scraper☆110Updated 5 months ago
- Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.☆604Updated 4 months ago
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆461Updated this week
- CLI tool for stripping tags from HTML☆339Updated 5 months ago
- Tools for LIL's data preservation project☆125Updated 5 months ago
- Free travel times between U.S. Census geographies☆157Updated 4 months ago
- Examples and guides for using the VLM Run API☆286Updated 3 weeks ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆657Updated 2 weeks ago
- https://verdad.app☆83Updated 7 months ago
- Import unstructured data (text and images) into structured tables☆153Updated 3 months ago
- A playbook for effectively prompting post-trained LLMs☆889Updated 6 months ago
- A scientific instrument for investigating latent spaces☆718Updated 2 months ago
- CleverBee - The Open Source Deep Researcher Tool☆302Updated 2 months ago
- Multimodal RAG to search and interact locally with technical documents of any kind☆242Updated last week
- Mapping the French Culinary Universe☆48Updated 5 months ago
- A Twitter, Mastodon, and BlueSky bot that shares new interactive, graphic, and data vis stories from newsrooms around the world☆58Updated this week
- Fully neural approach for text chunking☆367Updated 3 months ago
- LLM plugin providing access to models running on an Ollama server☆328Updated last week
- LLM plugin to access Google's Gemini family of models☆367Updated 2 weeks ago
- UV kernel for Jupyter☆443Updated 2 months ago
- Visualise your CSV files in seconds without sending your data anywhere☆511Updated last month
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework☆342Updated 8 months ago
- Browser-LLM Auto-Scaling Technology☆540Updated this week
- ☆495Updated 2 months ago
- Code used to create text embeddings of all Magic: The Gathering cards.☆55Updated 5 months ago
- Spegel - Reflect the web through AI☆310Updated 3 weeks ago
- Count and truncate text based on tokens☆366Updated last year
- Good books, good vibes☆430Updated last year
- a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race☆78Updated 3 months ago