commoncrawl / whirlwind-python
A whirlwind tour of Common Crawl's data using Python
☆31 · Updated last month
Alternatives and similar repositories for whirlwind-python
Users who are interested in whirlwind-python are comparing it to the libraries listed below.
- Quality News - Towards a fairer ranking formula for Hacker News ☆84 · Updated 2 months ago
- LLM plugin for pulling content from Hacker News ☆123 · Updated 7 months ago
- Datasette plugin for searching all searchable tables at once ☆27 · Updated last month
- Concatenated documentation for use with LLMs ☆47 · Updated last week
- Blueprint by Mozilla.ai for answering questions about structured documents ☆37 · Updated 9 months ago
- LLM plugin for clustering embeddings ☆82 · Updated last year
- Official Python API client library for turbopuffer ☆95 · Updated this week
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser? ☆109 · Updated 8 months ago
- llm plugin for Cerebras fast inference API ☆34 · Updated 4 months ago
- LLM plugin for embeddings using sentence-transformers ☆73 · Updated 8 months ago
- "llm python" is a command to run a Python interpreter in the LLM virtual environment ☆36 · Updated 2 years ago
- Code to reproduce the Hacker News users fingerprinting with Burrows method ☆52 · Updated 8 months ago
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠 ☆245 · Updated 3 weeks ago
- Analyzing hacker news in real-time with Bytewax and Proton ☆39 · Updated last year
- arXiv fragment loader plugin for https://llm.datasette.io/ ☆16 · Updated 7 months ago
- A probabilistic approximate DNF counter ☆39 · Updated 3 weeks ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆238 · Updated 2 years ago
- Ask questions, let GPT do the SQL. ☆133 · Updated 2 years ago
- Ask questions of your data with LLM assistance ☆67 · Updated last year
- progscrape.com source ☆96 · Updated 3 months ago
- Self-updating MCP server to cross-ref latest official pip, conda, poetry, uv, pixi, and pdm docs ☆42 · Updated last week
- Exploring Hacker News by mapping and analyzing 40 million posts and comments for fun ☆202 · Updated 7 months ago
- A repository fully generated by ChatGPT making it believed it checked out a this repository which I described like the first line of the … ☆120 · Updated 3 years ago
- Embedding models from Jina AI ☆65 · Updated last year
- Write and execute jq programs with the help of LLM ☆191 · Updated last year
- LLM tools for running queries against SQLite ☆44 · Updated 6 months ago
- Document conversion and processing engine ☆45 · Updated 10 months ago
- Extremely memory-efficient vector database ☆76 · Updated last year
- Your buddy in the (L)LM space. ☆64 · Updated last year
- Official Rust Implementation of Model2Vec ☆144 · Updated 2 months ago