commoncrawl / whirlwind-pythonLinks
A whirlwind tour of Common Crawl's data using Python
☆33Updated this week
Alternatives and similar repositories for whirlwind-python
Users that are interested in whirlwind-python are comparing it to the libraries listed below
Sorting:
- Quality News - Towards a fairer ranking formula for Hacker News☆83Updated 3 months ago
- arXiv fragment loader plugin for https://llm.datasette.io/☆16Updated 8 months ago
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠☆247Updated 2 weeks ago
- llm plugin for Cerebras fast inference API☆34Updated 6 months ago
- Datasette plugin for searching all searchable tables at once☆29Updated 3 months ago
- Official Python API client library for turbopuffer☆102Updated last week
- Ask questions of your data with LLM assistance☆68Updated last year
- Analyzing hacker news in real-time with Bytewax and Proton☆44Updated 2 years ago
- Code to reproduce the Hacker News users fingerprinting with Burrows method☆52Updated 9 months ago
- PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…☆39Updated 6 months ago
- Document conversion and processing engine☆47Updated last year
- A cookiecutter template for creating a new LLM plugin that adds tools to LLM☆28Updated 8 months ago
- Blueprint by Mozilla.ai for answering questions about structured documents☆37Updated 10 months ago
- Create a SQLite database containing metadata from Google Drive☆163Updated 10 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Sort input lines semantically with llm☆120Updated 8 months ago
- A probabilistic approximate DNF counter☆39Updated 2 months ago
- Tools to construct and process Common Crawl webgraphs☆105Updated last week
- "llm python" is a command to run a Python interpreter in the LLM virtual environment☆36Updated 2 years ago
- Systems programming language with Python-like syntax and C-level performance. Compiles to native x86-64 machine code without external dep…☆22Updated 2 weeks ago
- Self-updating MCP server to cross-ref latest official pip, conda, poetry, uv, pixi, and pdm docs☆42Updated this week
- 🛡️ Managed isolated environments for Python☆109Updated last week
- A Datasette plugin that adds UI elements to edit, insert, or delete rows in SQLite tables☆23Updated 2 months ago
- Use LLMs to rank anything.☆109Updated this week
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- Like grep but with natural language queries☆50Updated 2 years ago
- LLM plugin for pulling content from Hacker News☆125Updated 9 months ago
- Parallelism and preemptive concurrency for sporadic workloads☆46Updated last year
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆109Updated 9 months ago
- A tool plugin for LLM to support web search via Exa☆30Updated 2 months ago