psmedia / Books3InfoLinks
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆31Updated last month
Alternatives and similar repositories for Books3Info
Users that are interested in Books3Info are comparing it to the libraries listed below
Sorting:
- LLM plugin for clustering embeddings☆76Updated last year
- LLM plugin for models hosted on Replicate☆62Updated last year
- CLI that queries multiple language models in parallel using prompts from a CSV file☆27Updated last month
- LLM plugin for embeddings using sentence-transformers☆66Updated 2 months ago
- spaCy extension for Visual Studio Code☆32Updated 3 months ago
- The LLM plugins directory☆42Updated last year
- A Datasette plugin that turns a Datasette instance into a ChatGPT plugin☆67Updated last year
- Embedding models from Jina AI☆60Updated last year
- Access the Cohere Command R family of models☆37Updated 3 months ago
- ☆67Updated last year
- Jupyter Notebooks for testing the impact of tip incentives for ChatGPT☆22Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 9 months ago
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"☆69Updated 2 years ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆35Updated last year
- Plugin for LLM adding a Markov chain generating model☆19Updated 11 months ago
- https://verdad.app☆82Updated 6 months ago
- LLM plugin adding support for the MPT-30B language model☆34Updated last year
- Convert a Claude.ai export to SQLite☆52Updated 8 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- Tools to construct and process Common Crawl webgraphs☆92Updated last month
- The official project website for Datasette☆114Updated last month
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- Adding Marimo to Datasette☆21Updated 3 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Tools for interactive visual exploration of semantic embeddings.☆34Updated 9 months ago
- Quality News - Towards a fairer ranking formula for Hacker News☆82Updated 2 months ago
- Create embeddings for LLM using the Nomic API☆23Updated 7 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆86Updated 7 months ago
- Structured Output Is All You Need!☆57Updated last year
- human_detectors hosts the data released from the paper "People who frequently use ChatGPT for writing tasks are accurate and robust detec…☆34Updated last month