psmedia / Books3Info
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆26Updated last month
Alternatives and similar repositories for Books3Info:
Users that are interested in Books3Info are comparing it to the libraries listed below
- LLM plugin for clustering embeddings☆72Updated last year
- LLM plugin for models hosted on Replicate☆62Updated 11 months ago
- LLM plugin for embeddings using sentence-transformers☆53Updated this week
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆80Updated 4 months ago
- Some tough questions to test new models.☆27Updated 11 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 7 months ago
- Access the Cohere Command R family of models☆34Updated 2 weeks ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated 11 months ago
- ☆12Updated 2 years ago
- Plugin for LLM adding a Markov chain generating model☆18Updated 8 months ago
- ☆26Updated 2 weeks ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆32Updated 2 years ago
- Tools to construct and process webgraphs from Common Crawl data☆85Updated 3 weeks ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated 9 months ago
- Adding Marimo to Datasette☆20Updated last week
- Create embeddings for LLM using the Nomic API☆23Updated 4 months ago
- LLM plugin providing access to Mistral models using the Mistral API☆172Updated 2 weeks ago
- A Datasette plugin that turns a Datasette instance into a ChatGPT plugin☆67Updated last year
- examples and guides to using Nomic Atlas☆27Updated last week
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆38Updated last month
- spaCy extension for Visual Studio Code☆29Updated 3 weeks ago
- Not financial advice.☆28Updated 2 years ago
- Chat Markup Language conversation library☆55Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- ☆67Updated last year
- Handout for a talk I gave about LLM and CLI tools☆62Updated 9 months ago
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 4 months ago
- ☆27Updated 6 months ago
- Add website scraping abilities to Datasette☆62Updated 2 years ago
- ☆28Updated last year