psmedia / Books3Info
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆25 Updated last year
Alternatives and similar repositories for Books3Info:
Users who are interested in Books3Info are comparing it to the repositories listed below.
- LLM plugin for clustering embeddings ☆65 Updated 10 months ago
- LLM plugin for models hosted on Replicate ☆60 Updated 9 months ago
- LLM plugin for embeddings using sentence-transformers ☆44 Updated 11 months ago
- LLM plugin for models hosted by Anyscale Endpoints ☆32 Updated 8 months ago
- Embedding models from Jina AI ☆57 Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated 4 months ago
- Plugin for LLM adding a Markov chain generating model ☆16 Updated 6 months ago
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ☆76 Updated last month
- Access the Cohere Command R family of models ☆34 Updated 9 months ago
- Knowledge Graph Generator app ☆30 Updated 9 months ago
- Python package for extractive NLP using the OpenAI API ☆16 Updated 4 months ago
- Structured Output Is All You Need! ☆51 Updated 9 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆29 Updated 4 months ago
- ☆67 Updated 10 months ago
- The LLM plugins directory ☆40 Updated last year
- Evaluating the quality and capabilities of extractions and reasoning using various instructor clients. ☆48 Updated 3 months ago
- Voyage AI Official Python Library ☆46 Updated last month
- Chat Markup Language conversation library ☆55 Updated last year
- ☆29 Updated last year
- spaCy entry points for Curated Transformers ☆26 Updated 3 months ago
- LLM plugin providing access to Mistral models using the Mistral API ☆156 Updated last month
- A Datasette plugin that turns a Datasette instance into a ChatGPT plugin ☆67 Updated 10 months ago
- ☆66 Updated last year
- A BERT-based application for reusable text classification at scale ☆37 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated 10 months ago
- ☆56 Updated 2 months ago
- A textual TUI for Prodigy ☆14 Updated last year
- ☆12 Updated last year
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023). ☆171 Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated last year