psmedia / Books3Info
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆27 Updated 2 months ago
Alternatives and similar repositories for Books3Info:
Users who are interested in Books3Info compare it to the repositories listed below:
- LLM plugin for clustering embeddings ☆75 Updated last year
- LLM plugin for embeddings using sentence-transformers ☆58 Updated 3 weeks ago
- ☆18 Updated last year
- Knowledge Graph Generator app ☆30 Updated last year
- Production-grade embedding generation, for any length of text, for transformer models. ☆23 Updated 5 months ago
- Small python package to measure OCR quality and other related metrics. ☆21 Updated last year
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated 10 months ago
- LLM plugin for models hosted on Replicate ☆62 Updated last year
- ☆29 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Structured Output Is All You Need! ☆57 Updated last year
- spaCy entry points for Curated Transformers ☆29 Updated 6 months ago
- ☆67 Updated last year
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search. ☆13 Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆33 Updated last year
- examples and guides to using Nomic Atlas ☆32 Updated last week
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated 7 months ago
- Embedding models from Jina AI ☆58 Updated last year
- Access the Cohere Command R family of models ☆37 Updated 3 weeks ago
- Get deterministic output in any format like json from any LLM. ☆18 Updated 2 years ago
- Run embedding models using ONNX ☆32 Updated last year
- H2O is a web app for creating and reading open educational resources, primarily in the legal field ☆38 Updated 2 months ago
- Libraries, Archives and Museums (LAM) ☆82 Updated 2 years ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 8 months ago
- LLM plugin providing access to Mistral models using the Mistral API ☆176 Updated last month
- ☆91 Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆47 Updated 2 years ago
- Plugin for LLM adding a Markov chain generating model ☆19 Updated 9 months ago
- ☆11 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated last year