psmedia / Books3Info
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆33 Updated 7 months ago
Alternatives and similar repositories for Books3Info
Users who are interested in Books3Info are comparing it to the repositories listed below.
- LLM plugin for clustering embeddings ☆82 Updated last year
- Tools to construct and process Common Crawl webgraphs ☆103 Updated last week
- ☆184 Updated 2 years ago
- ☆67 Updated last year
- 💭 Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applications ☆301 Updated 7 months ago
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023). ☆171 Updated 2 years ago
- A personal knowledge base that I can dump information to and help me learn ☆24 Updated 7 months ago
- 🚀 Template Haystack Search Application with Streamlit ☆27 Updated 11 months ago
- LLM plugin for embeddings using sentence-transformers ☆73 Updated 8 months ago
- Enable decision-making based on simulations ☆231 Updated last year
- LLM plugin providing access to Mistral models using the Mistral API ☆205 Updated 5 months ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆112 Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice ☆82 Updated last year
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ☆116 Updated 2 years ago
- Small python package to measure OCR quality and other related metrics. ☆25 Updated last year
- ☆95 Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆195 Updated 7 months ago
- Data about 349K OpenAI Custom GPTs ☆149 Updated last year
- 🦄 An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitter ☆46 Updated last year
- Plugin for LLM adding support for the GPT4All collection of models ☆258 Updated last year
- Knowledge Graph Generator app ☆34 Updated last year
- 📚 Datasets and models for instruction-tuning ☆238 Updated 2 years ago
- ☆62 Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆80 Updated 2 years ago
- A web app to experiment with chained prompts faster. ☆17 Updated 2 years ago
- examples and guides to using Nomic Atlas ☆37 Updated 8 months ago
- LLM finetuning ☆42 Updated 2 years ago
- ☆22 Updated 2 years ago
- Convert all of libgen to high quality markdown ☆254 Updated 2 years ago
- Python library to use Pleias-RAG models ☆67 Updated 7 months ago