psmedia / Books3InfoLinks
Data and information related to the Books3 dataset included as part of The Pile, and used to train Meta's LLaMA among others
☆32Updated 3 months ago
Alternatives and similar repositories for Books3Info
Users that are interested in Books3Info are comparing it to the libraries listed below
Sorting:
- LLM plugin for clustering embeddings☆82Updated last year
- LLM plugin providing access to Mistral models using the Mistral API☆196Updated last month
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023).☆171Updated 2 years ago
- LLM plugin for embeddings using sentence-transformers☆70Updated 4 months ago
- 🚀 Template Haystack Search Application with Streamlit☆27Updated 7 months ago
- ☆67Updated last year
- Structured Output Is All You Need!☆58Updated last year
- 💭 Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applications☆296Updated 3 months ago
- Import unstructured data (text and images) into structured tables☆154Updated 4 months ago
- LLM plugin for models hosted on Replicate☆64Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆87Updated 9 months ago
- Completion After Prompt Probability. Make your LLM make a choice☆80Updated 10 months ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆115Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Demos utilizing the ChatGPT API☆94Updated 2 years ago
- A web app to experiment with chained prompts faster.☆16Updated 2 years ago
- Some tough questions to test new models.☆28Updated last year
- Convert all of libgen to high quality markdown☆253Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆114Updated last month
- GPT-based Conversation Summarizer☆148Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆37Updated 11 months ago
- ☆176Updated last year
- Tools to construct and process Common Crawl webgraphs☆93Updated last week
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 11 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- 🦄 An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitter☆45Updated last year
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor…☆212Updated 2 years ago
- Generate captions for images with Salesforce BLIP☆123Updated last year
- Easily create LLM automation/agent workflows☆59Updated last year