citiususc / pyplexity
Cleaning tool for web scraped text
☆39Updated last year
Alternatives and similar repositories for pyplexity:
Users that are interested in pyplexity are comparing it to the libraries listed below
- Embedding models from Jina AI☆58Updated last year
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- LLM plugin for clustering embeddings☆75Updated last year
- Granular Viewer of Sentiments Between Entities in Massively Large Documents and Collections of Texts, powered by AREkit☆38Updated 3 months ago
- ☆27Updated 7 months ago
- LLM plugin for embeddings using sentence-transformers☆58Updated 3 weeks ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- Structured Output Is All You Need!☆57Updated last year
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 5 months ago
- Question Generation - Question Answering for Automatic Flashcards☆64Updated 3 years ago
- Run semantic queries over your twitter history☆39Updated 2 years ago
- Python package for extractive NLP using the OpenAI API☆17Updated 7 months ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- ☆30Updated 2 years ago
- Run embedding models using ONNX☆32Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 7 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated 10 months ago
- 100k+ topic labeled news articles published from thousands of news websites☆19Updated 4 years ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- ☆29Updated last year
- LLM plugin adding support for the MPT-30B language model☆34Updated last year
- Sentence Embedding as a Service☆15Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆25Updated last month
- Finds linguistic patterns effortlessly☆36Updated last year
- An Infr app that automates data collection from your PC, macOS or Linux client.☆11Updated last year
- A tool to automatically turn any Wikipedia article into a video☆56Updated 2 years ago
- spaCy entry points for Curated Transformers☆29Updated 6 months ago