dhruvilgala / tvtropes
☆62 Updated 2 years ago
Alternatives and similar repositories for tvtropes
Users who are interested in tvtropes are comparing it to the repositories listed below
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 4 years ago
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor… ☆212 Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆114 Updated 7 years ago
- Documentation effort for the BookCorpus dataset ☆34 Updated 4 years ago
- Libraries, Archives and Museums (LAM) ☆88 Updated 3 years ago
- LLM plugin for clustering embeddings ☆82 Updated last year
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021) ☆84 Updated 2 years ago
- ☆67 Updated last year
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training ☆45 Updated 5 years ago
- ☆100 Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆51 Updated 2 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆203 Updated last year
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4" ☆69 Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 Updated 2 years ago
- The AI Knowledge Editor ☆186 Updated 3 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆80 Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆193 Updated 6 months ago
- Frame Semantic Parser based on T5 and FrameNet ☆63 Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆130 Updated last year
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆89 Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm ☆69 Updated 4 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT) ☆25 Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated 2 years ago
- A BERT-based application for reusable text classification at scale ☆38 Updated 2 years ago
- Semantic search engine indexing 110 million academic publications ☆92 Updated last week
- ☆35 Updated 2 years ago
- Grammar Induction using a Template Tree Approach ☆46 Updated 7 months ago
- ☆176 Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated 2 years ago