dhruvilgala / tvtropes
☆55Updated 2 years ago
Alternatives and similar repositories for tvtropes:
Users that are interested in tvtropes are comparing it to the libraries listed below
- ☆67Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)☆75Updated last year
- Universal Semantic Annotator (LREC 2022)☆17Updated last month
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆45Updated 2 years ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆32Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆107Updated 6 years ago
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- Grammar Induction using a Template Tree Approach☆46Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆66Updated last month
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor…☆209Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆78Updated last year
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆41Updated 4 years ago
- ☆18Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆66Updated last year
- Seed Machine Translation Data☆31Updated 4 months ago
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"☆69Updated last year
- Pre-train Static Word Embeddings☆50Updated 3 weeks ago
- ☆158Updated 9 months ago
- Stylometry library for Burrows' Delta method☆36Updated 10 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 2 years ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- ☆29Updated last year
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 4 months ago
- Code for constructing TLDR corpus from Reddit dataset☆27Updated 3 years ago
- LLM plugin for clustering embeddings☆72Updated last year
- A tropes scraper☆33Updated last year