jazmiahenry / aave_corpora
☆19 Updated 3 years ago
Alternatives and similar repositories for aave_corpora
Users who are interested in aave_corpora are comparing it to the libraries listed below
- ☆62 Updated 2 years ago
- ☆67 Updated last year
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4. ☆34 Updated 2 years ago
- Libraries, Archives and Museums (LAM) ☆88 Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 4 years ago
- Tools for interactive visual exploration of semantic embeddings. ☆39 Updated last year
- ☆55 Updated last year
- Documentation effort for the BookCorpus dataset ☆34 Updated 4 years ago
- Small python package to measure OCR quality and other related metrics. ☆25 Updated last year
- Ludwig benchmark ☆19 Updated 3 years ago
- Production-grade embedding generation, for any length of text, for transformer models. ☆23 Updated 6 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 Updated last year
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 3 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile. ☆19 Updated 2 years ago
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4" ☆69 Updated 2 years ago
- ACL 2022 ☆128 Updated 2 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated last year
- A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20) ☆71 Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 Updated 2 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆40 Updated 6 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆27 Updated last year
- A BERT-based application for reusable text classification at scale ☆38 Updated 2 years ago
- The AI Knowledge Editor ☆186 Updated 3 years ago
- Command Line Interface for running 🤗 Transformers Image Classification locally ☆19 Updated 7 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆25 Updated last month
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆114 Updated 7 years ago
- A collection of notebooks for Natural Language Processing ☆25 Updated 11 months ago
- Generate datasets based on core.ac.uk open research paper text mines. ☆30 Updated 7 years ago
- Repo for the LREC 2022 paper The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts. ☆14 Updated 3 years ago
- ☆69 Updated 3 years ago