ragymorkos / LecturePunctuatorLinks
Automatically punctuate lecture transcripts obtained from YouTube.
☆18Updated 5 years ago
Alternatives and similar repositories for LecturePunctuator
Users that are interested in LecturePunctuator are comparing it to the libraries listed below
Sorting:
- Question Generation - Question Answering for Automatic Flashcards☆64Updated 3 years ago
- Use GPT-3 to process human conversations and extract context, identify information that would be useful, and suggest data sources to get …☆29Updated 3 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 4 years ago
- How readable is your text? Provide a text input and get its grade level. Validated against the source data.☆11Updated last month
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 6 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆18Updated 4 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- "Translate" a plot from Mark Riedl's WikiPlots corpus into a poem. For NaPoGenMo 2017.☆20Updated 8 years ago
- Unreliable News Index (for Columbia Journalism Review)☆56Updated 3 years ago
- A client for OpenAI's GPT-3 API for ad hoc testing of prompt without using the web interface.☆89Updated 4 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated this week
- List of easy American-English words: The New Dale-Chall (1995)☆32Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆177Updated 5 months ago
- see also section scraping on custom levels of depth☆87Updated 4 months ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only☆25Updated last month
- Domain-specific language for extracting structured data from HTML documents☆53Updated last month
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- CLI to extract article contents in bulk using Newspaper3k and multithreading.☆13Updated 7 years ago
- ☆15Updated 12 years ago
- Code and visualizations for related/similar subreddits☆19Updated 9 years ago
- A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.☆27Updated 4 years ago
- The top products in each subreddit from 2015 to 2017☆21Updated 7 years ago
- Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.☆38Updated this week
- A Corpus of Quotes☆68Updated 6 years ago
- A Tree View For Tweets☆95Updated 3 years ago
- A small Python script to get the heart rate data generated from an Apple Watch in a CSV form☆19Updated 7 years ago
- ☆13Updated 6 years ago