Aveek-Saha / Movie-Script-DatabaseLinks
A database of movie scripts from several sources
☆184Updated last year
Alternatives and similar repositories for Movie-Script-Database
Users that are interested in Movie-Script-Database are comparing it to the libraries listed below
Sorting:
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Updated 2 years ago
- ☆44Updated 3 years ago
- The ScriptBase Corpus☆47Updated 7 years ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆45Updated 5 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated 2 years ago
- Semantic search with embeddings: index anything☆140Updated 3 years ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆86Updated 2 years ago
- ☆100Updated last year
- ☆94Updated 3 years ago
- ☆255Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆115Updated 2 years ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression☆68Updated 3 years ago
- KokoMind: Can LLMs Understand Social Interactions?☆104Updated 2 years ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆51Updated 2 years ago
- GenieNLP: A versatile codebase for any NLP task☆89Updated last year
- Tools for content datamining and NLP at scale☆44Updated last year
- ☆162Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆112Updated last year
- Apps built using Inspired Cognition's Critique.☆57Updated 2 years ago
- Tools for managing datasets for governance and training.☆87Updated last week
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆229Updated 2 years ago
- ☆33Updated 2 years ago
- Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)☆136Updated last year
- SAIL: Search Augmented Instruction Learning☆158Updated 6 months ago
- ☆57Updated 3 years ago
- A small seq2seq punctuator tool based on DistilBERT☆53Updated last year
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆23Updated last year