Aveek-Saha / Movie-Script-Database
A database of movie scripts from several sources
☆180 Updated last year
Alternatives and similar repositories for Movie-Script-Database
Users who are interested in Movie-Script-Database are comparing it to the repositories listed below
- ☆44 Updated 2 years ago
- Semantic search with embeddings: index anything ☆140 Updated 3 years ago
- ☆100 Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces ☆35 Updated 2 years ago
- KokoMind: Can LLMs Understand Social Interactions? ☆102 Updated 2 years ago
- Tools for content datamining and NLP at scale ☆44 Updated last year
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation ☆86 Updated last year
- The ScriptBase Corpus ☆45 Updated 7 years ago
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages ☆228 Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 Updated 5 years ago
- Automated Screenplay Annotation for Extracting Storytelling Knowledge ☆45 Updated 2 months ago
- TimeLMs: Diachronic Language Models from Twitter ☆111 Updated last year
- ☆92 Updated 3 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, … ☆89 Updated 2 years ago
- ☆162 Updated 2 years ago
- Screenplay Summarization using Latent Narrative Structure ☆38 Updated 3 years ago
- ☆253 Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated 2 years ago
- A small seq2seq punctuator tool based on DistilBERT ☆53 Updated 11 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper ☆110 Updated 2 years ago
- ☆194 Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆50 Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ☆83 Updated last year
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo ☆280 Updated 2 years ago
- An instruction-based benchmark for text improvements. ☆143 Updated 2 years ago
- Curated list of open source and openly accessible large language models ☆26 Updated 2 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆28 Updated 2 years ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022) ☆39 Updated 3 years ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression ☆68 Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆98 Updated 2 years ago