nazmulkazi / dataset_automated_medical_transcription
Dataset for training machine learning model for automatically generating psychiatric case notes from doctor-patient conversations.
☆54Updated 2 years ago
Alternatives and similar repositories for dataset_automated_medical_transcription:
Users that are interested in dataset_automated_medical_transcription are comparing it to the libraries listed below
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.☆88Updated 4 years ago
- Sentence tokenizer for clinical/medical text.☆25Updated 7 months ago
- Dataset of 57 mock medical primary care consultations: audio, consultation notes, human utterance-level transcripts.☆41Updated 2 years ago
- ☆22Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆93Updated 4 months ago
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounters’ dialogue transcript and clinical note…☆31Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated 9 months ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆44Updated last year
- A Python Natural Language Processing Toolkit for Medical Text Generation☆75Updated 2 months ago
- Clinical text summarization by adapting large language models☆131Updated 5 months ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆44Updated last year
- ☆85Updated 5 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆35Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆139Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆58Updated 2 years ago
- An exploratory, tutorial and analytical view of the Unified Medical Language System (UMLS) & the software/technologies provided via being…☆39Updated 9 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 10 months ago
- Transformers for Clinical NLP☆23Updated 3 weeks ago
- Using short models to classify long texts☆21Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆50Updated last year
- MAFAND-MT☆55Updated 6 months ago
- Official repository for the AnnoMI dataset: the first public collection of expert-annotated MI transcripts.☆60Updated last year
- Document Information Extraction & Anonymization using local LLMs☆28Updated last week
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆15Updated last year
- Crosslingual Question Answering for African Languages☆29Updated 3 months ago