nazmulkazi / dataset_automated_medical_transcription
Dataset for training machine learning model for automatically generating psychiatric case notes from doctor-patient conversations.
☆54Updated 2 years ago
Alternatives and similar repositories for dataset_automated_medical_transcription:
Users that are interested in dataset_automated_medical_transcription are comparing it to the libraries listed below
- Sentence tokenizer for clinical/medical text.☆26Updated 9 months ago
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.☆89Updated 4 years ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Updated last year
- ☆53Updated last year
- A new collection of 1.7k doctor-patient conversations and corresponding clinical notes/summaries.☆67Updated last year
- ☆86Updated 3 weeks ago
- PLM-ICD: Automatic ICD Coding with Pretrained Language Models☆64Updated last year
- ☆50Updated last year
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆91Updated last year
- Clinical text summarization by adapting large language models☆133Updated 7 months ago
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounters’ dialogue transcript and clinical note…☆33Updated last year
- A deidentifier / deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization.☆83Updated 8 months ago
- Self-verification for LLMs.☆63Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated 11 months ago
- DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains☆19Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆100Updated 6 months ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆76Updated 4 months ago
- ☆21Updated 4 years ago
- Transformers for Clinical NLP☆23Updated last month
- ☆57Updated last year
- Document Information Extraction & Anonymization using local LLMs☆39Updated this week
- A python package for removing duplicate text in clinical notes or other documents☆36Updated 4 years ago
- Medical reasoning using large language models☆87Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Using short models to classify long texts☆21Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- LongHealth: A Question Answering Benchmark with Long Clinical Documents☆21Updated 3 months ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆44Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆44Updated 2 years ago