nazmulkazi / dataset_automated_medical_transcription
Dataset for training machine learning model for automatically generating psychiatric case notes from doctor-patient conversations.
☆50Updated last year
Related projects ⓘ
Alternatives and complementary repositories for dataset_automated_medical_transcription
- Sentence tokenizer for clinical/medical text.☆25Updated 5 months ago
- Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.☆82Updated 4 years ago
- ☆49Updated last year
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆15Updated last year
- Clinical text summarization by adapting large language models☆120Updated 3 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆89Updated 2 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆89Updated last year
- Using short models to classify long texts☆20Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated 7 months ago
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆46Updated 9 months ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆70Updated 2 weeks ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆43Updated last year
- ☆81Updated 3 months ago
- Speaker diarization service☆19Updated last week
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆132Updated 9 months ago
- A deidentifier / deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization.☆77Updated 4 months ago
- A Streamlit application to visualize sentence embeddings☆20Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆41Updated last year
- Self-verification for LLMs.☆62Updated last year
- ☆22Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- This repository will be a summary and outlook on all our open, medical, AI advancements.☆29Updated last year
- A python package for removing duplicate text in clinical notes or other documents☆34Updated 4 years ago
- PLM-ICD: Automatic ICD Coding with Pretrained Language Models☆56Updated 11 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system☆23Updated last year
- ☆23Updated 2 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆27Updated last year