Lecture Video Summarization by Extracting Handwritten Content from Whiteboards
☆21Aug 22, 2019Updated 6 years ago
Alternatives and similar repositories for accessmath-icfhr2018
Users who are interested in accessmath-icfhr2018 are comparing it to the libraries listed below.
- Detect mathematical expressions in worksheets and draw bounding boxes.☆20Jul 20, 2020Updated 5 years ago
- Website for recognizing and extracting information from Vietnamese national ID cards (Chứng Minh Nhân Dân)☆11Oct 6, 2022Updated 3 years ago
- ☆25Jun 12, 2021Updated 5 years ago
- Transformer OCR is an Optical Character Recognition toolkit built for researchers working on OCR for both Vietnamese and English. This…☆10Dec 27, 2021Updated 4 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 3 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Code recipe for "Multimodal One-Shot Learning of Speech and Images"☆11Nov 21, 2018Updated 7 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- ☆11Jul 16, 2024Updated last year
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆11Nov 7, 2023Updated 2 years ago
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- Ranking algorithms for Spark machine learning pipeline☆14Jan 6, 2018Updated 8 years ago
- SpExtor: Sparse Entity Extractor☆11Feb 10, 2020Updated 6 years ago
- Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)☆14Jan 8, 2023Updated 3 years ago
- Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.☆15Jul 4, 2024Updated last year
- Source code of our EMNLP 2022 paper: Co-guiding Net: Achieving Mutual Guidances between Multiple Intent Detection and Slot Filling via He…☆12Nov 14, 2022Updated 3 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated 2 years ago
- A Cytoscape.js extension generator☆10Jan 16, 2018Updated 8 years ago
- A Python port of the R implementation of Kleinberg's burst detection algorithm☆12Apr 11, 2022Updated 4 years ago
- A pipeline architecture for temporal segmentation of video lectures.☆12Sep 8, 2020Updated 5 years ago
- ☆15Jan 16, 2024Updated 2 years ago
- Code for paper "Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition"☆16Aug 19, 2019Updated 6 years ago
- Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API☆14Feb 21, 2022Updated 4 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆23Jul 10, 2024Updated last year
- ☆19May 19, 2024Updated 2 years ago
- A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆12Feb 26, 2020Updated 6 years ago
- A tool that uses a Keras implementation of YOLOv4 (TensorFlow backend) for detection and VietOCR for recognition.☆20Oct 3, 2023Updated 2 years ago
- A video recommendation system in Python for a cold start, analyzing user behavior and lecture properties of a TunedIt dataset given by Vi…☆14Apr 14, 2019Updated 7 years ago
- RASA-based voice bot, built after 1 month of jumping into AI ;)☆29Sep 3, 2019Updated 6 years ago
- Code for "SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling"☆19Nov 22, 2022Updated 3 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- [DEPRECATED] Vietnamese Handwriting Recognition with CRNN and CTC Loss☆32Apr 2, 2019Updated 7 years ago
- ☆27Mar 11, 2026Updated 3 months ago