Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12Mar 21, 2022Updated 3 years ago
Alternatives and similar repositories for VLPT-STD
Users that are interested in VLPT-STD are comparing it to the libraries listed below
Sorting:
- Code for SEEG: Semantic Energized Co-speech Gesture Generation☆33Dec 3, 2022Updated 3 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- Code of paper "LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate"☆18Jun 22, 2025Updated 8 months ago
- This package is EasyOCR-based optical character recognition. Unlike EasyOCR, the package uses a pre-saved with onnx language models, so i…☆13Mar 9, 2025Updated last year
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- Balanced K-means in Pytorch with strong GPU acceleration☆12Apr 30, 2020Updated 5 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated last month
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- [AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI☆61Updated this week
- A Cyberpunk 2077 First-Person Multi Rig for Blender (4.0+)☆11Jan 10, 2026Updated last month
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆25Oct 19, 2025Updated 4 months ago
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022☆104Jun 30, 2023Updated 2 years ago
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Dec 7, 2024Updated last year
- 中译名著多译本翻译转述语料。语料仅限于用于科研教学活动。文本著作权归原著者。☆10Jul 26, 2018Updated 7 years ago
- ☆10Apr 22, 2021Updated 4 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 9 months ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025☆15Jan 15, 2025Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- A project about Virtual Try-On. Lines of code ~5,200.☆10Jan 27, 2021Updated 5 years ago
- CV_JOB_interview_related_file☆10Jul 3, 2022Updated 3 years ago
- A non-slop skill creator for competent expert-level skills. Extract expertise through guided interviews or expert conversations, separate…☆23Dec 24, 2025Updated 2 months ago
- CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022)☆11Dec 7, 2022Updated 3 years ago
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- It is a VSCode theme which is based off of sublime text's monokai.☆10May 2, 2021Updated 4 years ago
- A pre-trained face parser based on SegNeXt☆50May 16, 2023Updated 2 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- Research on algorithms for garment perception, manipulation...☆12Sep 15, 2023Updated 2 years ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆16Oct 6, 2025Updated 5 months ago
- init☆11Sep 30, 2017Updated 8 years ago
- ☆12Sep 4, 2023Updated 2 years ago
- 基于文本的垃圾短信分类_文本预处理☆13Jan 11, 2016Updated 10 years ago
- Python app to sync Video Files to the beat of a song☆12Aug 5, 2019Updated 6 years ago
- Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".☆20Dec 30, 2024Updated last year