JianqiangWan / VLPT-STDView external linksLinks
Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12Mar 21, 2022Updated 3 years ago
Alternatives and similar repositories for VLPT-STD
Users that are interested in VLPT-STD are comparing it to the libraries listed below
Sorting:
- Code for SEEG: Semantic Energized Co-speech Gesture Generation☆33Dec 3, 2022Updated 3 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)☆144Jul 26, 2023Updated 2 years ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 3 months ago
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated 3 weeks ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆106Dec 9, 2021Updated 4 years ago
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Apr 23, 2025Updated 9 months ago
- CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022)☆11Dec 7, 2022Updated 3 years ago
- Chatbot for NHS Medicines A-Z. Agentic Retrieval Augmented Generation utilising the OpenAI API, LangChain, and LangGraph to query a vecto…☆10Jun 24, 2024Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- It is a VSCode theme which is based off of sublime text's monokai.☆10May 2, 2021Updated 4 years ago
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022☆104Jun 30, 2023Updated 2 years ago
- stop updating, further reading, pls go to https://github.com/rgtjf/Paper-Reading-Third-Edition☆11Oct 8, 2017Updated 8 years ago
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Dec 7, 2024Updated last year
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 9 months ago
- Agent building tools via block diagram UI☆12Dec 31, 2025Updated last month
- Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring☆24Dec 15, 2025Updated 2 months ago
- ☆10Apr 22, 2021Updated 4 years ago
- A project about Virtual Try-On. Lines of code ~5,200.☆10Jan 27, 2021Updated 5 years ago
- A Cyberpunk 2077 First-Person Multi Rig for Blender (4.0+)☆11Jan 10, 2026Updated last month
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- ☆11May 2, 2022Updated 3 years ago
- real-time web visualizer for 3D gaussian splatting☆10Jan 31, 2025Updated last year
- 中译名著多译本翻译转述语料。语料仅限于用于科研教学活动。文本著作权归原著者。☆10Jul 26, 2018Updated 7 years ago
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆10Nov 12, 2024Updated last year
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆20Jun 7, 2025Updated 8 months ago
- A pre-trained face parser based on SegNeXt☆50May 16, 2023Updated 2 years ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆16Oct 6, 2025Updated 4 months ago
- 基于文本的垃圾短信分类_文本预处理☆13Jan 11, 2016Updated 10 years ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- Python app to sync Video Files to the beat of a song☆12Aug 5, 2019Updated 6 years ago
- ☆12Sep 4, 2023Updated 2 years ago