stayallive / whisper-subtitles
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.
☆26 · Updated last year
Alternatives and similar repositories for whisper-subtitles:
Users who are interested in whisper-subtitles are comparing it to the repositories listed below.
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆44 · Updated last week
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search ☆40 · Updated last year
- Generate visual podcasts about novels using open source models ☆25 · Updated 2 years ago
- Converts URL content into JSON with a simple prefix ☆67 · Updated 9 months ago
- Complex RAG backend ☆28 · Updated 10 months ago
- ☆16 · Updated 2 months ago
- Seamless Voice Interactions with LLMs ☆11 · Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆48 · Updated 6 months ago
- ☆59 · Updated last year
- Transcribe with ease :D ☆14 · Updated last year
- AI-augmented, conversational information retrieval and data exploration ☆39 · Updated 11 months ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 · Updated last year
- The very first artist assistant ☆21 · Updated last year
- Multimodal Chat with Gemini API ☆47 · Updated last year
- Easily trim 'messages' arrays for use with GPTs ☆74 · Updated last year
- Jupyter Notebooks for Ollama integration ☆123 · Updated 3 weeks ago
- Personalized language model from WhatsApp chat history ☆49 · Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Updated last year
- A Spotify playlist agent using CrewAI ☆81 · Updated 8 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf… ☆36 · Updated 3 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆45 · Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆44 · Updated 5 months ago
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot. ☆31 · Updated 7 months ago
- After my server UI improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 · Updated 6 months ago
- OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name, description, model of assistant and … ☆18 · Updated last year
- Transcription with speaker diarization pipeline ☆90 · Updated last year
- Python client for txtai ☆12 · Updated last week
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd ☆26 · Updated last year
- Automatic fine-tuning of models with synthetic data ☆75 · Updated last year
- Browser-based Voice Assistant ☆44 · Updated last year