krafton-ai / OrakView external linksLinks
β124Jun 17, 2025Updated 8 months ago
Alternatives and similar repositories for Orak
Users that are interested in Orak are comparing it to the libraries listed below
Sorting:
- π΅ Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"β24Dec 14, 2025Updated 2 months ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.β24May 15, 2025Updated 9 months ago
- β11Nov 7, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoderβ12Mar 11, 2025Updated 11 months ago
- Local Relighting of Real Scenesβ35Dec 29, 2022Updated 3 years ago
- Solving Inverse Problems with Diffusion Optimal Control [NeurIPS 2024]β18Dec 21, 2024Updated last year
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversationsβ62Jan 16, 2025Updated last year
- νκ΅μ΄ λ²€μΉλ§ν¬ νκ° μ½λ ν΅ν©λ³Έ(?)β20Nov 15, 2024Updated last year
- AudioBERT π’ : Audio Knowledge Augmented Language Model (ICASSP 2025)β41Feb 1, 2025Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paperβ17Apr 19, 2024Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"β107Jan 17, 2025Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)β43Jun 13, 2024Updated last year
- <νΌμ λ§λ€λ©΄μ 곡λΆνλ νμ΄μ¬> μ± μ κΉνλΈ μλ£μ€β14Jan 14, 2026Updated last month
- The official Implementation of PeriodWave and PeriodWave-Turboβ217Apr 14, 2025Updated 10 months ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)β25Apr 20, 2025Updated 9 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI musicβ27Nov 9, 2023Updated 2 years ago
- Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"β24Feb 3, 2026Updated 2 weeks ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variabilityβ108Jan 17, 2025Updated last year
- Custom nodes that bring Character.AI's Ovi video+audio generator to ComfyUI with streamlined setup, selectable precision, attention-backeβ¦β118Oct 16, 2025Updated 4 months ago
- Actually released!β10Feb 24, 2021Updated 4 years ago
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Vβ¦β243Jul 31, 2024Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesisβ27Mar 21, 2025Updated 10 months ago
- SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipelineβ297Jan 19, 2026Updated 3 weeks ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesisβ50Sep 20, 2025Updated 4 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in β¦β57Aug 9, 2025Updated 6 months ago
- οΌWIPοΌlong form speech generatoinsβ31Apr 2, 2025Updated 10 months ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillationβ38Nov 20, 2024Updated last year
- β102Feb 4, 2026Updated last week
- μ체 ꡬμΆν νκ΅μ΄ νκ° λ°μ΄ν°μ μ μ΄μ©ν νκ΅μ΄ λͺ¨λΈ νκ°β31May 31, 2024Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ30May 28, 2020Updated 5 years ago
- β82Jan 22, 2025Updated last year
- Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Modelsβ44Aug 24, 2025Updated 5 months ago
- Official repository for K-EXAONE built by LG AI Researchβ66Feb 6, 2026Updated last week
- Python library to forecast univariate time series through backtesting model selectionβ23Jun 12, 2024Updated last year
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasksβ21Jan 19, 2026Updated 3 weeks ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-dailyβ14Jan 6, 2025Updated last year
- Card Payments Simulation Tool For Indie Devs : Core Card Switch Engine, Fraud Engine, ATM/POS GUI Simulator , Admin Dash (Real-time MSG β¦β19Jun 15, 2025Updated 8 months ago
- Cog wrapper for microsoft/OmniParser-v2β12Feb 25, 2025Updated 11 months ago
- A working FE Bypass for all Roblox clientsβ19Jan 10, 2026Updated last month