Weird autoencoder experiments
☆25May 20, 2026Updated last month
Alternatives and similar repositories for owl-vaes
Users that are interested in owl-vaes are comparing it to the libraries listed below.
- OWL Control is a desktop application that records gameplay footage and input data from video games to create open-source datasets for AI …☆47Mar 3, 2026Updated 3 months ago
- Basic world models☆32Oct 30, 2025Updated 8 months ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆16Dec 4, 2025Updated 6 months ago
- Arduino library for the Maxim DS1337 I2C RTC.☆11Aug 20, 2014Updated 11 years ago
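- A small project that wraps Doubao ASR capabilities in an OpenAI-compatible interface, supports launching via Docker, and provides a reference correction prompt for use with Spokenly, achieving a speech-correction effect similar to Typeless.☆42Feb 28, 2026Updated 4 months ago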
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation☆22Apr 23, 2025Updated last year
- Goal-conditioned reinforcement learning like 🔥☆15Feb 3, 2024Updated 2 years ago
- ☆14Jan 2, 2025Updated last year
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆31Oct 28, 2025Updated 8 months ago
- ☆43Jun 6, 2025Updated last year
- Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning☆60Mar 16, 2026Updated 3 months ago
- A git-style way of managing LLM chats☆34Jan 26, 2026Updated 5 months ago
- DrillSat 2018☆16Oct 7, 2018Updated 7 years ago
- Searching for Music Mixing Graphs: A Pruning Approach☆26Feb 13, 2025Updated last year
- poorman's ar-dit tts☆45Dec 31, 2025Updated 6 months ago
- A library for making PyTorch models streamable☆67Jan 23, 2026Updated 5 months ago
- A SQL transformation engine that type-checks your whole pipeline and catches breaking changes before they run — branches, replay, column-…☆267Updated this week
- ☆24Apr 30, 2025Updated last year
- Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data☆11Mar 30, 2024Updated 2 years ago
- The official implementation of AAAI2024 paper of "Scribble Hides Class: Promoting Scribble-based Semantic Segmentation with its Class Lab…☆17Oct 10, 2024Updated last year
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.☆29Sep 13, 2025Updated 9 months ago
- Video encoding and muxing through libobs in Rust☆39May 31, 2026Updated last month
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆34Mar 14, 2025Updated last year
- DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image (ICLR 2025)☆24Jan 12, 2026Updated 5 months ago
- Character decomposition for the Wubi input method, with a selectable Wubi version☆20Jul 1, 2019Updated 7 years ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- ☆37Nov 26, 2025Updated 7 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆228Nov 6, 2025Updated 7 months ago
- Code for the paper: Separate but together: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆41Nov 1, 2021Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆33Oct 30, 2020Updated 5 years ago
- This is a simple python script to retarget a rig in blender from a radical capture to a rig imported from Daz3D☆10Oct 10, 2020Updated 5 years ago
- A pytorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- ☆20Oct 14, 2024Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆36Oct 23, 2025Updated 8 months ago
- Chinese version of VITS, an end-to-end TTS speech-synthesis model (VITS Mandarin)☆15Sep 17, 2022Updated 3 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago