☆13May 10, 2019Updated 7 years ago
Alternatives and similar repositories for Act2Vec
Users that are interested in Act2Vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated last month
- VQ-VAE implementation in pytorch, supporting EMA and Gumbel trainings. Applicable for images and time series.☆11Oct 19, 2022Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Simlulation code for paper "Cooperative caching for spectrum access in cognitive radio networks".☆10Oct 24, 2017Updated 8 years ago
- A python tool that generate latex(e.g. Table, matrix) code.☆10Jun 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Direct preference optimization with f-divergences.☆17Nov 3, 2024Updated last year
- A reinforcement learning agent playing as the turret, where its goal is to allow ten friendly units to enter the base, and loses if an en…☆14Dec 24, 2020Updated 5 years ago
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated last year
- Recommendation Alogrithms code by pytorch☆14Mar 7, 2019Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Mar 24, 2023Updated 3 years ago
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.☆13Mar 20, 2025Updated last year
- diffusers with search engine☆12Jan 13, 2026Updated 4 months ago
- ☆14Mar 8, 2025Updated last year
- Framework for Algorithmic Correctness Testing of Operators☆17Mar 9, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- ☆12Mar 7, 2024Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- The High-dimensional BayesOpt algorithms from "A Framework for Bayesian Optimization in Embedded Subspaces☆43Jun 8, 2019Updated 6 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 7 months ago
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Sep 25, 2018Updated 7 years ago
- A Spark based semantic reasoning engine☆14Mar 28, 2017Updated 9 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of LANTERN (ICLR'25) and LANTERN++(ICLRW-SCOPE'25)☆21Mar 5, 2025Updated last year
- Repo for a generalised DQN Agent model capable of solving major discrete action space control problems☆18Aug 20, 2018Updated 7 years ago
- 本项目演示联邦学习方法☆10Aug 1, 2019Updated 6 years ago
- GPUDirect Async suite☆16Dec 5, 2018Updated 7 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 6 years ago
- ☆21Nov 5, 2018Updated 7 years ago
- Triton kernels for Flux☆23Jul 7, 2025Updated 10 months ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆28Jan 27, 2026Updated 4 months ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆21Jul 13, 2025Updated 10 months ago
- ☆33Aug 30, 2024Updated last year
- Decentralized Scheduling for Cooperative Localization with Deep Reinforcement Learning☆35Jun 1, 2019Updated 6 years ago
- ☆42Feb 9, 2020Updated 6 years ago
- ☆18Dec 3, 2020Updated 5 years ago
- A Modern Python Package Template☆34May 18, 2026Updated last week