zwhong714 / weak-to-strong-preference-optimization
[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
☆16Feb 24, 2025Updated 11 months ago
Alternatives and similar repositories for weak-to-strong-preference-optimization
Users who are interested in weak-to-strong-preference-optimization are comparing it to the libraries listed below.
- Documentation at☆14Mar 27, 2025Updated 10 months ago
- [AAAI 2025]Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud Denoising☆14Sep 23, 2025Updated 4 months ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆20Aug 10, 2024Updated last year
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- Implementation of the CVPR2025 paper LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty.☆16Sep 10, 2025Updated 5 months ago
- [ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images☆42Jan 25, 2024Updated 2 years ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction☆73Aug 1, 2025Updated 6 months ago
- Code for "Zero-Shot Out-of-Distribution Detection with Feature Correlations"☆13Jan 19, 2020Updated 6 years ago
- ☆12Dec 22, 2025Updated last month
- ☆20Aug 8, 2025Updated 6 months ago
- Tool for testing IPv4 and IPv6 DHCP services☆13Mar 27, 2020Updated 5 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆15Aug 15, 2025Updated 6 months ago
- ☆12Jun 27, 2022Updated 3 years ago
- Official Pytorch implementation of the AAAI 2025 "Spiking Point Transformer for Point Cloud Classification"☆15Apr 12, 2025Updated 10 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 9 months ago
- EXL2 quantization generalized to other models.☆10Mar 17, 2024Updated last year
- Official PyTorch code for "Vector Quantization Prompting for Continual Learning (NeurIPS2024)".☆10Oct 16, 2024Updated last year
- Code for our work CISS: Certified Robustness Against Natural Language Attacks by Causal Intervention, published at ICML 2022☆11Dec 6, 2022Updated 3 years ago
- ☆17May 3, 2025Updated 9 months ago
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures"☆11Nov 21, 2022Updated 3 years ago
- ☆10Oct 25, 2024Updated last year
- [WACV 2025-Oral Presentation] Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging☆12Mar 31, 2025Updated 10 months ago
- Public Codebase supporting the paper "Modeling Cellular Perturbations with The Sparse Additive Mechanism Shift Variational Autoencoder" b…☆14Oct 20, 2023Updated 2 years ago
- Source code to execute signal injection attacks against CCD image sensors☆11Aug 26, 2021Updated 4 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 8 months ago
- Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression☆12Mar 17, 2025Updated 10 months ago
- HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation☆13Sep 13, 2024Updated last year
- Official repository for Targeted Unlearning with Single Layer Unlearning Gradient (SLUG), ICML 2025☆15Aug 10, 2025Updated 6 months ago
- ☆12Apr 22, 2024Updated last year
- Benchmarking Deepseek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- Implementation of the joint Bayesian model, written in Python.☆11Aug 2, 2021Updated 4 years ago
- Heterogeneous Model Reuse via Optimizing Multiparty Multiclass Margin☆11Jan 15, 2020Updated 6 years ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 3 months ago
- Developer-focused AI Gateway☆15Mar 7, 2025Updated 11 months ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)☆10Dec 15, 2024Updated last year
- trending repositories and news related to AI☆10Mar 22, 2019Updated 6 years ago