☆23Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for LLaVA-server
Users who are interested in LLaVA-server are comparing it to the libraries listed below.
- [ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"☆38Jul 12, 2024Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆750Mar 22, 2024Updated 2 years ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆244Apr 6, 2024Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Oct 9, 2023Updated 2 years ago
- ☆27Aug 17, 2023Updated 2 years ago
- A library for constrained RLHF.☆13Feb 19, 2024Updated 2 years ago
- Code for the paper "Training Diffusion Models with Reinforcement Learning"☆557Jul 5, 2023Updated 2 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- pix2pix model for generating terrain☆17Jan 7, 2023Updated 3 years ago
- Reproduction of DDPO paper (RLHF for diffusion)☆93Sep 20, 2023Updated 2 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆649May 24, 2024Updated last year
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- ☆11Oct 4, 2018Updated 7 years ago
- Team Doggeee's solution to the Ego4D LTA challenge @ CVPRW'23☆13Nov 4, 2023Updated 2 years ago
- This project is a collection of study notes I compiled while taking the CS336 course.☆24Nov 1, 2025Updated 4 months ago
- awesome unsupervised learning paper list☆12Jan 4, 2018Updated 8 years ago
- Code for Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning☆37Jun 16, 2024Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,650Oct 29, 2025Updated 4 months ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 5 months ago
- handy cli tool to convert your speech to clipboard text☆15Updated this week
- ☆11Jul 29, 2021Updated 4 years ago
- Code for "Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation" @ NeurIPS 2023☆25Oct 12, 2023Updated 2 years ago
- [EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt☆20Nov 2, 2023Updated 2 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Nov 14, 2023Updated 2 years ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆671Nov 10, 2025Updated 4 months ago
- Official implementation for GATSBI: Generative Agent-centric Spatio-temporal Object Interaction (CVPR'2021)☆12Mar 23, 2022Updated 3 years ago
- Twitter API client for Kotlin Multiplatform☆10Feb 17, 2021Updated 5 years ago
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆19Nov 25, 2024Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- ☆25May 30, 2023Updated 2 years ago
- ☆10May 24, 2021Updated 4 years ago
- Scalable learning with pragmatics☆11Mar 31, 2018Updated 7 years ago
- A recent experiment applying the MLP-Mixer model to emotion recognition (sentiment analysis)☆13Jul 9, 2021Updated 4 years ago
- benchmark for Speech-to-Intent engines☆17Dec 18, 2025Updated 3 months ago
- An Online Latent Dirichlet Allocation with Infinite Vocabulary implementation in Python.☆12Oct 4, 2018Updated 7 years ago
- Implementations of Curious Replay for model-based adaptation.☆43Jul 5, 2023Updated 2 years ago