Few-Shot Preference Optimization (FSPO) personalizes LLMs by reframing reward modeling as a meta-learning problem, enabling rapid adaptation to user preferences with minimal labeled data, leveraging synthetic datasets for scalability, and achieving high success rates in personalized content generation across multiple domains.
☆15Feb 27, 2025Updated last year
Alternatives and similar repositories for fewshot-preference-optimization
Users that are interested in fewshot-preference-optimization are comparing it to the libraries listed below
Sorting:
- ☆18Oct 8, 2024Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"☆22Oct 31, 2025Updated 4 months ago
- ☆22Feb 8, 2025Updated last year
- 010Editor-Crack version:13.0.1☆10Mar 18, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ToyNLP: Learning NLP from Scratch☆32Mar 1, 2026Updated 2 weeks ago
- 登录脚本☆12Nov 4, 2022Updated 3 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- ☆15Nov 18, 2025Updated 4 months ago
- Gradio UI to load crewAI configuration from excel xls and generate the python code. The source of the crews is in the xls. It allows for …☆11Oct 17, 2025Updated 5 months ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- YunoHost DynDNS Server☆13Updated this week
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- Automatically installs and configures XFCE, XRDP and variables for a one-script setup☆14Apr 14, 2021Updated 4 years ago
- ☆28Updated this week
- ☆10Jun 15, 2024Updated last year
- unofficial grok api library☆19Mar 27, 2025Updated 11 months ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Jun 16, 2022Updated 3 years ago
- Code base for "A General Contextualized Rewriting Framework for Text Summarization"☆13Jul 17, 2022Updated 3 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆15Apr 21, 2025Updated 10 months ago
- https://interactivetraining.ai/☆17Oct 2, 2025Updated 5 months ago
- ☆32Jun 21, 2024Updated last year
- ☆12Jan 20, 2024Updated 2 years ago
- MCP server empowering AI assistants with real-world capabilities: Gmail, Calendar, Tasks, Git integration, and note management. Bridges A…☆12Jun 28, 2025Updated 8 months ago
- ☆13Apr 17, 2018Updated 7 years ago
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 5 months ago
- Contains the code for my Imperial College London Master's thesis on text summarization☆11Oct 25, 2022Updated 3 years ago
- ☆19Feb 13, 2025Updated last year
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated 11 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- AI-based job search in Python☆13Jan 25, 2021Updated 5 years ago
- ☆14Sep 7, 2022Updated 3 years ago
- The core MCP extension for Systemprompt MCP multimodal client☆14Feb 19, 2025Updated last year
- 中国科学院大学(国科大)研一课程☆18May 24, 2023Updated 2 years ago
- IDA Claude Code Plugins☆43Updated this week
- ☆11Mar 8, 2022Updated 4 years ago
- A simple Docker sandbox example and a ready-to-use autograder API. Based on asynchronous FastAPI and disposable Docker containers. Three …☆14Jan 10, 2022Updated 4 years ago
- ☆16Dec 14, 2022Updated 3 years ago