We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.
☆267 · Mar 18, 2026 · Updated this week
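The description above frames taste learning as preference modeling. One common way to express a pairwise preference objective is a Bradley-Terry loss, sketched below. This is a minimal illustration under stated assumptions, not the repository's implementation: the function names, the scalar scores, and the numeric "community signal" (for example, a vote or citation count) are all hypothetical.

```python
import math


def bradley_terry_prob(score_a: float, score_b: float) -> float:
    """Probability that item a is preferred over item b: sigmoid(score_a - score_b)."""
    return 1.0 / (1.0 + math.exp(-(score_a - score_b)))


def pairwise_loss(score_preferred: float, score_rejected: float) -> float:
    """Negative log-likelihood of the observed preference under Bradley-Terry."""
    return -math.log(bradley_terry_prob(score_preferred, score_rejected))


def community_pairs(items):
    """Turn (name, signal) tuples into (preferred, rejected) name pairs.

    The signal is a hypothetical community measure; ties produce no pair.
    """
    pairs = []
    for i, (name_a, signal_a) in enumerate(items):
        for name_b, signal_b in items[i + 1:]:
            if signal_a > signal_b:
                pairs.append((name_a, name_b))
            elif signal_b > signal_a:
                pairs.append((name_b, name_a))
    return pairs
```

In this sketch, a reward model would be trained to assign scores that minimize `pairwise_loss` over the pairs returned by `community_pairs`; how the actual project builds pairs and scores is not stated in the description.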
Alternatives and similar repositories for AI-Can-Learn-Scientific-Taste
Users who are interested in AI-Can-Learn-Scientific-Taste are comparing it to the libraries listed below.
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S… ☆283 · Feb 21, 2026 · Updated 3 weeks ago
- [IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames ☆11 · Jun 1, 2025 · Updated 9 months ago
- A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical … ☆58 · Sep 1, 2025 · Updated 6 months ago
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping… ☆91 · Jan 29, 2026 · Updated last month
- An unofficial Fudan slide theme for Typst. ☆16 · Mar 19, 2024 · Updated 2 years ago
- multicast learning in network programming course ☆10 · Oct 30, 2020 · Updated 5 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models" ☆20 · Oct 17, 2024 · Updated last year
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 7 months ago
- Substrate TypeScript SDK ☆10 · Sep 20, 2024 · Updated last year
- [AAMAS 2026] Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization. https://blind-vla-paper.github.io ☆61 · Jan 25, 2026 · Updated last month
- PaiNN in jax ☆11 · Jan 14, 2025 · Updated last year
- Repository for "Training Language Models To Explain Their Own Computations" ☆21 · Dec 22, 2025 · Updated 2 months ago
- ☆36 · Dec 13, 2023 · Updated 2 years ago
- ☆24 · Apr 29, 2025 · Updated 10 months ago
- 💬 MCP server for sending notifications to Weixin, Telegram, Bark, Lark, Feishu, and DingTalk ☆31 · Feb 24, 2026 · Updated 3 weeks ago
- [ICML 2022] Learning Efficient and Robust Ordinary Differential Equations via Invertible Neural Networks ☆10 · Apr 14, 2023 · Updated 2 years ago
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma… ☆24 · Feb 25, 2026 · Updated 3 weeks ago
- Automatically summarize lectures and ask questions about the course material ☆13 · Apr 16, 2024 · Updated last year
- [NeurIPS '23] Official code of "A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space" ☆13 · Jul 13, 2025 · Updated 8 months ago
- ☆10 · Apr 12, 2024 · Updated last year
- Contributed and additional nodes for maize ☆21 · Feb 18, 2026 · Updated last month
- Codebase of the paper "Aligning Protein Conformation Ensemble Generation with Physical Feedback" (ICML 2025) ☆16 · Jul 6, 2025 · Updated 8 months ago
- ☆10 · Jun 7, 2022 · Updated 3 years ago
- ☆12 · May 30, 2025 · Updated 9 months ago
- Several classic case studies in AI, ML, DA, DV, and related areas, including traffic jam simulation (NagelSchreckenberg), the Monte Carlo queuing problem, face recognition (RecognitionFace), and genetic-algorithm image inference (IconGenetic) ☆10 · Oct 14, 2018 · Updated 7 years ago
- Outbound Phone GPT is a sophisticated prototype for a context-aware agent designed to autonomously handle outbound phone calls. ☆17 · Apr 3, 2024 · Updated last year
- ☆15 · Nov 20, 2023 · Updated 2 years ago
- ☆14 · Dec 2, 2024 · Updated last year
- ☆14 · Apr 16, 2025 · Updated 11 months ago
- Library for computing anisotropy extension to SOAP descriptors ☆11 · Mar 13, 2026 · Updated last week
- ☆13 · Feb 12, 2018 · Updated 8 years ago
- ☆65 · Jan 26, 2026 · Updated last month
- ☆19 · Apr 5, 2024 · Updated last year
- ☆21 · May 29, 2023 · Updated 2 years ago
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa… ☆27 · Jan 27, 2025 · Updated last year
- Scaling Agentic Environments Automatically. ☆54 · Jan 22, 2026 · Updated last month
- ☆28 · Apr 8, 2025 · Updated 11 months ago
- This repository contains the main scripts for local linear segmentation and subsequent analysis of the resulting model space ☆15 · Oct 8, 2020 · Updated 5 years ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆258 · Apr 24, 2025 · Updated 10 months ago