tongjingqi/AI-Can-Learn-Scientific-Taste

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tongjingqi/AI-Can-Learn-Scientific-Taste)

tongjingqi / AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.

☆425

Alternatives and similar repositories for AI-Can-Learn-Scientific-Taste

Users that are interested in AI-Can-Learn-Scientific-Taste are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tongjingqi / Thinking-with-Video
View on GitHub
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆315Jun 21, 2026Updated last month
EnigmaYYYY / SocialClaw
View on GitHub
SocialClaw is a screen-aware social copilot that watches live chat windows, builds personalized memory and profile context, and suggests …
☆40Apr 9, 2026Updated 3 months ago
Linxi000 / MEDS
View on GitHub
☆142Jun 24, 2026Updated last month
OpenMOSS / MOSS-VL
View on GitHub
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆398Updated this week
tianyilt / qzcli_tool
View on GitHub
启智平台任务管理 CLI：资源查询、任务提交、日志查看和 MCP/agent workflow
☆109Jul 17, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
netokeep / netokeep
View on GitHub
Create SSH and TCP Proxy to your company container.
☆29Jun 10, 2026Updated last month
tongjingqi / Awesome-Agent-RL
View on GitHub
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical …
☆60Sep 1, 2025Updated 10 months ago
tongjingqi / Game-RL
View on GitHub
Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
☆157Jul 18, 2026Updated last week
realZillionX / InspireSkill
View on GitHub
启智平台（qz.sii.edu.cn）的 Agent 驾驶舱：Skill + CLI，一条命令直达。Agent cockpit for the Inspire ML platform — one command, every operation, straight from…
☆182Updated this week
KnowledgeXLab / skill-git
View on GitHub
Supercharge your AI agents by versioning, tracking, and merging overlapping skills.
☆40Apr 9, 2026Updated 3 months ago
sxswz213 / DeepSlides
View on GitHub
☆28Apr 9, 2026Updated 3 months ago
sqs-ustc / tool-reasoning-framework-PTE
View on GitHub
☆38Jan 1, 2026Updated 6 months ago
OpenMOSS / FutureOmni
View on GitHub
☆26Jan 22, 2026Updated 6 months ago
LINs-lab / IOMM
View on GitHub
[CVPR 2026] IOMM: Fast Pre-training of Unified Multimodal Models without Text-Image Pairs
☆26Apr 11, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
EmbodiedForge / Inspire-cli
View on GitHub
A tool for better use of Inspire platform (Beta: Codeberg version is more up-to-date)
☆28Apr 2, 2026Updated 3 months ago
euReKa025 / AgentLongBench
View on GitHub
☆21Jan 29, 2026Updated 5 months ago
HYLZ-2019 / SII_Thesis_Template
View on GitHub
☆50Dec 21, 2025Updated 7 months ago
OpenMOSS / BandPO
View on GitHub
Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…
☆49Apr 8, 2026Updated 3 months ago
WhitzardAgent / qitos
View on GitHub
Let's Qitos! A torch-like agent-native framework for researchers.
☆20Updated this week
tongjingqi / MathTrap
View on GitHub
In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…
☆60Mar 15, 2025Updated last year
yxzwang / FamilyTool
View on GitHub
FamilyTool benchmark
☆14Sep 10, 2025Updated 10 months ago
JingYiJun / awesome-inspire
View on GitHub
一个面向启智平台（Inspire）的 awesome list
☆37Mar 29, 2026Updated 3 months ago
OpenMOSS / MOVA
View on GitHub
MOVA: Towards Scalable and Synchronized Video–Audio Generation
☆1,083Jun 18, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Phospheneser / Phospheneser-awesome-academic-template
View on GitHub
An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability.
☆37Oct 6, 2025Updated 9 months ago
OpenMOSS / Sparse-dLLM
View on GitHub
☆29Oct 16, 2025Updated 9 months ago
lcqysl / VideoSSR
View on GitHub
[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"
☆41Nov 11, 2025Updated 8 months ago
OpenMOSS / claude-codex-handoff
View on GitHub
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30Jul 4, 2026Updated 3 weeks ago
sosppxo / mvggt
View on GitHub
[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Ref…
☆128Mar 24, 2026Updated 4 months ago
OpenMOSS / MOSS-Audio-Tokenizer
View on GitHub
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248Jun 16, 2026Updated last month
yuhui1038 / Muse
View on GitHub
ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
☆119Apr 11, 2026Updated 3 months ago
WooooDyy / BAPO
View on GitHub
Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…
☆94Jan 29, 2026Updated 5 months ago
OpenMOSS / rope_pp
View on GitHub
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33Dec 9, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
january-blue / OpenNovelty
View on GitHub
☆135May 12, 2026Updated 2 months ago
ssmisya / PolicyShiftGuard
View on GitHub
PolicyShiftGuard: Benchmarking and Improving Policy-Adaptive Image Guardrails
☆22Jul 8, 2026Updated 2 weeks ago
q1sun / Tutorial-AI4SC-SC4AI
View on GitHub
Where Scientific Computing Meets Artificial Intellegence
☆49Apr 30, 2026Updated 2 months ago
zjunlp / InnoEval
View on GitHub
[ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
☆28Jun 21, 2026Updated last month
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
Zhiyuan-Zeng / RLVE
View on GitHub
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆226Apr 30, 2026Updated 2 months ago
liusida / ica-lens-paper
View on GitHub
ICA Lens: compact ICA-based interpretability tools for exploring LLM activations. Code release for the paper.
☆38Jul 5, 2026Updated 2 weeks ago