We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.
☆386 · Mar 29, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for AI-Can-Learn-Scientific-Taste
Users that are interested in AI-Can-Learn-Scientific-Taste are comparing it to the libraries listed below.
- An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability. ☆37 · Oct 6, 2025 · Updated 6 months ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- [NeurIPS 2024] Can Language Models Learn to Skip Steps? ☆22 · Jan 25, 2025 · Updated last year
- ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control ☆107 · Updated this week
- Automated bash script to set up a high-performance environment on Ubuntu Linux with RTX5090, including installations of PyTorch, Unsloth,… ☆19 · Apr 1, 2025 · Updated last year
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping… ☆92 · Jan 29, 2026 · Updated 2 months ago
- ☆117 · Mar 16, 2026 · Updated 3 weeks ago
- MOSS-Speech is a true speech-to-speech large language model without text guidance. ☆128 · Feb 13, 2026 · Updated last month
- The code for “PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search” ☆19 · Mar 13, 2024 · Updated 2 years ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model ☆18 · Feb 24, 2025 · Updated last year
- MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding. ☆158 · Updated this week
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 · Jul 27, 2024 · Updated last year
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24. ☆12 · Nov 27, 2024 · Updated last year
- ☆23 · May 25, 2023 · Updated 2 years ago
- Java interview summary ☆19 · May 11, 2020 · Updated 5 years ago
- We introduce CausalVQA, a benchmark dataset for video question answering (VQA) composed of question-answer pairs that probe models’ under… ☆59 · Aug 18, 2025 · Updated 7 months ago
- Chinese large language model evaluation, third round ☆36 · Mar 22, 2026 · Updated 3 weeks ago
- ☆31 · Oct 23, 2024 · Updated last year
- Extend bert-nmt to context-aware translation. ☆11 · May 24, 2021 · Updated 4 years ago
- ☆72 · Apr 1, 2026 · Updated last week
- Toolathlon-Gym for testing AI agents' real-world tool-use capabilities across diverse MCP servers. ☆110 · Apr 2, 2026 · Updated last week
- Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI" ☆13 · Jan 19, 2024 · Updated 2 years ago
- ☆15 · Nov 18, 2025 · Updated 4 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models" ☆20 · Oct 17, 2024 · Updated last year
- Substrate TypeScript SDK ☆10 · Sep 20, 2024 · Updated last year
- PaiNN in jax ☆11 · Jan 14, 2025 · Updated last year
- The OBMO module embedded in PatchNet ☆10 · Feb 21, 2024 · Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations" ☆21 · Dec 22, 2025 · Updated 3 months ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM ☆20 · May 22, 2025 · Updated 10 months ago
- ☆12 · Nov 28, 2022 · Updated 3 years ago
- ☆24 · Apr 29, 2025 · Updated 11 months ago
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma… ☆26 · Feb 25, 2026 · Updated last month
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval ☆24 · Jun 28, 2025 · Updated 9 months ago
- Automatically summarize lectures and ask questions about the course material ☆13 · Apr 16, 2024 · Updated last year
- ☆10 · Apr 12, 2024 · Updated 2 years ago
- TPU + GPU: a sentiment analysis project based on netizens' Weibo comments during the pandemic ☆10 · Jul 17, 2024 · Updated last year
- Contributed and additional nodes for maize ☆21 · Feb 18, 2026 · Updated last month
- Code for "3D Instance Segmentation via Multi-Task Metric Learning" ☆11 · Sep 22, 2020 · Updated 5 years ago