☆97Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for study_rlhf
Users that are interested in study_rlhf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WebResearcher: An Iterative Deep-Research Agent,迭代式深度研究智能体☆48Feb 13, 2026Updated 2 months ago
- ☆136Mar 18, 2026Updated 3 weeks ago
- A Foundation Language Model For Multilayer Regulation of RNA☆21Nov 30, 2025Updated 4 months ago
- 杭高院自然语言处理课程2023☆26Nov 22, 2023Updated 2 years ago
- 此项目创建的初衷是为了帮助人工智能、自然语言处理和大语言模型相关背景的同学找工作使用,欢迎加入项目的建设和维护☆18Mar 30, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 7 months ago
- The supplementary material for the paper "Fine-tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code R…☆16Aug 12, 2024Updated last year
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆11Jul 9, 2025Updated 9 months ago
- Documentation at☆14Mar 27, 2025Updated last year
- 武汉大学国家网安院软件安全☆16Dec 9, 2024Updated last year
- ☆17Jun 10, 2025Updated 10 months ago
- Lightning-responsive CosyVoice streaming API based on FastAPI.☆28Mar 23, 2026Updated 3 weeks ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Feb 28, 2024Updated 2 years ago
- 大学Latex答辩模版,当前包含川大、哈工大、中科大。☆10Jul 22, 2024Updated last year
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆17Jul 27, 2024Updated last year
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆12Jun 19, 2025Updated 9 months ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- WeKnora‑pro是基于原始 WeKnora 的二次开发版本,核心在于提升文档解析能力。 主要改进:1. 支持扫描件通过 (CPU/GPU 自动优化)进行 OCR 与表格提取;且兼容WeKnora多模态增加 2. 文档大小上限提升至 300 MB☆46Oct 29, 2025Updated 5 months ago
- 《多模态大模型部署微调指南》快速部署/微调多模态大模型☆12Dec 4, 2024Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆13Mar 24, 2025Updated last year
- ☆15Apr 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆25Oct 3, 2025Updated 6 months ago
- Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"☆21Dec 22, 2025Updated 3 months ago
- ☆34Jul 8, 2025Updated 9 months ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated last year
- [ICML 2025] Official Implementation of GLIDER☆74Oct 9, 2025Updated 6 months ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 7 months ago
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 3 months ago
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆47Feb 15, 2026Updated 2 months ago
- An LLM training framework built from the ground up, featuring a custom BumbleBee architecture and end-to-end support for multiple open-so…☆66Feb 9, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- The reproduction of the paper "Robust Attention for Contextual Biased Visual Recognition" ICLR2023.☆12Feb 23, 2024Updated 2 years ago
- The code repository for the paper "A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Cha…☆30Jun 2, 2025Updated 10 months ago
- ☆20May 14, 2025Updated 11 months ago
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)☆37Dec 9, 2025Updated 4 months ago
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated last year
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 7 months ago