Jiadong001 / DS2024-Course-Project
USTC - Fundamentals of Data Science 2024 - Course Practice Project
☆9Updated 7 months ago
Alternatives and similar repositories for DS2024-Course-Project
Users who are interested in DS2024-Course-Project are comparing it to the repositories listed below
- ☆23Updated 2 months ago
- USTC Spring 2021 Introduction to Deep Learning labs: FNN, CNN, RNN, LSTM, BERT, GCN☆29Updated 3 years ago
- ☆13Updated 2 months ago
- USTC Fall 2020 Introduction to Machine Learning course labs: LR, SVM, XGBoost, KMeans, LDA.☆8Updated 4 years ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆69Updated 4 months ago
- Yelp Simulator for WWW'25 AgentSociety Challenge☆80Updated last month
- USTC Fall 2021 Operations Research course resources☆8Updated 3 years ago
- ☆52Updated last week
- ☆131Updated 3 weeks ago
- Paper List of Inference/Test Time Scaling/Computing☆246Updated this week
- A research repo for experiments about Reinforcement Finetuning☆47Updated 2 months ago
- The official implementation of Natural Language Fine-Tuning☆50Updated 4 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆73Updated this week
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆82Updated 2 months ago
- My notebook of CS learning.☆66Updated last month
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆97Updated this week
- An up-to-date curated list of Retrieval-Augmented Generation (RAG) for Large Language Models (LLMs).☆77Updated this week
- Interview experiences shared from the Nanjing University School of Artificial Intelligence undergraduate open day☆28Updated 2 weeks ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆183Updated last month
- ☆62Updated 2 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆203Updated 6 months ago
- Description for MV-MATH☆12Updated 2 months ago
- Awesome Agent Training☆141Updated this week
- ☆50Updated 4 months ago
- llm & rl☆139Updated this week
- Sharing my research toolchain☆83Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆21Updated last week
- RFTT: Reasoning with Reinforced Functional Token Tuning☆27Updated 2 months ago
- CycleResearcher: Improving Automated Research via Automated Review☆185Updated 2 weeks ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆339Updated last year