linjh1118 / Chinese_Awesome_CV
A Chinese version of Awesome_CV; clone this project to Overleaf to write your own CV with ease
☆12Updated last year
Alternatives and similar repositories for Chinese_Awesome_CV
Users that are interested in Chinese_Awesome_CV are comparing it to the libraries listed below
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆194Updated last month
- MLLM @ Game☆14Updated 3 months ago
- llm & rl☆198Updated last week
- Run TRex with PPO☆39Updated 3 months ago
- ☆369Updated 6 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆141Updated 4 months ago
- ICLR 2025 Agent-Related Papers☆72Updated 9 months ago
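- An efficient, fast arXiv paper crawler that saves paper information for a specified time range, topic, and set of keywords to local storage, and translates the titles and abstracts into Chinese.☆131Updated 11 months ago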
- Extrapolating RLVR to General Domains without Verifiers☆146Updated 2 weeks ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆285Updated last month
- Official Repository of "Learning what reinforcement learning can't"☆64Updated this week
- GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆161Updated 3 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆134Updated last month
- The official implementation of Natural Language Fine-Tuning☆53Updated 7 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆243Updated 2 weeks ago
- [arXiv2505] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆45Updated last month
- ☆104Updated last month
- ☆261Updated last month
- Notes mainly covering multimodal knowledge for large language model (LLM) algorithm (application) engineers☆230Updated last year
- Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition☆31Updated 3 months ago
- Use clash on Linux without sudo privileges☆136Updated 9 months ago
- ☆67Updated 3 weeks ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆48Updated 5 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆27Updated last month
- ☆96Updated 2 months ago
- ☆68Updated 3 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆138Updated 4 months ago
- Agentic Workflow - Daily Track on Arxiv.org Paper☆46Updated 5 months ago
- ☆209Updated last week
- ☆102Updated 11 months ago