AaronLuo00 / phd-survival-guideLinks
This guide shares practical lessons from a fellow PhD student, helping you navigate your research journey smoothly and focus on what truly matters.
☆169Updated this week
Alternatives and similar repositories for phd-survival-guide
Users that are interested in phd-survival-guide are comparing it to the libraries listed below
Sorting:
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆329Updated last month
- Idea2Paper Offical Demo☆227Updated this week
- AI for Science 论文解读合集(持续更新ing),论文/数据集/教程下载:hyper.ai☆3,083Updated 10 months ago
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey☆359Updated 3 months ago
- [ICLR'26] Official code of paper "d2Cache: Accelerating Diffusion-based LLMs via Dual Adaptive Caching"☆84Updated last week
- Deciphering Oracle Bone Language with Diffusion Models (ACL 2024 Best Paper)☆225Updated 4 months ago
- Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/g…☆2,069Updated this week
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆562Updated last month
- Turn paper/text/topic into editable research figures, technical route diagrams, and presentation slides.☆1,246Updated this week
- code based for rectified flow☆274Updated 2 months ago
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,953Updated 2 months ago
- [ACM CSUR 2025] Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances☆162Updated last month
- [NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents☆109Updated 2 months ago
- A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding☆570Updated 2 months ago
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆841Updated 2 months ago
- Mega Scale Multimodal DataPipeline for SOTA models☆38Updated this week
- Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model☆934Updated last month
- This is a public version of LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision☆164Updated 2 months ago
- Official code of Motus: A Unified Latent Action World Model☆616Updated 3 weeks ago
- Explain Before You Answer: A Survey on Compositional Visual Reasoning☆306Updated 3 months ago
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆75Updated last year
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆757Updated 5 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆235Updated 7 months ago
- https://hcv.boyuai.com☆573Updated last year
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆101Updated 6 months ago
- iBKH: The integrative Biomedical Knowledge Hub☆513Updated 2 weeks ago
- Official repository of DARE: dLLM Alignment and Reinforcement Executor☆159Updated this week
- [Neurocomputing] Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation☆22Updated last month
- RoboTwin 2.0 Offical Repo☆1,882Updated last week
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆706Updated 2 weeks ago