VIOLINARTHUR / HKU-DASC7606-A1
☆26Updated 6 months ago
Alternatives and similar repositories for HKU-DASC7606-A1:
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
- ☆16Updated 6 months ago
- ☆13Updated 4 months ago
- ICLR 2025 Agent-Related Papers☆59Updated 4 months ago
- ☆331Updated this week
- Awesome RL-based LLM Reasoning☆406Updated this week
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆122Updated 2 weeks ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆11Updated last week
- Awesome RL Reasoning Recipes ("Triple R")☆146Updated this week
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey☆413Updated this week
- MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning☆500Updated this week
- The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space""☆165Updated 3 weeks ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆572Updated this week
- ☆20Updated 11 months ago
- A Survey on Efficient Reasoning for LLMs☆281Updated this week
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆332Updated 3 months ago
- Agent from scratch (memory, reflection, planning, tooling, etc.)☆33Updated last month
- Survey on LLM Agents (Published on CoLing 2025)☆200Updated 3 weeks ago
- ☆215Updated this week
- ☆76Updated 7 months ago
- ☆512Updated 3 months ago
- ☆99Updated 2 weeks ago
- ☆165Updated this week
- The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usa…☆23Updated 3 weeks ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆29Updated last month
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆121Updated last week
- A collection of resources that investigate social agents.☆139Updated 3 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆152Updated this week
- R1-onevision, a visual language model capable of deep CoT reasoning.☆488Updated 2 weeks ago
- Agentic Workflow - Daily Track on Arxiv.org Paper☆43Updated last month
- Demo code for the paper "Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up."☆3Updated last week