anakin87 / qwen-scheduler-grpo
Train a Language Model with GRPO to create a schedule from a list of events and priorities
☆84 Updated last week
Alternatives and similar repositories for qwen-scheduler-grpo:
Users who are interested in qwen-scheduler-grpo compare it to the libraries listed below.
- Countdown Game Distill&RL ☆43 Updated last week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s… ☆79 Updated 4 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts ☆44 Updated 3 months ago
- Qwen GRPO Graph Extraction RL Finetune ☆46 Updated last month
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click. ☆78 Updated last month
- ☆76 Updated 3 weeks ago
- ☆51 Updated 2 months ago
- ☆86 Updated last month
- https://no-ocr.com/about ☆119 Updated 3 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI! ☆32 Updated last month
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors ☆73 Updated last week
- Jina DeepSearch UI ☆101 Updated this week
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more. ☆251 Updated this week
- Support BM25 + vector ☆26 Updated 5 months ago
- Fetch arxiv data to LLM-friendly text ☆116 Updated 2 months ago
- ☆29 Updated last week
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs. ☆73 Updated 3 weeks ago
- ☆141 Updated 2 months ago
- LLM-as-SERP ☆64 Updated 2 months ago
- Full list of LLM API with Internet Access ☆70 Updated 2 months ago
- Chat with any website on your local machine ☆72 Updated 10 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization" ☆269 Updated 3 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis. ☆72 Updated 3 weeks ago
- ☆144 Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆28 Updated last week
- Learn about the fundamentals of LangGraph through a series of notebooks ☆95 Updated this week
- ☆57 Updated 2 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods. ☆128 Updated last week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel… ☆117 Updated 2 months ago
- CursorCore: Assist Programming through Aligning Anything ☆121 Updated 2 months ago