Qwen GRPO Graph Extraction RL Finetune
☆60Apr 2, 2025Updated 11 months ago
Alternatives and similar repositories for grpo-graph-extraction
Users who are interested in grpo-graph-extraction compare it to the libraries listed below
- NebulaGraph Desktop version on Windows and macOS☆27Feb 28, 2025Updated last year
- ☆10Dec 19, 2025Updated 2 months ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 10 months ago
- ☆25Oct 28, 2024Updated last year
- Test Environment Booking tool☆14Nov 16, 2020Updated 5 years ago
- ☆27Aug 27, 2025Updated 6 months ago
- Model Context Protocol Server for NebulaGraph 3.x☆27Mar 17, 2025Updated 11 months ago
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company.☆66Feb 20, 2026Updated 2 weeks ago
- dqn autoplay mario bros☆21Jul 24, 2017Updated 8 years ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Jan 4, 2024Updated 2 years ago
- A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using …☆58Nov 14, 2025Updated 3 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Supports BM25 + vector☆29May 26, 2025Updated 9 months ago
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆16Jan 7, 2026Updated 2 months ago
- Trading algorithm for Bitcoins in USD on quantconnect.com☆13Jan 12, 2018Updated 8 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Dec 6, 2023Updated 2 years ago
- NebulaGraph DGL(Deep Graph Library) Integration Package. (WIP)☆38Mar 14, 2024Updated last year
- New York Times Scraper☆11Feb 19, 2024Updated 2 years ago
- Deploying a stripped-down version of YOLOv5 on HiSilicon devices☆13Nov 22, 2021Updated 4 years ago
- RLCar Gazebo v2☆12Jun 28, 2024Updated last year
- Code exercises for the mathematical principles of reinforcement learning☆19Apr 17, 2024Updated last year
- This repository is a reimplementation of the paper (BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model: htt…☆11Nov 14, 2019Updated 6 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- A tool that makes it easy to run Intel OpenVINO demos and samples.☆11Jan 31, 2023Updated 3 years ago
- RDF Community Discussions. Ask anything here!☆13Apr 11, 2024Updated last year
- Applescripts for controlling Spotify☆23Oct 20, 2016Updated 9 years ago
- Code for the experiments in the ACL 2020 paper "Estimating predictive uncertainty for rumour verification models"☆11May 15, 2020Updated 5 years ago
- Final Project of ME5413 Autonomous Mobile Robotics @ NUS☆10Oct 13, 2023Updated 2 years ago
- Optimized Generative Adversarial Network with Graph Convolutional Networks for Novel Molecule Design☆12Jan 2, 2024Updated 2 years ago
- Autonomous navigation simulation of an agricultural robot during soil fertilization in open fields using ROS and Gazebo.☆10Apr 8, 2025Updated 11 months ago
- ☆11Aug 9, 2018Updated 7 years ago
- A Kivy tutorial for PyOhio 2013☆14Apr 30, 2014Updated 11 years ago
- NLP on Korean news articles. Automatic topic extraction through dynamic clustering.☆12Sep 15, 2017Updated 8 years ago
- Conversational Retrieval Evaluation Dataset☆101Aug 19, 2025Updated 6 months ago
- Deep Learning Visualization Tools Using PyTorch☆11Feb 2, 2021Updated 5 years ago
- Centralized AI agent skills for Obsidian plugin and theme development.☆36Updated this week
- Code for the blog post on GAN stability☆10Oct 8, 2016Updated 9 years ago
- CuteRest is a REST client tool dedicated to JSON☆11Dec 12, 2023Updated 2 years ago