hooman650 / MedQwenReasoner
A simple tutorial to add medical reasoning using GRPO
☆16Updated last month
Alternatives and similar repositories for MedQwenReasoner:
Users that are interested in MedQwenReasoner are comparing it to the libraries listed below
- Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".☆27Updated last year
- Stores the saved models of the UvA-DLC notebooks☆24Updated 2 years ago
- Extending Conformal Prediction to LLMs☆64Updated 9 months ago
- Public repository holding examples for dataheroes library☆22Updated 4 months ago
- Feature Selection using Simulated Annealing☆11Updated 2 years ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated last month
- LLM-guided hyperparameter tuning☆10Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆25Updated 4 months ago
- ☆19Updated last month
- Visual Clustering: Clustering Plotted Data by Image Segmentation☆24Updated last month
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆27Updated 3 years ago
- The LangChain Crash Course Repository is a concise and comprehensive collection of learning materials for the LangChain programming langu…☆21Updated last year
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆18Updated 11 months ago
- Code for Stop&Hop, a method for learning to classify irregularly-sampled time series early☆18Updated 5 months ago
- This repo will be an effort to learn and implement some of the milestone papers and models in Deep Learning based language models.☆11Updated 2 years ago
- Unsupervised Domain Adaptation for Time Series Classification☆29Updated last year
- ☆19Updated this week
- Tutorial on time-series forcasting with scikit-learn☆33Updated 2 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆11Updated 9 months ago
- Code repo for KDD'22 paper : 'RES: A Robust Framework for Guiding Visual Explanation'☆32Updated 2 years ago
- Causal Agent based on Large Language Model☆42Updated 7 months ago
- Counterfactual SHAP: a framework for counterfactual feature importance☆18Updated last year
- Implementation of MLP (python) and CNN (PyTorch) with Information Plane visualization.☆13Updated 7 years ago
- A regularized self-labeling approach to improve the generalization and robustness of fine-tuned models☆28Updated 2 years ago
- Using ChatGPT to build a Kedro ML pipeline and Streamlit frontend☆30Updated 2 years ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆35Updated last year
- ☆10Updated last year
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- Deep Critical Learning. Implementation of ProSelfLC, IMAE, DM, etc.☆31Updated 2 years ago
- Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding☆21Updated 2 years ago