☆41 · Dec 7, 2025 · Updated 3 months ago
Alternatives and similar repositories for verl-community
Users that are interested in verl-community are comparing it to the libraries listed below
- gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 4 months ago
- Code for the experiments in the ACL 2020 paper "Estimating predictive uncertainty for rumour verification models" ☆11 · May 15, 2020 · Updated 5 years ago
- This repository is a reimplementation of the paper (BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model: htt… ☆11 · Nov 14, 2019 · Updated 6 years ago
- Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EM… ☆38 · Jul 14, 2022 · Updated 3 years ago
- IIRC baseline ☆10 · Jan 13, 2021 · Updated 5 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" ☆16 · Nov 20, 2024 · Updated last year
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025) ☆12 · Mar 6, 2025 · Updated last year
- CommonsenseQA ☆10 · Mar 28, 2020 · Updated 5 years ago
- Engineering Blog article prototypes ☆17 · Oct 12, 2025 · Updated 4 months ago
- Awesome Multimodal Fusion in Speech Emotion Recognition ☆13 · Nov 11, 2025 · Updated 3 months ago
- Large-scale text embedding model ☆38 · Sep 6, 2025 · Updated 6 months ago
- The GPT-4 function calls used in everchanging quest for the HF game jam ☆10 · Jul 9, 2023 · Updated 2 years ago
- ☆11 · Jun 11, 2021 · Updated 4 years ago
- Wave - The Software as a Service Starter Kit, designed to help you build the SAAS of your dreams ☆12 · Jan 30, 2026 · Updated last month
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆25 · May 31, 2025 · Updated 9 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆13 · Mar 30, 2024 · Updated last year
- Risky Object Localization (ROL) in a Driving Scene Dataset ☆15 · Dec 24, 2023 · Updated 2 years ago
- ☆14 · Apr 23, 2025 · Updated 10 months ago
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin… ☆65 · Feb 18, 2026 · Updated 2 weeks ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆125 · Jun 11, 2025 · Updated 8 months ago
- lanmt ebm ☆12 · Jun 19, 2020 · Updated 5 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning. ☆15 · Nov 7, 2022 · Updated 3 years ago
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks ☆14 · Feb 23, 2023 · Updated 3 years ago
- Deploy docs from your source tree to a GitHub wiki ☆13 · Jun 14, 2023 · Updated 2 years ago
- ☆14 · May 9, 2024 · Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆53 · Dec 3, 2024 · Updated last year
- ☆11 · Aug 20, 2019 · Updated 6 years ago
- ☆14 · Nov 2, 2024 · Updated last year
- ☆11 · Oct 16, 2023 · Updated 2 years ago
- ☆27 · Jan 4, 2026 · Updated 2 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆54 · Sep 3, 2024 · Updated last year
- ☆15 · Sep 28, 2020 · Updated 5 years ago
- ☆12 · Mar 18, 2021 · Updated 4 years ago
- ☆11 · May 11, 2022 · Updated 3 years ago
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction ☆17 · Nov 9, 2024 · Updated last year
- Code for the paper "Critical Thinking for Language Models" ☆12 · Jun 1, 2021 · Updated 4 years ago
- Code for "Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation" ☆13 · Jul 10, 2020 · Updated 5 years ago
- A demonstration project to show the layout for a Python package ☆14 · May 5, 2023 · Updated 2 years ago
- This repo has scripts to compare various powerful RL methods ☆39 · Feb 23, 2026 · Updated 2 weeks ago