wenzhaoabc/llm-tap-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wenzhaoabc/llm-tap-rl)

wenzhaoabc / llm-tap-rl

Reinforcement Learning for LLM

☆38

Alternatives and similar repositories for llm-tap-rl

Users that are interested in llm-tap-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shengtaovvv / Dialogue
View on GitHub
本项目由三个模块构成。意图识别：判断用户的意图是业务型还是闲聊型；模型检索：该部分构建一个语料库，当用户发起新的query（通过意图识别判断为业务型对话）时，为用户匹配query检索的最佳response，使用HSWN进行召回（粗排），然后构建句子的相似度，并利用Lig…
☆12Feb 18, 2021Updated 5 years ago
liunian-Jay / AgenticRAG-RL
View on GitHub
A minimal implementation of Agentic RAG using GRPO
☆17Jun 11, 2025Updated last year
StonyBrookNLP / tellmewhy
View on GitHub
Website for release of TellMeWhy dataset for why question answering
☆14Nov 11, 2022Updated 3 years ago
Intelligent-Microsystems-Lab / QuantizedSNNs
View on GitHub
This repository contains the models and training scripts used in the papers: "Quantizing Spiking Neural Networks with Integers" (ICONS 20…
☆13Oct 20, 2020Updated 5 years ago
Yesterday17 / go-drcom-jlu
View on GitHub
JLU drcom client written in golang.
☆12Sep 4, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
nengo / nengo-fpga
View on GitHub
Nengo extension to connect to FPGAs
☆18Mar 18, 2025Updated last year
mufeiteng / CausalEmbedding
View on GitHub
☆10Oct 20, 2020Updated 5 years ago
yakt00 / IRGen
View on GitHub
☆26Jun 9, 2023Updated 3 years ago
chridey / altlex
View on GitHub
☆11Apr 4, 2018Updated 8 years ago
AndrzejKucik / SNN4Space
View on GitHub
ANN to SNN conversion on land cover and land use classification problem for increased energy efficiency.
☆14Feb 8, 2022Updated 4 years ago
valar1234 / SDAI
View on GitHub
An Synthesizable Deep Learning Library based on Xilinx High Level Synthesis(HLS) tool
☆16Feb 20, 2017Updated 9 years ago
yt-koike / dify-cron
View on GitHub
Regular self-call plugin for Dify on self-hosted servers and cloud.dify.ai
☆16Mar 18, 2026Updated 4 months ago
redsk / neo_concept
View on GitHub
ConceptNet to neo4j 2.2
☆10Nov 6, 2015Updated 10 years ago
maltanar / fpga-booleanring-bfs
View on GitHub
Hybrid BFS on Xilinx Zynq
☆18Jun 9, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fastmachinelearning / qonnx_model_zoo
View on GitHub
Model zoo for the Quantized ONNX (QONNX) model format
☆15Oct 31, 2025Updated 8 months ago
duncanka / Causeway
View on GitHub
Tagger for explicit cause-and-effect relationships in text
☆11Jan 8, 2020Updated 6 years ago
qiangning / TemporalCausalReasoning
View on GitHub
☆16Jan 8, 2020Updated 6 years ago
jadfi / relation-extraction
View on GitHub
对论文Neural Relation Extraction with Selective Attention over Instances的改进
☆12Aug 17, 2018Updated 7 years ago
sshleifer / backtranslated-imdb
View on GitHub
Backtranslations of IMDB movie reviews for Data Augmentation Purposes
☆10Apr 1, 2019Updated 7 years ago
ict-bigdatalab / CorpusBrain
View on GitHub
CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks
☆34Aug 31, 2022Updated 3 years ago
LifangD / DSGAN
View on GitHub
Implementation of DSGAN (not fully completed)
☆13Dec 28, 2019Updated 6 years ago
allanchen95 / CODE
View on GitHub
AAAI'22-"CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking."
☆12Apr 12, 2021Updated 5 years ago
Vitorian / awesome-mpsoc
View on GitHub
Public resources available for Xilinx MPSOC+ and SDSOC hardware
☆18May 26, 2017Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
KMnO4-zx / blog
View on GitHub
项目的issue会存放我的所有blog
☆21Sep 12, 2025Updated 10 months ago
talhasarit / agentic-prd-generation
View on GitHub
An AI-powered platform that uses an agentic workflow to automatically generate Project Requirement Documents (PRDs).
☆16Mar 10, 2026Updated 4 months ago
enyac-group / NeuralPower
View on GitHub
The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks
☆24Jul 10, 2019Updated 7 years ago
jgoeders / dac_sdc_2021_designs
View on GitHub
☆19Mar 16, 2022Updated 4 years ago
CQiang27 / Spark_Python
View on GitHub
Spark—Python学习笔记
☆11Sep 25, 2018Updated 7 years ago
cuiwang / DatasMark
View on GitHub
离线版中文标注工具，支持NER、文本分类、关系标注、对话标注等。
☆14Jul 29, 2022Updated 3 years ago
nasiryahm / SNNSimulatorComparison
View on GitHub
Comparison of Spiking Neural Network Simulator Performance
☆21Oct 12, 2019Updated 6 years ago
ankane / safetensors-ruby
View on GitHub
Simple, safe way to store and distribute tensors
☆15Jun 29, 2026Updated 3 weeks ago
GohUnTsuan / JLU-Beamer-Theme
View on GitHub
A LaTeX beamer theme template for Jilin University students. 吉林大学beamer模板.
☆18May 12, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
DanielGerlinghoff / radix-encoding
View on GitHub
Framework for radix encoded SNN on FPGA
☆18Dec 7, 2021Updated 4 years ago
ankane / neighbor-s3
View on GitHub
Nearest neighbor search for Ruby and S3 Vectors
☆14Apr 9, 2026Updated 3 months ago
ModelTC / LPCV2021_Winner_Solution
View on GitHub
☆28Nov 5, 2021Updated 4 years ago
muzhiyun / xsyudogcom
View on GitHub
西安石油大学哆点破解路由器限制脚本及相关资源感谢开源项目drcom-generic
☆11Jun 18, 2018Updated 8 years ago
leoluopy / autotvm_tutorial
View on GitHub
autoTVM神经网络推理代码优化搜索演示，基于tvm编译开源模型centerface，并使用autoTVM搜索最优推理代码，　最终部署编译为c++代码，演示平台是cuda，可以是其他平台，例如树莓派，安卓手机，苹果手机．Thi is a demonstration of …
☆31May 6, 2021Updated 5 years ago
WenRichard / ELMO-NLP
View on GitHub
ELMO在QA问答，文本分类等NLP上面的应用
☆15Apr 13, 2019Updated 7 years ago
ankane / torchdata-ruby
View on GitHub
Composable data loading for Ruby
☆13Apr 8, 2026Updated 3 months ago