romanlee6 / langgroundView external linksLinks
Project page for the NeurIPS 2024 paper, Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication.
☆16Dec 6, 2024Updated last year
Alternatives and similar repositories for langground
Users that are interested in langground are comparing it to the libraries listed below
Sorting:
- A repo of fake committed secrets to test tools that find committed secrets ([dont submit for BB :-) ]☆10Mar 22, 2018Updated 7 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆13Feb 22, 2023Updated 2 years ago
- ☆11Nov 21, 2022Updated 3 years ago
- Simple tool to send the json output from HTTPX to BBRF☆11Mar 30, 2023Updated 2 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- 复现论文代码☆12Jul 27, 2020Updated 5 years ago
- named entity recognition combined with rule from entity dict☆13Aug 25, 2020Updated 5 years ago
- Scripts that I've written that others may find useful☆14Aug 17, 2022Updated 3 years ago
- A parallel scanner that utilises axiom to spin up servers and parallel scan using masscan.☆16Jul 1, 2020Updated 5 years ago
- Python client for the Open eXecution Protocol (OXP)☆17May 16, 2025Updated 8 months ago
- robai☆15Apr 25, 2025Updated 9 months ago
- Language-agnostic workflow builder. Modular code that goes from dev to prod in a minute with principled design decisions.☆13Mar 11, 2024Updated last year
- The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders…☆16Sep 27, 2021Updated 4 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 7 years ago
- This library provides you with an easy way to create and run Hive Agents.☆19Nov 9, 2024Updated last year
- Windows Privesc Check☆20May 20, 2014Updated 11 years ago
- Video short title classification.☆12Dec 6, 2017Updated 8 years ago
- A self-healing internet switch that automatically resets the connection when a failure is detected, ensuring continuous uptime without hu…☆19Oct 8, 2024Updated last year
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 7 years ago
- Feature partitioner by imbalance or correlation (ICLR 2024)☆17Jan 15, 2025Updated last year
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 8 years ago
- Easy subdomain finder from a list of company names, IP ranges or domains.☆15Jan 26, 2021Updated 5 years ago
- This is a repository containing example code for how you can use unit tests to protect against security regression.☆19Jun 26, 2017Updated 8 years ago
- Source code for the Energy-Latency Attacks via Sponge Poisoning paper.☆15Mar 14, 2022Updated 3 years ago
- Tachikoma is a security alerting framework for human beings☆22Sep 7, 2018Updated 7 years ago
- Creates an ACM certificate with DNS validation, creates the validation records directly in Route 53☆21Dec 7, 2022Updated 3 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- Scalable Multi-Agent Reinforcement Learning☆15Dec 25, 2021Updated 4 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- gomoku AI with deep learning and monte carlo tree search☆18Mar 23, 2018Updated 7 years ago
- Tensorflow 1.x solution for chinese NER task, using ALBERT-LSTM-CRF model☆18Apr 19, 2020Updated 5 years ago
- The Paper Artifact Availability☆20Aug 26, 2022Updated 3 years ago
- A simple bash script that uses smbclient to test access to Windows file shares in automated fashion.☆19Jul 9, 2015Updated 10 years ago
- Malleable C2 is a domain specific language to redefine indicators in Beacon's communication. This repository is a collection of Malleable…☆18Feb 17, 2015Updated 10 years ago
- Practical Vertical Federated Learning with Unsupervised Representation Learning (TBD 2022)☆22Feb 21, 2022Updated 3 years ago
- A Common Vulnerability PoC Knowledge Base一个普遍漏洞POC知识库☆24Jun 24, 2023Updated 2 years ago
- Enable users to securely sign in with their own OpenAI API key.☆24Jan 24, 2024Updated 2 years ago