☆38Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-evaluation-datasets
Users that are interested in LLM-evaluation-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An exploration of Android App Functions☆17May 26, 2025Updated last year
- An Interactive Hex-Rays Microcode Explorer☆17Feb 8, 2024Updated 2 years ago
- Writeup and exploit for CVE-2025-22441: Privilege escalation from installed app to SystemUI process on Android due to pass of untrusted A…☆100Oct 8, 2025Updated 8 months ago
- Rerousces related to time-travel debugging (TTD)☆45Jan 6, 2026Updated 5 months ago
- ☆19Sep 7, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Cross-Site Scripting (XSS) is a common vulnerability that allows attackers to inject malicious scripts into web pages viewed by users. In…☆11Sep 10, 2024Updated last year
- 面向大模型的民族文化数据集☆12May 26, 2025Updated last year
- ☆14May 7, 2024Updated 2 years ago
- ☆12Jul 8, 2022Updated 3 years ago
- Red Team AI Benchmark: Evaluating Uncensored LLMs for Offensive Security☆44Dec 25, 2025Updated 5 months ago
- Benchmarks for the VNN Comp 2023☆16Jun 7, 2024Updated 2 years ago
- [NAACL 2024 Findings] Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning☆14Jul 8, 2024Updated last year
- MIT IEEE URTC 2024. GSET 2024. Repository for the "MBASED: Practical Simplifications of Mixed Boolean-Arithmetic Obfuscation". A Binary N…☆44Aug 8, 2025Updated 10 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆24May 6, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper "Watermarking Makes Language Models Radioactive"☆23Oct 25, 2024Updated last year
- Binary Ninja deobfuscation plugin☆22Jul 23, 2025Updated 10 months ago
- NLQF is a tool to filter query-appropriate comments for building high-quality code search datasets.☆19Feb 15, 2022Updated 4 years ago
- How a leaked JWT secret inside a JavaScript file led to full admin access — and why most devs still don't see it coming.☆15Jul 22, 2025Updated 10 months ago
- KeySentry – Find leaked API keys & secrets in any GitHub repo. No mercy.☆41May 29, 2026Updated last week
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆58Dec 28, 2025Updated 5 months ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- ☆12Sep 27, 2021Updated 4 years ago
- LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi…☆39May 2, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents☆81Apr 24, 2026Updated last month
- frida脚本集合☆36Feb 6, 2026Updated 4 months ago
- ☆32Sep 13, 2024Updated last year
- 🥇 Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeati…☆72May 11, 2026Updated 3 weeks ago
- Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- Formally proving the security of Fast Reed-Solomon interactive oracle proofs of proximity☆91Dec 11, 2025Updated 5 months ago
- a secret detection tool☆40Mar 1, 2026Updated 3 months ago
- Extension for CoEdPilot☆21Feb 25, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ios application class-dump use frida☆39Apr 28, 2023Updated 3 years ago
- 大创项目,层级注意力机器翻译☆17Apr 12, 2021Updated 5 years ago
- ☆13Jun 4, 2023Updated 3 years ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- ☆15Mar 22, 2021Updated 5 years ago
- VQVAE | VAE | GumbelVAE | PixelCNN☆21Jun 15, 2020Updated 5 years ago
- ☆12Jun 30, 2024Updated last year