A multi-lingual benchmark for evaluating industrial domain knowledge of LLMs.
☆111 · May 13, 2026 · Updated last week
Alternatives and similar repositories for IndustryBench
Users who are interested in IndustryBench are comparing it to the libraries listed below.
- 🚀 Enhanced GRPO with more verifiable rewards and real-time evaluators ☆37 · Jan 27, 2026 · Updated 3 months ago
- An AI agent skills pack (Agent Skills) built for developers on the **LazyCat MicroServer (懒猫微服)** platform. ☆45 · Apr 3, 2026 · Updated last month
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models" ☆12 · Mar 19, 2024 · Updated 2 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation ☆17 · Nov 29, 2022 · Updated 3 years ago
- auto star for repo lists ☆10 · Aug 26, 2023 · Updated 2 years ago
- ☆14 · Aug 18, 2022 · Updated 3 years ago
- Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19) ☆12 · Jun 7, 2020 · Updated 5 years ago
- The official implementation of InfoRM [NeurIPS 2024]. ☆15 · Oct 25, 2025 · Updated 6 months ago
- Finetuning LLaMA with DeepSpeed ☆10 · Apr 14, 2023 · Updated 3 years ago
- ☆41 · Updated this week
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT ☆91 · Oct 14, 2025 · Updated 7 months ago
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates" ☆22 · Sep 21, 2025 · Updated 8 months ago
- ModelBox: OpenAI-protocol proxy for context debugging (mock/passthrough + payload capture) ☆48 · Apr 15, 2026 · Updated last month
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks". ☆13 · Feb 28, 2026 · Updated 2 months ago
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning". ☆13 · Apr 18, 2022 · Updated 4 years ago
- awesome video representation learning ☆15 · Mar 22, 2021 · Updated 5 years ago
- Prediction under extremely imbalanced samples ☆40 · Oct 28, 2025 · Updated 6 months ago
- ♻️💾 Yet Another Memory Layer, inspired by Cognitive Science, designed for Cyber Waifu ☆61 · Apr 25, 2026 · Updated 3 weeks ago
- Run /yolo, walk away, come back to a working MVP. Autonomous build skill for AI coding agents. ☆36 · Feb 25, 2026 · Updated 2 months ago
- Custom-made start page for browsers ☆29 · Feb 22, 2026 · Updated 3 months ago
- verifying machine unlearning by backdooring ☆20 · Mar 25, 2023 · Updated 3 years ago
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea… ☆37 · Oct 17, 2025 · Updated 7 months ago
- The mcpchainx enables blockchains to plug directly into large models, powering truly trustworthy AI agents. ☆51 · Jan 13, 2026 · Updated 4 months ago
- AI-powered SDK for Software Development ☆56 · Apr 19, 2026 · Updated last month
- [Neurips 2025] Max-Former. ☆60 · Jan 19, 2026 · Updated 4 months ago
- In-Context Reinforcement Learning for Tool Use in Large Language Models ☆47 · Mar 26, 2026 · Updated last month
- PRB's collection of agent skills ☆59 · Updated this week
- A GNOME Shell extension that puts your GitHub dashboard in the top bar. Track notifications, monitor repository stats, view issues, and m… ☆45 · May 7, 2026 · Updated 2 weeks ago
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E… ☆31 · May 12, 2026 · Updated last week
- ☆42 · Nov 14, 2025 · Updated 6 months ago
- To help everyone to build their blog to learn ☆48 · Nov 5, 2025 · Updated 6 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting ☆21 · Mar 25, 2024 · Updated 2 years ago
- AI-powered website replication toolkit. Clone any website into a production-ready Next.js codebase. ☆132 · Apr 6, 2026 · Updated last month
- A prompt-based pipeline for finding, validating, and proving vulnerabilities using LLM sub-agents. ☆83 · Mar 27, 2026 · Updated last month
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆22 · Feb 28, 2026 · Updated 2 months ago
- ☆66 · May 6, 2026 · Updated 2 weeks ago
- Official implementation of "HLRTF: Hierarchical Low-Rank Tensor Factorization for Inverse Problems in Multi-Dimensional Imaging," CVPR 20… ☆21 · Aug 6, 2022 · Updated 3 years ago
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs ☆27 · Jan 15, 2025 · Updated last year
- ☆96 · Mar 13, 2026 · Updated 2 months ago