☆19Aug 3, 2024Updated last year
Alternatives and similar repositories for FreeEval
Users that are interested in FreeEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19May 25, 2024Updated 2 years ago
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models☆40Jul 19, 2024Updated last year
- Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction☆24Sep 30, 2022Updated 3 years ago
- ☆17Feb 28, 2024Updated 2 years ago
- Code for Semantic-Aligned Adversarial Evolution Triangle for High-Transferability Vision-Language Attack(TPAMI 2025)☆42Aug 28, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆100Jan 29, 2024Updated 2 years ago
- [KDD'22] Partial Label Learning with Discrimination Augmentation☆10May 21, 2024Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆47Sep 27, 2025Updated 9 months ago
- [NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time☆56Sep 28, 2024Updated last year
- ☆470Feb 7, 2025Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆26Nov 29, 2024Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆25Dec 1, 2024Updated last year
- A Japanese G2P tool based on pyopenjtalk☆25Aug 6, 2022Updated 3 years ago
- ☆32May 31, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆48Sep 5, 2024Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆37Nov 17, 2024Updated last year
- Unofficial implementation of deepseek/Janus in ComfyUI.☆17Mar 12, 2025Updated last year
- [NeurIPS'20] Semi-Supervised Partial Label Learning via Confidence-Rated Margin Maximization☆21May 29, 2022Updated 4 years ago
- [ACL 2024]Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs☆40Sep 24, 2024Updated last year
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆143Aug 17, 2024Updated last year
- semi-autoregressive neural machine translation☆23Sep 9, 2018Updated 7 years ago
- PyTorch implementation of batched GRU encoder and decoder.☆30Jan 24, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models☆138Mar 30, 2026Updated 2 months ago
- This repo provides the codebase for "A General Framework for Weak Supervision"☆40Jun 3, 2024Updated 2 years ago
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a…☆27Oct 30, 2024Updated last year
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆56Mar 14, 2025Updated last year
- ☆926May 22, 2024Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Making Espnet easier to use☆54Apr 9, 2021Updated 5 years ago
- ☆12Apr 21, 2025Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Nov 5, 2024Updated last year
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆11May 6, 2024Updated 2 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- A SapientML plugin of SapientMLGenerator☆11Apr 6, 2026Updated 2 months ago
- Markdown Editor with React + TS + shadcn UI / Tailwind css☆11Jun 3, 2025Updated last year
- ☆13Mar 5, 2025Updated last year
- Code and data for automatic paraphrase dataset augmentation.☆11Mar 8, 2021Updated 5 years ago