Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"
☆25Sep 11, 2024Updated last year
Alternatives and similar repositories for offsetbias
Users that are interested in offsetbias are comparing it to the libraries listed below
Sorting:
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆53Aug 10, 2025Updated 6 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 2 years ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆22Nov 19, 2025Updated 3 months ago
- Critique-out-Loud Reward Models☆74Oct 18, 2024Updated last year
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated last month
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- NC NLP Techblog. NC의 NLP가 열어갈 도전과 변화를 소개합니다.☆22Jan 22, 2025Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- ☆46Jun 24, 2025Updated 8 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆39Feb 19, 2026Updated last week
- hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆36Mar 29, 2025Updated 11 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆218Dec 24, 2023Updated 2 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆12Jun 14, 2023Updated 2 years ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 2 years ago
- ☆12Jun 19, 2024Updated last year
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- Application for Agent re-engineering for better and reliable Gen AI workflows.☆10Jul 20, 2025Updated 7 months ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆12Dec 3, 2024Updated last year
- A relatively simple, unified method for reporting on Kubernetes resource issues.☆12Mar 5, 2020Updated 5 years ago
- Deep learning introduction to beginners with PyTorch☆12Apr 24, 2020Updated 5 years ago
- Evaluation of Oasis Platform - simple install, UI and API☆14Feb 9, 2026Updated 2 weeks ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- This repo contains documentation related to the operation of the OpenBytes project.☆13Oct 29, 2021Updated 4 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆96Oct 30, 2024Updated last year
- ☆110Nov 7, 2024Updated last year
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int…☆21Jul 4, 2025Updated 7 months ago
- This repository contains the source code for the cloud.gov.au website.☆12Dec 7, 2022Updated 3 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- This is a simple example of how to serve a DeepSeek model with Azure ML.☆10Feb 10, 2025Updated last year
- A tool to explore ideas generated from artificial intelligence chats.☆10Apr 3, 2023Updated 2 years ago
- ☆10Jul 13, 2024Updated last year
- ☆11Aug 15, 2024Updated last year
- ☆10Dec 14, 2018Updated 7 years ago