☆46Mar 20, 2023Updated 2 years ago
Alternatives and similar repositories for benchmark_llm_summarization
Users who are interested in benchmark_llm_summarization are comparing it to the repositories listed below
- Perform fact checks on your conversations with LLMs to catch fake news, misleading information, and LLM confusion.☆12Apr 22, 2023Updated 2 years ago
- Implementation of "Can we obtain significant success in RST discourse parsing by using Large Language Models?" (accepted by EACL 2024)☆19May 13, 2024Updated last year
- ☆25Dec 13, 2024Updated last year
- ☆22Feb 26, 2024Updated 2 years ago
- Articles, White Papers, Technical Write-Ups and more authored by members of the GreySec community. Curated by staff, selected for excelle…☆27Aug 17, 2021Updated 4 years ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Oct 9, 2023Updated 2 years ago
- A PyTorch implementation of DeepFD (Deep Structure Learning for Fraud Detection)☆32Oct 2, 2020Updated 5 years ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆52Oct 23, 2025Updated 4 months ago
- fine-tuning tutorial☆18Feb 20, 2026Updated 2 weeks ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- A collection of various LLM tuning methods and data preprocessing code.☆14Feb 23, 2026Updated last week
- LLM Skirmish☆44Feb 3, 2026Updated last month
- ☆14Apr 29, 2025Updated 10 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- GRPO for training long-form QA and instruction following with a long-form reward model☆17Jul 17, 2025Updated 7 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated 2 months ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Crawls the entire Autohome (汽车之家) site☆10Dec 26, 2019Updated 6 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- Example of test code with Chrome Driver on OpenFin Runtime☆12Mar 10, 2023Updated 2 years ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- ☆16Feb 22, 2025Updated last year
- ☆14Mar 21, 2024Updated last year
- ☆16Jun 25, 2025Updated 8 months ago
- NLP daily-commit challenge on GitHub, season 5☆10Aug 19, 2024Updated last year
- ☆10Feb 17, 2019Updated 7 years ago
- Canopy is a machine learning compiler stack with the capability of adopting high-end FPGAs. As a part of OpenAIOS project, Canop…☆12May 7, 2021Updated 4 years ago
- Emotion detection on multiparty dialogue.☆40Apr 13, 2018Updated 7 years ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆73May 25, 2025Updated 9 months ago
- ☆37Mar 26, 2024Updated last year
- Repo for "On Learning to Summarize with Large Language Models as References"☆43May 24, 2023Updated 2 years ago
- ☆39Aug 9, 2022Updated 3 years ago
- Cross Lingual COVID-19 Fake News Dataset☆38Mar 20, 2021Updated 4 years ago
- unifloc on python☆15Nov 14, 2020Updated 5 years ago