Improving transparency of large language models' reasoning
☆15Nov 25, 2025Updated 3 months ago
Alternatives and similar repositories for cot-transparency
Users that are interested in cot-transparency are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""☆19Oct 11, 2024Updated last year
- A spruced up version of the built-in python list. More post fixed methods for lovely typesafe chaining!☆13Nov 24, 2025Updated 3 months ago
- Creating a game to play Figgie & Train an agent to play against☆15Dec 3, 2022Updated 3 years ago
- ☆26Sep 5, 2024Updated last year
- ☆13Nov 21, 2016Updated 9 years ago
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆21Jul 23, 2024Updated last year
- Codebase for "On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback". This repo implements a generative multi-tur…☆24Dec 3, 2024Updated last year
- PyTorch implementation of "Weight Uncertainties in Neural Networks" (Bayes-by-Backprop)☆15Sep 10, 2018Updated 7 years ago
- ☆12Feb 28, 2025Updated last year
- ☆15Apr 26, 2025Updated 10 months ago
- ☆27Sep 15, 2025Updated 6 months ago
- G2Net Competition☆12Aug 2, 2023Updated 2 years ago
- Tensorflow implementation of Bayes-by-Backprop algorithm from "Weight uncertainty in neural networks" paper☆14Mar 6, 2019Updated 7 years ago
- Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the Over…☆13Aug 21, 2023Updated 2 years ago
- ☆13Jun 4, 2024Updated last year
- Inference API for many LLMs and other useful tools for empirical research☆111Mar 11, 2026Updated last week
- Sentiment words are employed to compute the tendency of a sentence, and then a document. To detect sentiment words in Chinese documents, …☆12Jun 20, 2022Updated 3 years ago
- See the official code and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models"☆16Aug 19, 2024Updated last year
- Repository for DISRPT2019 shared task☆12Sep 5, 2022Updated 3 years ago
- ☆33Nov 7, 2024Updated last year
- Awesome Long-CoT Data☆19Mar 26, 2025Updated 11 months ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 9 months ago
- ☆42Jun 11, 2025Updated 9 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Natural Language Processing models using private and secure data. Powered by OpenMined's tools PySyft and SyferText.☆11Feb 11, 2021Updated 5 years ago
- A curated list of resources about SOLID, the future of the Web!☆15Apr 25, 2019Updated 6 years ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆109Oct 25, 2023Updated 2 years ago
- Automatically create Anki cards from text using language models☆20Jan 7, 2023Updated 3 years ago
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago
- A Python web framework inspired by Spring Boot. Combines FastAPI, and Pydantic to provide auto dependency injection, configuration manage…☆23Jul 24, 2025Updated 8 months ago
- Reasoning in Space via Grounding in the World (ICLR 2025)☆50Nov 3, 2025Updated 4 months ago
- da website☆11Mar 16, 2024Updated 2 years ago
- A primer on large language models (LLM) as of Jan 2023, with bonus ChatGPT topic☆20Jan 29, 2024Updated 2 years ago
- More highlight colors in Logseq☆41Nov 16, 2021Updated 4 years ago
- A theme for Obsidian, inspired by and borrowing elements from Ubuntu☆26Jun 27, 2025Updated 8 months ago
- [ICLR 2022] Linking Emergent and Natural Languages via Corpus Transfer☆33Jun 2, 2024Updated last year
- ☆19Feb 5, 2025Updated last year