☆144Nov 11, 2024Updated last year
Alternatives and similar repositories for implicit_chain_of_thought
Users that are interested in implicit_chain_of_thought are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆206Apr 19, 2025Updated 11 months ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆40Sep 8, 2025Updated 7 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- ☆124Feb 21, 2025Updated last year
- WorldSense benchmark for grounded reasoning in language models☆24Nov 28, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Jul 27, 2025Updated 8 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- https://interactivetraining.ai/☆17Oct 2, 2025Updated 6 months ago
- ☆20Nov 15, 2024Updated last year
- ☆16Mar 22, 2025Updated last year
- ☆19Jun 21, 2025Updated 9 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆206Mar 4, 2025Updated last year
- ☆49Jan 7, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆104Sep 21, 2023Updated 2 years ago
- ☆39Mar 29, 2024Updated 2 years ago
- ☆16Oct 4, 2024Updated last year
- ☆28Oct 2, 2025Updated 6 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆194Apr 17, 2025Updated 11 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆546Jan 17, 2025Updated last year
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,071Jul 30, 2025Updated 8 months ago
- DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models☆25May 23, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆175Jan 16, 2025Updated last year
- Encoder-decoders for translating different chemical formats.☆20Sep 17, 2025Updated 6 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Training Large Language Model to Reason in a Continuous Latent Space☆1,563Updated this week
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆65Dec 25, 2023Updated 2 years ago
- ☆146Sep 12, 2025Updated 7 months ago
- ☆15May 30, 2024Updated last year
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆29May 23, 2024Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago