augmented LLM with self reflection
☆143Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for awesome-llm-self-reflection
Users that are interested in awesome-llm-self-reflection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆94Nov 25, 2024Updated last year
- ☆23Nov 11, 2024Updated last year
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆41Jan 30, 2024Updated 2 years ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆26Dec 20, 2024Updated last year
- ☆20Dec 7, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆122Jul 20, 2025Updated 9 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 months ago
- ☆17Apr 26, 2024Updated 2 years ago
- The substitution of qsub.☆12Jan 25, 2019Updated 7 years ago
- ☆20Nov 3, 2024Updated last year
- A resource repository for representation engineering in large language models☆150Nov 14, 2024Updated last year
- ☆20Jul 15, 2024Updated last year
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large …☆28Oct 10, 2024Updated last year
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""☆33Oct 12, 2025Updated 6 months ago
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆117Apr 6, 2025Updated last year
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆571Oct 28, 2024Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Dec 8, 2024Updated last year
- A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation☆68Mar 2, 2026Updated 2 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓☆3,601Apr 20, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆25Apr 26, 2024Updated 2 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Apr 27, 2024Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆35Jan 4, 2025Updated last year
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic☆11Dec 18, 2016Updated 9 years ago
- ☆191Mar 8, 2026Updated last month
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆33Sep 20, 2024Updated last year
- papers related to LLM-agent that published on top conferences☆319Apr 14, 2025Updated last year
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".☆79May 9, 2025Updated 11 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆25Apr 14, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repository contains the implementation of Concept Activation Regions, a new framework to explain deep neural networks with human con…☆16Oct 7, 2022Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆10Oct 24, 2023Updated 2 years ago
- ☆74Apr 2, 2024Updated 2 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆21Apr 1, 2026Updated last month
- ☆41May 24, 2024Updated last year
- ☆28Nov 28, 2024Updated last year