rohinmanvi / Capability-Aware_and_Mid-Generation_Self-Evaluations
☆19 Updated 2 months ago
Alternatives and similar repositories for Capability-Aware_and_Mid-Generation_Self-Evaluations:
Users who are interested in Capability-Aware_and_Mid-Generation_Self-Evaluations are comparing it to the repositories listed below.
- ☆48 Updated 3 months ago
- ☆12 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆24 Updated 2 months ago
- ☆23 Updated 5 months ago
- ☆13 Updated 2 months ago
- ☆108 Updated 3 weeks ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆46 Updated last year
- The official implementation of Self-Exploring Language Models (SELM) ☆61 Updated 8 months ago
- ☆71 Updated 6 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆48 Updated 2 months ago
- A repository for research on medium sized language models. ☆76 Updated 8 months ago
- ☆27 Updated this week
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆57 Updated 8 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. ☆21 Updated last week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆72 Updated last month
- ☆50 Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning ☆33 Updated 4 months ago
- ☆31 Updated 8 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆41 Updated last year
- Lottery Ticket Adaptation ☆37 Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆82 Updated 4 months ago
- ☆20 Updated 8 months ago