rohinmanvi / Capability-Aware_and_Mid-Generation_Self-Evaluations
☆19Updated last month
Alternatives and similar repositories for Capability-Aware_and_Mid-Generation_Self-Evaluations:
Users that are interested in Capability-Aware_and_Mid-Generation_Self-Evaluations are comparing it to the libraries listed below
- ☆46Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- ☆23Updated 4 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆44Updated last month
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆22Updated last month
- The official implementation of Self-Exploring Language Models (SELM)☆60Updated 7 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆57Updated this week
- ☆69Updated 5 months ago
- ☆90Updated this week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆66Updated 2 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆47Updated last month
- ☆40Updated 8 months ago
- ☆31Updated 7 months ago
- ☆13Updated last month
- ☆47Updated last month
- A repository for research on medium sized language models.☆76Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM☆65Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆77Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆50Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆34Updated 8 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆56Updated 7 months ago
- ☆83Updated last week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- My fork os allen AI's OLMo for educational purposes.☆30Updated last month
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆20Updated 3 weeks ago
- Replicating O1 inference-time scaling laws☆70Updated last month
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆100Updated 2 weeks ago
- Benchmarking Chat Assistants on Long-Term Interactive Memory☆35Updated last month