Pervasive-AI-Lab / LuckyMera
☆15Updated 6 months ago
Alternatives and similar repositories for LuckyMera:
Users that are interested in LuckyMera are comparing it to the libraries listed below
- Implementation☆24Updated last month
- ☆80Updated 3 months ago
- ☆48Updated 5 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- ☆18Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated last week
- Based on the tree of thoughts paper☆48Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆68Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆63Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 6 months ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆16Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 2 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 3 weeks ago
- ☆27Updated 7 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 5 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Code for ICML 2024 paper☆21Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- A Framework For Intelligence Farming☆14Updated 3 weeks ago
- Learning to route instances for Human vs AI Feedback☆23Updated 2 months ago
- alternative way to calculating self attention☆18Updated 11 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆55Updated 2 months ago
- ☆48Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆90Updated 9 months ago