autogenai / easy-problems-that-llms-get-wrong
☆25 Updated last month
Related projects
Alternatives and complementary repositories for easy-problems-that-llms-get-wrong
- ☆40 Updated this week
- ☆31 Updated 2 weeks ago
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- entropix style sampling + GUI ☆25 Updated last week
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆20 Updated this week
- ☆64 Updated 5 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆23 Updated last month
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆20 Updated 9 months ago
- Simple examples using Argilla tools to build AI ☆38 Updated last week
- ☆31 Updated 4 months ago
- ☆20 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆26 Updated 8 months ago
- ☆44 Updated last month
- Routing on Random Forest (RoRF) ☆83 Updated last month
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated 9 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆40 Updated 8 months ago
- ☆21 Updated last month
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆32 Updated 5 months ago
- ☆41 Updated last month
- ☆76 Updated 10 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆114 Updated this week
- ☆48 Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆45 Updated last month
- ☆53 Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆72 Updated last month
- Data preparation code for Amber 7B LLM ☆82 Updated 6 months ago