bipul1010 / agents_tutorialLinks
☆19Updated 11 months ago
Alternatives and similar repositories for agents_tutorial
Users that are interested in agents_tutorial are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- ☆64Updated 2 months ago
- ☆77Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆78Updated last week
- Example implementation of Iteration of Tought - Gives a star if you like the project☆42Updated 7 months ago
- ☆20Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 8 months ago
- ☆47Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago
- ☆66Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- Verbosity control for AI agents☆64Updated last year
- ☆87Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 3 months ago
- Score LLM pretraining data with classifiers☆55Updated last year
- ☆49Updated 5 months ago
- ☆53Updated 8 months ago
- Project code for training LLMs to write better unit tests + code☆21Updated 2 months ago
- ☆13Updated 3 months ago
- ☆20Updated 9 months ago
- An introduction to LLM Sampling☆79Updated 7 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 6 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 3 weeks ago
- ☆70Updated 2 weeks ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year