ai-hero / llm-research-fine-tuning
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-research-fine-tuning
- ☆75Updated 5 months ago
- Self-host LLMs with vLLM and BentoML☆74Updated last week
- Leverage your LangChain trace data for fine tuning☆38Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- ☆75Updated 5 months ago
- End-to-End LLM Guide☆97Updated 4 months ago
- Track OpenAI compatible requests to a dataset☆57Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 7 months ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆94Updated this week
- ☆47Updated 2 weeks ago
- experiments with inference on llama☆105Updated 5 months ago
- Fiddler Auditor is a tool to evaluate language models.☆171Updated 8 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆60Updated this week
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- ☆64Updated 5 months ago
- ☆22Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆116Updated 3 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 10 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆100Updated 2 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required☆49Updated 2 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆94Updated 5 months ago
- Resources for exploring Generative Feedback Loops with Weaviate!☆36Updated this week
- ☆200Updated 9 months ago
- ☆47Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- ☆15Updated last month
- Foyle is a copilot to help developers deploy and operate their applications.☆109Updated this week
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆106Updated this week
- ☆83Updated last year
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆168Updated 2 weeks ago