jrzmnt / rl-vs-llm-chess
☆20Updated 6 months ago
Alternatives and similar repositories for rl-vs-llm-chess:
Users that are interested in rl-vs-llm-chess are comparing it to the libraries listed below
- ☆143Updated 8 months ago
- Collection of resources for RL and Reasoning☆25Updated 2 months ago
- ☆51Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆197Updated last month
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆116Updated 6 months ago
- A practical RAG where you can download and chat with github repo☆72Updated last month
- Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"☆105Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆107Updated last month
- 📝 Automatically annotate papers using LLMs☆310Updated 3 months ago
- A notebook based tutorial series on buildling a LLM from scratch☆24Updated 6 months ago
- ☆83Updated last month
- ☆502Updated 2 months ago
- model activation visualiser☆90Updated this week
- ☆40Updated 4 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆31Updated last month
- A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ☆127Updated last month
- CodeScientist: An automated scientific discovery system for code-based experiments☆67Updated this week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆102Updated 11 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 8 months ago
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆162Updated last week
- ☆146Updated last month
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆166Updated 11 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 10 months ago
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).☆33Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Repo contains code for LLM dataset generator which can help create question answer pairs using your very own PDF file.☆20Updated 5 months ago
- ☆59Updated 6 months ago
- An agentic AI application that allows you to chat with your papers and gather also information from papers on ArXiv and on PubMed☆67Updated last month
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆264Updated 3 months ago
- Fine tuning ModernBERT-embed-base on synthetic domain specific data for improvement to unseen queries☆23Updated 2 months ago