🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models
☆67 · Apr 11, 2026 · Updated this week
Alternatives and similar repositories for llm-rl-environments-lil-course
Users that are interested in llm-rl-environments-lil-course are comparing it to the libraries listed below.
- opensource and fastest vectorDB ☆90 · Dec 12, 2025 · Updated 4 months ago
- Supercharge your Gaianet node by generating a vector knowledge base from any API. Demo slides: https://hackmd.io/@santteegt/ByoykY4nC#/ L… ☆11 · Nov 29, 2024 · Updated last year
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 · Mar 15, 2023 · Updated 3 years ago
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- ☆12 · Dec 23, 2024 · Updated last year
- Materials for "Reinforcement Learning for Deliberation in ROS 2" workshop at ROSCon 2025 ☆44 · Oct 27, 2025 · Updated 5 months ago
- Contextualized per-token embeddings ☆35 · May 11, 2025 · Updated 11 months ago
- This repository benchmarks multiple vector databases for music semantic search, using a shared dataset and query set. It provides both a … ☆35 · Aug 31, 2025 · Updated 7 months ago
- Class of data structures that can be unfolded. ☆22 · Jan 6, 2026 · Updated 3 months ago
- ☆22 · Sep 22, 2025 · Updated 6 months ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster ☆74 · Feb 20, 2026 · Updated last month
- ☆26 · Sep 21, 2025 · Updated 6 months ago
- Find why PyTorch training is slow while it's still running ☆149 · Updated this week
- ☆62 · Jan 28, 2026 · Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 · Nov 11, 2024 · Updated last year
- ☆20 · Oct 25, 2025 · Updated 5 months ago
- [CVPR'25] Conformal prediction for vision-language models. Enhancing VLM deployment with reliability guarantees. ☆19 · Jun 7, 2025 · Updated 10 months ago
- Docs powering https://docs.venice.ai ☆46 · Updated this week
- Project code for training LLMs to write better unit tests + code ☆21 · May 19, 2025 · Updated 10 months ago
- Let's try every rust gui library and see how they fare ☆20 · Mar 1, 2024 · Updated 2 years ago
- a python package for loading and converting images ☆29 · Feb 18, 2026 · Updated last month
- Trust your gut on git ☆62 · Jul 30, 2025 · Updated 8 months ago
- Adds some useful YAML frontmatter to your Obsidian notes ☆11 · May 3, 2024 · Updated last year
- LangChain-Compatible Wrapper for Any Private LLM APIs ☆77 · Nov 17, 2025 · Updated 4 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 · Aug 4, 2024 · Updated last year
- Extract streaming data from text using prefix completion. ☆10 · Oct 6, 2024 · Updated last year
- This repository contains small scripts and a notebook to convert conversation exports into a combined JSONL dataset suitable for fine-tun… ☆39 · Oct 2, 2025 · Updated 6 months ago
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023) ☆32 · Jul 30, 2025 · Updated 8 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 · Nov 11, 2024 · Updated last year
- When Reasoning Meets Its Laws ☆36 · Jan 2, 2026 · Updated 3 months ago
- A Structured Output Benchmark whose 'ground-truth' is actually right ☆19 · Dec 5, 2025 · Updated 4 months ago
- An agent using Langgraph and Agent Inbox to close stale issues ☆68 · Sep 12, 2025 · Updated 7 months ago
- A distributed execution framework built upon lunatic. ☆16 · Jan 19, 2024 · Updated 2 years ago
- Xenon is a WebDriver proxy, for running multiple WebDriver sessions through a single hub ☆12 · Jun 30, 2022 · Updated 3 years ago
- Project Euler GPT Resolver ☆10 · Feb 12, 2024 · Updated 2 years ago
- Version lock, cache, and run binaries from any Github Release assets. Pull in external tools and keep the versions in sync across your te… ☆15 · Jan 3, 2024 · Updated 2 years ago
- MCP Server to interact with Amazon Ads ☆20 · May 21, 2025 · Updated 10 months ago
- ☆35 · Oct 26, 2025 · Updated 5 months ago