π± A little course on Reinforcement Learning Environments for evaluating and training Language Models
β200May 22, 2026Updated last week
Alternatives and similar repositories for llm-rl-environments-lil-course
Users that are interested in llm-rl-environments-lil-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analyze coinbase orderbook in real-time in Python with Bytewaxβ11Apr 23, 2024Updated 2 years ago
- Project code for training LLMs to write better unit tests + codeβ22May 19, 2025Updated last year
- 3D Gaussian Splatting Viewerβ32Mar 7, 2026Updated 2 months ago
- minimalistic AI library that resembles HF's transformersβ13Dec 31, 2024Updated last year
- Official Code Repository for paper "HYDRA: Model Factorization Framework for Black-Box LLM Personalization"β16Oct 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β34Nov 18, 2025Updated 6 months ago
- Container images and tool for running machine learning with Rust on Amazon SageMakerβ11Jul 25, 2024Updated last year
- Implemention based on lightrag and nano-graphrag to connect with psqlβ15Oct 28, 2024Updated last year
- [ICLR 2026] "When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms"β42Feb 3, 2026Updated 3 months ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.β18Dec 19, 2024Updated last year
- A library for training crosscodersβ17May 28, 2025Updated last year
- Kakao Mobility MCP Server for directions and transit informationβ11Sep 14, 2025Updated 8 months ago
- β11Jul 2, 2024Updated last year
- a single interface around speech-to-speech foundation modelsβ28Jun 27, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Join 15k builders to the Real-World ML Newsletter β¬οΈβ¬οΈβ¬οΈβ19Apr 19, 2024Updated 2 years ago
- Contextualized per-token embeddingsβ36May 11, 2025Updated last year
- μΌκ°νμ μ€μ ! Tritonβ16Feb 15, 2024Updated 2 years ago
- OAuth Login for Gradio. Supports multiple identity providers.β16Jan 20, 2025Updated last year
- Simple servers to benchmark FastAPI vs Axum with Postgresβ20May 1, 2024Updated 2 years ago
- Large language model of Medical AI, General Medical AI (GMAI)β17Jan 30, 2024Updated 2 years ago
- This repository benchmarks multiple vector databases for music semantic search, using a shared dataset and query set. It provides both a β¦β35Aug 31, 2025Updated 8 months ago
- Class of data structures that can be unfolded.β22Jan 6, 2026Updated 4 months ago
- Build a trading bot with OpenAI GPT-3.5, real-time data and prompt experimentationβ23Oct 18, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β22Sep 22, 2025Updated 8 months ago
- A quick way to get started with Transformer Lensβ14Dec 13, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.β14Mar 20, 2024Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ62Aug 30, 2024Updated last year
- Nearly Inference Free Embeddings: make your RAG queries 500x fasterβ77Apr 27, 2026Updated last month
- β26Sep 21, 2025Updated 8 months ago
- A library that allows interacting with Replit's code-exec APIβ26Dec 24, 2024Updated last year
- Local interpretability for survival modelsβ24May 27, 2024Updated 2 years ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Promptingβ35Mar 19, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)β20Apr 18, 2024Updated 2 years ago
- Test LLMs automatically with Giskard and CI/CDβ31Aug 7, 2024Updated last year
- Opinionated Go Project Templateβ13May 19, 2026Updated last week
- A Golang client for FalkorDBβ21May 11, 2026Updated 2 weeks ago
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFsβ30Jan 20, 2025Updated last year
- A search engine implementation using OpenAI's clip modelβ10Jun 20, 2021Updated 4 years ago
- β67Jan 28, 2026Updated 4 months ago