🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models
☆186Apr 23, 2026Updated 2 weeks ago
Alternatives and similar repositories for llm-rl-environments-lil-course
Users that are interested in llm-rl-environments-lil-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analyze coinbase orderbook in real-time in Python with Bytewax☆11Apr 23, 2024Updated 2 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 11 months ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Distribute and install Go binaries via NPM☆12Mar 27, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- ☆34Nov 18, 2025Updated 5 months ago
- Container images and tool for running machine learning with Rust on Amazon SageMaker☆11Jul 25, 2024Updated last year
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- ☆21Jun 27, 2024Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆131Apr 30, 2026Updated last week
- SiDeGame - Simplified Defusal Game☆13Apr 17, 2025Updated last year
- This is the official git report for SIDDMs in NeurIPS2023 and officially unofficial implementation for UFOGen CVPR2024☆20Oct 3, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- A library for training crosscoders☆17May 28, 2025Updated 11 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 5 months ago
- Wanna learn Rust with me? 👇👇👇☆20Sep 5, 2024Updated last year
- 🚀 this project aims to develop an app using an existing open-source LLM with data collected for domain-specific Jenkins knowledge that c…☆12Aug 29, 2024Updated last year
- a single interface around speech-to-speech foundation models☆28Jun 27, 2025Updated 10 months ago
- Contextualized per-token embeddings☆35May 11, 2025Updated 11 months ago
- OAuth Login for Gradio. Supports multiple identity providers.☆16Jan 20, 2025Updated last year
- Simple servers to benchmark FastAPI vs Axum with Postgres☆20May 1, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository benchmarks multiple vector databases for music semantic search, using a shared dataset and query set. It provides both a …☆35Aug 31, 2025Updated 8 months ago
- Build a trading bot with OpenAI GPT-3.5, real-time data and prompt experimentation☆23Oct 18, 2023Updated 2 years ago
- ☆22Sep 22, 2025Updated 7 months ago
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- ☆16Mar 21, 2024Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Aug 30, 2024Updated last year
- ☆26Sep 21, 2025Updated 7 months ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Apr 18, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Test LLMs automatically with Giskard and CI/CD☆31Aug 7, 2024Updated last year
- Opinionated Go Project Template☆13Updated this week
- A Golang client for FalkorDB☆20Updated this week
- ☆66Jan 28, 2026Updated 3 months ago
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs☆30Jan 20, 2025Updated last year
- An end-to-end batch scoring machine learning system that produces hourly predictions of the number of arrivals and departures that will t…☆26Apr 24, 2026Updated 2 weeks ago
- This is a PoC using native windows API directx, to hide and decrypt shellcode via compute shader☆10May 3, 2025Updated last year