MaxGalindo150 / deeprl
☆13Updated this week
Related projects ⓘ
Alternatives and complementary repositories for deeprl
- Testing KAN-based text generation GPT models☆15Updated 6 months ago
- Alpha-Zero Connect Four NN trained via self play☆13Updated last month
- A high throughput, end-to-end RL library for infinite horizon tasks.☆18Updated 5 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Enable moe for nanogpt.☆21Updated 11 months ago
- LLama implementations benchmarking framework☆12Updated last year
- Tool to take your ML model from local to production with one-line of code.☆23Updated 9 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆23Updated 10 months ago
- Hypercube Viewer is a program that draws a hypercube of 3 to 10 dimensions.☆13Updated 4 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- Light WebUI for lm.rs☆21Updated last month
- Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch☆56Updated this week
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆29Updated 2 weeks ago
- ☆31Updated 9 months ago
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Rust bindings for CTranslate2☆13Updated last year
- Jax like function transformation engine but micro, microjax☆26Updated 2 weeks ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆12Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g…☆21Updated this week
- https://mlabonne.github.io/blog/☆35Updated 2 weeks ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- Run Llama 2 using MLX on macOS☆33Updated 10 months ago
- Build Agentic workflows with function calling☆20Updated last week
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 4 months ago
- ☆24Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆16Updated 8 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆12Updated 2 months ago
- ☆25Updated last month