MaxGalindo150 / deeprl
☆15 · Updated this week
Related projects
Alternatives and complementary repositories for deeprl
- A high throughput, end-to-end RL library for infinite horizon tasks. ☆18 · Updated 5 months ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning ☆29 · Updated 3 weeks ago
- Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch ☆57 · Updated this week
- Alpha-Zero Connect Four NN trained via self play ☆13 · Updated last month
- Testing KAN-based text generation GPT models ☆15 · Updated 6 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆19 · Updated 7 months ago
- LLama implementations benchmarking framework ☆12 · Updated last year
- a version of baby agi using dspy and typed predictors ☆17 · Updated 8 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 · Updated 10 months ago
- Run LLMs on Replicate with vLLM ☆15 · Updated last month
- Enable moe for nanogpt. ☆21 · Updated 11 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆33 · Updated 8 months ago
- Rust bindings for CTranslate2 ☆13 · Updated last year
- ☆18 · Updated 2 years ago
- Some experiments on transformer models ☆11 · Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 · Updated last week
- ☆36 · Updated 3 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j ☆14 · Updated 3 months ago
- ☆31 · Updated 8 months ago
- NLP with Rust for Python 🦀🐍 ☆59 · Updated 5 months ago
- Hypercube Viewer is a program that draws a hypercube of 3 to 10 dimensions. ☆13 · Updated 4 months ago
- Build and Deploy a voice-based Chatbot with Langchain and BentoML ☆19 · Updated last year
- RAG on codebases using treesitter and LanceDB ☆31 · Updated this week
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments ☆15 · Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- ☆26 · Updated 2 months ago
- LMQL implementation of tree of thoughts ☆33 · Updated 9 months ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried … ☆14 · Updated 10 months ago
- Fullstack chatbot application ☆11 · Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆21 · Updated 5 months ago