sapientml / core
A SapientML plugin of SapientMLGenerator
☆10Updated last week
Related projects
Alternatives and complementary repositories for core
- ☆26Updated 2 months ago
- ☆17Updated 2 weeks ago
- Notes and artifacts from the ONNX steering committee☆25Updated 2 weeks ago
- The no-code AI toolchain☆74Updated 3 weeks ago
- Run Llama 2 using MLX on macOS☆33Updated 10 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆35Updated 6 months ago
- asynchronous/distributed speculative evaluation for llama3☆37Updated 3 months ago
- Benchmarking PyTorch 2.0 on different models☆21Updated last year
- Structured inference with Llama 2 in your browser☆51Updated last week
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆98Updated last month
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆46Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Podlite specification documents ( v1.0 released 🎉 )☆23Updated last week
- ONNX Adapter for model-explorer☆25Updated last month
- ☆43Updated 3 months ago
- Explore training for quantized models☆10Updated this week
- Run embedding models using ONNX☆23Updated 9 months ago
- AI Assistant running within your browser.☆40Updated 2 weeks ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 9 months ago
- Control VS Code using your smartphone! Support for sending files, images and more. Take AI assistance when you need it.☆22Updated 2 weeks ago
- Maintain a FAISS index for specified Datasette tables☆35Updated 4 months ago
- Personal solutions to the Triton Puzzles☆16Updated 3 months ago
- 360M model running in the browser on WebGPU☆20Updated 2 months ago
- A tracing JIT for PyTorch☆17Updated 2 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆10Updated 11 months ago
- Optimum graph creation and distribution for underground networks.☆33Updated 4 months ago
- tenstorrent kernel from twitch☆27Updated 7 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 8 months ago