TrelisResearch / code-llama-32k
Run code-llama with 32k tokens using flash attention and better transformer
☆12 Updated 11 months ago
Related projects
Alternatives and complementary repositories for code-llama-32k
- ☆30 Updated last year
- Example of running LangChain on Cloud Run ☆61 Updated last year
- Geniusrise: Framework for building geniuses ☆60 Updated 5 months ago
- LLM Agents: Landing Page Generation for an E-commerce Platform Using CrewAI, Groq-LangChain and Qdrant ☆13 Updated 5 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆25 Updated last year
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g… ☆21 Updated this week
- ☆15 Updated 3 weeks ago
- Interface for interacting with Gradient AI in Python ☆14 Updated 4 months ago
- ☆45 Updated 10 months ago
- Self-host LLMs with vLLM and BentoML ☆72 Updated last week
- Streamlit app for recommending eval functions using prompt diffs ☆25 Updated 10 months ago
- Build Agentic workflows with function calling ☆20 Updated last week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- applications of https://github.com/PrefectHQ/marvin ☆12 Updated 9 months ago
- Helm charts to deploy Weaviate to k8s ☆50 Updated this week
- Deploy and Scale LLM-based applications ☆26 Updated last year
- Search through the Weaviate Podcast! ☆57 Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module ☆40 Updated last week
- Tool to take your ML model from local to production with one-line of code. ☆23 Updated 9 months ago
- Github repo for storing LlamaDatasets ☆29 Updated 9 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆28 Updated 8 months ago
- ☆47 Updated last year
- This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/ ☆38 Updated 10 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆33 Updated 8 months ago
- Super performant RAG pipeline for AI apps. ☆14 Updated 8 months ago
- Using modal.com to process FineWeb-edu data ☆19 Updated 2 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆27 Updated this week
- An open source collection of agentic Github workflows ☆12 Updated 6 months ago
- Tutorial for DSPy ☆21 Updated 6 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆16 Updated 2 months ago