TrelisResearch / code-llama-32k
Run code-llama with 32k tokens using flash attention and better transformer
★12 · Updated last year
Alternatives and similar repositories for code-llama-32k:
Users who are interested in code-llama-32k are comparing it to the repositories listed below.
- Streamlit app for recommending eval functions using prompt diffs ★27 · Updated last year
- ★25 · Updated last week
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ★37 · Updated last year
- Tools for formatting large language model prompts. ★12 · Updated last year
- Demos of some issues with LangChain. ★31 · Updated last year
- Data extraction with LLM on CPU ★68 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ★24 · Updated 2 months ago
- ★30 · Updated last year
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ★32 · Updated last week
- ★12 · Updated 5 months ago
- ★45 · Updated last year
- Agent computer interface for AI software engineer. ★26 · Updated this week
- Explore a curated collection of exceptional open-source libraries for generative AI meticulously reviewed or slated for review by The AI … ★48 · Updated last year
- LLM Agents: Landing Page Generation for an E-commerce Platform Using CrewAI, Groq-LangChain and Qdrant ★13 · Updated 8 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines ★31 · Updated last year
- Search through the Weaviate Podcast! ★57 · Updated last month
- ★47 · Updated last year
- Helm charts to deploy Weaviate to k8s ★57 · Updated last week
- "T.I.M.E: Thoroughly Intelligent Mail Explorer" Repo to try and build an incredible RAG system over email (this is to test the SOTA in RAG… ★18 · Updated 3 weeks ago
- Mistral-7B finetuned for function calling ★15 · Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ★28 · Updated last year
- A fast & easy way to train ML models in your cloud, directly from your laptop. ★14 · Updated 2 years ago
- ★31 · Updated last year
- Repository hosting LangChain helm charts. ★43 · Updated this week
- A specification for OpenInference, a semantic mapping of ML inferences ★45 · Updated 9 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ★100 · Updated last month
- ★12 · Updated 2 weeks ago
- Nexusflow function call, tool use, and agent benchmarks. ★19 · Updated last month
- LLM finetuning ★42 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! ★47 · Updated 10 months ago