CosineAI/experiments

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CosineAI/experiments)

CosineAI / experiments

Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.

☆14

Alternatives and similar repositories for experiments

Users that are interested in experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

omotolani12 / Building-an-Advanced-RAG-Chatbot-with-Knowledge-Graphs
View on GitHub
☆12Jun 12, 2024Updated 2 years ago
kbmurali / som-driven-qa-rag
View on GitHub
Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…
☆15Mar 16, 2024Updated 2 years ago
Brent-Morrison / Stock_master
View on GitHub
The "Stock Master" database collates fundamental and price data for US stocks.
☆12Dec 30, 2024Updated last year
jc-ryan / holistic_automated_red_teaming
View on GitHub
[EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
☆17Nov 9, 2024Updated last year
OpenHands / openhands-agent-monitor
View on GitHub
☆14Oct 8, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
dividor / storm-with-local-docs
View on GitHub
A simple demo repo to show using storm with local PDF documents
☆16Oct 27, 2024Updated last year
vectara / Search-UI
View on GitHub
☆10Aug 7, 2023Updated 2 years ago
epoch-research / training-cost-trends
View on GitHub
☆27Apr 1, 2026Updated 3 months ago
hamelsmu / wandb-modal-webhook
View on GitHub
A webhook that integrates the W&B model registry with Modal Labs
☆15Dec 24, 2023Updated 2 years ago
ctlllll / reward_collapse
View on GitHub
☆26May 30, 2023Updated 3 years ago
d-one / d-one-mlops-aws
View on GitHub
Repository for the D ONE MLOps AWS BlogPost
☆10May 5, 2026Updated 2 months ago
glaive-ai / function-calling-server
View on GitHub
☆35Feb 8, 2024Updated 2 years ago
hackclub / apocalypse
View on GitHub
🧟 The hackathon where 150 teens built fun tech to survive the zombie apocalypse.
☆13Jun 30, 2026Updated 3 weeks ago
Yangyi-Chen / PaperList-Trustworthy-Applications
View on GitHub
Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…
☆21May 30, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nicoalbanese / aie-feb-25-starter
View on GitHub
☆13Feb 22, 2025Updated last year
developersdigest / bee-agent
View on GitHub
☆18Jan 10, 2025Updated last year
ftes / phoenix-headlessui
View on GitHub
Phoenix LiveView + HeadlessUI React web components
☆13Nov 6, 2024Updated last year
cytan17726 / KBQA_QueryGraphGeneration
View on GitHub
一种面向中文复杂问句的查询图生成方法，以及一份含有多种复杂句的中文知识图谱问答数据集
☆18Mar 16, 2023Updated 3 years ago
dendrofen / react-konva-to-svg
View on GitHub
Extend Konva's functionality to export stages as SVG. Enhance the quality of exported images with SVG format.
☆24Jan 11, 2025Updated last year
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year
gsidsid / ai-express
View on GitHub
Instantly turn AI prompts into production-ready API endpoints.
☆22May 4, 2023Updated 3 years ago
PrasannS / rlhf-length-biases
View on GitHub
☆27Mar 13, 2024Updated 2 years ago
writer / framework-tutorials
View on GitHub
Framework Tutorials Repo
☆27May 19, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
github / artifact-attestations-workflows
View on GitHub
Demo repository showcasing how to use reusable workflows to build artifact attestations
☆16Jul 9, 2026Updated last week
alex000kim / ML-Pipeline-With-DVC-SkyPilot-HuggingFace
View on GitHub
☆15Sep 9, 2023Updated 2 years ago
AstroPilot-AI / DenarioApp
View on GitHub
GUI for AstroPilot
☆26Nov 5, 2025Updated 8 months ago
nicoalbanese / aie-deepresearch
View on GitHub
☆18Feb 23, 2025Updated last year
workos / mastra-agents-meme-generator
View on GitHub
☆25Feb 4, 2026Updated 5 months ago
cc-hpc-itwm / gpispace
View on GitHub
GPI-Space: Memory Driven Computing and Big Data
☆10Mar 17, 2026Updated 4 months ago
OpenMOSE / RWKV-Infer
View on GitHub
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆51Oct 21, 2025Updated 8 months ago
recursal / minmodmon
View on GitHub
Mini Model Daemon
☆13Nov 9, 2024Updated last year
OpenHands / agent-analysis
View on GitHub
A collection of scripts and tools for analyzing SWE agents.
☆16May 7, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
arjunguha / BigCodeBench-X
View on GitHub
A benchmark of programming tasks for LLMs that supports almost any programming language.
☆13Jun 30, 2025Updated last year
neelnanda-io / Neuroscope
View on GitHub
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆14Feb 13, 2023Updated 3 years ago
elicit / fave-dataset
View on GitHub
Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"
☆13Oct 20, 2024Updated last year
marco-bertelli / rag.flask-start
View on GitHub
Code of Medium story
☆28Dec 14, 2025Updated 7 months ago
ecosia / pycon22-prometheus-workshop
View on GitHub
Workshop material for PyCon DE 2022 by @Vinesse and @sleepypioneer
☆19Dec 14, 2022Updated 3 years ago
deepgram-devs / prerecorded-audio-notebook
View on GitHub
☆13Nov 28, 2025Updated 7 months ago
huggingface / ioi
View on GitHub
☆42Mar 26, 2025Updated last year