[Tool] cosmos-flake-detector: Find Unreliable RPC Endpoints

Ignismeow · December 31, 2025, 8:48am

Hi Cosmos community!

I’ve built a tool to help operators detect flaky RPC endpoints before they cause issues.

The Problem

We’ve all been there: RPC endpoints that work 95% of the time but fail when it matters. Traditional monitoring only checks `/health` or `/status`, missing query-specific issues.

The Solution: cosmos-flake-detector

A Rust CLI tool that:

Tests specific query paths (not just `/health`)
Measures latency with microsecond precision (HDR histogram)
Calculates flakiness scores (0-100)
Exports JSON for CI/CD integration
Runs concurrent load tests

Example Usage

cosmos-flake-detector \
  --endpoints "https://rpc1.com,https://rpc2.com" \
  --duration 120 \
  --output results.json

Features

Query-specific testing (abci_info, status, genesis, etc.)
p50/p95/p99 latency metrics
Flakiness scoring algorithm
Concurrent testing
JSON export
Open source (MIT)

GitHub

saadaltafofficial/cosmos-flake-detector

Feedback and contributions welcome!

Use Cases

Validator operations (pre state-sync testing)
Chain indexers (CosmWasm endpoint testing)
CI/CD health checks
Continuous monitoring

Would love to hear if this solves a pain point for you!

yoda · January 1, 2026, 9:54am

This would probably work best as a stand-alone site with an api

Ignismeow · January 1, 2026, 10:48am

yes no doubt, thanks for advice.

Topic		Replies	Views
Choosing the Right RPC for Cosmos Development Conversation	3	65	November 16, 2025
How to query data of a validator using Cosmos Api available in the docs Cosmos-SDK	0	605	December 30, 2020
Validator Candidates Validation	96	59194	July 25, 2022
Cosmos Hub v17.1 Chain Halt - Post-mortem Security	6	1357	June 7, 2024
Cosmos Delegator Assistant propotype by the Protofire team Miscellaneous	0	377	May 11, 2020

[Tool] cosmos-flake-detector: Find Unreliable RPC Endpoints

Related topics