Research Project: This is a free AI research project. No warranties, SLAs, or company associations.

Intermediate · 15 min

Sentinel Grid Monitoring

Understand the 30-sentinel monitoring grid — probe, assess, heal, and escalate across 5 tiers

The Sentinel Grid is BrainstormMSP's continuous monitoring layer. 30 specialized sentinels watch every dependency — from databases to AI providers — using a probe→assess→heal→escalate cycle. The grid is designed to detect issues before they impact users.

1

5 Sentinel Tiers

Platform Sentinels (8)

Monitor core infrastructure:

PostgreSQL, Redis, Supabase Auth

DO App Platform, DO Spaces

DNS/SSL, API Health, WebSocket

Vendor Sentinels (6)

Monitor integration health:

Acronis, ConnectWise, Datto

HaloPSA, NinjaOne, Defender

AI Sentinels (5)

Monitor AI provider availability:

Claude, OpenAI, Gemini, Perplexity, Grok

Intelligence Sentinels (7)

Monitor the brain's own systems:

Brain Decisions, Signal Processing, Evidence Chain

Approval Gates, Learning Loop, Control Graph, OODA Cycle

Fleet Sentinels (4)

Monitor the edge agent fleet:

Edge Heartbeat, Auto-Update, Tool Dispatch, Telemetry
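As a rough illustration, the five tiers above can be written down as a simple registry. The tier and sentinel names come from this guide; the dictionary layout and the `tier_of` helper are hypothetical and are not BrainstormMSP's actual data model.

```python
# Hypothetical registry of the sentinel grid. Names are taken from the guide;
# the structure itself is an assumption made for illustration.
SENTINEL_TIERS: dict[str, list[str]] = {
    "platform": [
        "PostgreSQL", "Redis", "Supabase Auth", "DO App Platform",
        "DO Spaces", "DNS/SSL", "API Health", "WebSocket",
    ],
    "vendor": ["Acronis", "ConnectWise", "Datto", "HaloPSA", "NinjaOne", "Defender"],
    "ai": ["Claude", "OpenAI", "Gemini", "Perplexity", "Grok"],
    "intelligence": [
        "Brain Decisions", "Signal Processing", "Evidence Chain",
        "Approval Gates", "Learning Loop", "Control Graph", "OODA Cycle",
    ],
    "fleet": ["Edge Heartbeat", "Auto-Update", "Tool Dispatch", "Telemetry"],
}


def tier_of(sentinel: str) -> str:
    """Return the tier a sentinel belongs to, or raise KeyError if unknown."""
    for tier, members in SENTINEL_TIERS.items():
        if sentinel in members:
            return tier
    raise KeyError(sentinel)


# 8 + 6 + 5 + 7 + 4 sentinels across the five tiers.
TOTAL_SENTINELS = sum(len(members) for members in SENTINEL_TIERS.values())
```

Summing the tier sizes is a quick check that the per-tier counts add up to the 30 sentinels the grid advertises.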

2

Probe Cycle

How Probes Work

Each sentinel runs probes on a configurable interval:

1. **Probe**: Execute a health check (API call, query, ping)

2. **Assess**: Compare result against expected thresholds

3. **Record**: Store probe result in the sentinel buffer

4. **Flush**: Periodically flush buffer to database for history
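The four steps above can be sketched as a small class. This is a minimal sketch and not the product's implementation: the `Sentinel` and `ProbeResult` classes, the `degraded_ms` latency threshold, and the rule that a failed check maps to RED while a slow one maps to YELLOW are all assumptions chosen for illustration.

```python
import time
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ProbeResult:
    timestamp: float
    latency_ms: float
    ok: bool
    status: str  # "GREEN", "YELLOW", or "RED"


def assess(ok: bool, latency_ms: float, degraded_ms: float) -> str:
    """Assess step: compare a probe outcome against thresholds (assumed rule)."""
    if not ok:
        return "RED"
    if latency_ms > degraded_ms:
        return "YELLOW"
    return "GREEN"


@dataclass
class Sentinel:
    name: str
    check: Callable[[], bool]      # the health check: API call, query, ping
    degraded_ms: float = 500.0     # hypothetical latency threshold
    buffer: list = field(default_factory=list)

    def probe_once(self) -> ProbeResult:
        # Probe: run the health check and time it.
        start = time.monotonic()
        try:
            ok = bool(self.check())
        except Exception:
            ok = False  # an exception counts as a failed probe
        latency_ms = (time.monotonic() - start) * 1000.0
        # Assess, then Record in the in-memory buffer.
        result = ProbeResult(time.time(), latency_ms, ok,
                             assess(ok, latency_ms, self.degraded_ms))
        self.buffer.append(result)
        return result

    def flush(self, sink: list) -> int:
        # Flush: move buffered results to durable storage (a list stands in
        # for the database here) and report how many were written.
        flushed = len(self.buffer)
        sink.extend(self.buffer)
        self.buffer.clear()
        return flushed
```

A scheduler would call `probe_once()` on the sentinel's interval and `flush()` on a slower cadence, so that routine probes do not each cost a database write.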

Probe Intervals

| Tier | Default Interval |
|------|-----------------|
| Platform | 60 seconds |
| Vendor | 5 minutes |
| AI | 60 seconds |
| Intelligence | 30 seconds |
| Fleet | 60 seconds |
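To get a feel for the probe volume these defaults imply, the intervals can be combined with the tier sizes from section 1. The sketch below assumes every sentinel is active and running at its default interval; the constant and function names are hypothetical.

```python
# Default intervals from the table above, in seconds.
DEFAULT_INTERVAL_SECONDS = {
    "platform": 60,
    "vendor": 300,
    "ai": 60,
    "intelligence": 30,
    "fleet": 60,
}

# Sentinel counts per tier, from the tier overview in this guide.
TIER_SIZES = {"platform": 8, "vendor": 6, "ai": 5, "intelligence": 7, "fleet": 4}


def probes_per_hour(tier: str) -> int:
    """Probes per hour for one tier, assuming all of its sentinels are active."""
    return TIER_SIZES[tier] * (3600 // DEFAULT_INTERVAL_SECONDS[tier])


# platform 480 + vendor 72 + ai 300 + intelligence 840 + fleet 240
TOTAL_PROBES_PER_HOUR = sum(probes_per_hour(t) for t in TIER_SIZES)
```

Under these assumptions the grid runs 1,932 probes per hour, with the Intelligence tier contributing the largest share because of its 30-second interval. Dormant sentinels and custom intervals would change the figure.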

Probe Retention

Probe history is retained for:

Last 24 hours: Full resolution

Last 7 days: 5-minute aggregates

Last 30 days: Hourly aggregates
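The retention windows above amount to a lookup by age plus a downsampling step. The following is a minimal sketch of both, assuming probe samples are `(unix_timestamp, latency_ms, ok)` tuples; the function names and the fields kept in each aggregate are hypothetical, since the guide does not say which statistics are stored.

```python
from collections import defaultdict
from datetime import timedelta
from statistics import mean
from typing import Optional


def resolution_for_age(age: timedelta) -> Optional[str]:
    """Return the resolution available for probe data of a given age."""
    if age <= timedelta(hours=24):
        return "full"
    if age <= timedelta(days=7):
        return "5-minute"
    if age <= timedelta(days=30):
        return "hourly"
    return None  # older than the retention window


def aggregate(samples, bucket_seconds: int) -> dict:
    """Group (unix_ts, latency_ms, ok) samples into fixed-size time buckets."""
    buckets = defaultdict(list)
    for ts, latency_ms, ok in samples:
        bucket_start = int(ts // bucket_seconds) * bucket_seconds
        buckets[bucket_start].append((latency_ms, ok))
    return {
        start: {
            "count": len(rows),
            "mean_latency_ms": mean(latency for latency, _ in rows),
            "failures": sum(1 for _, ok in rows if not ok),
        }
        for start, rows in sorted(buckets.items())
    }
```

Calling `aggregate(samples, 300)` would produce the 5-minute view and `aggregate(samples, 3600)` the hourly one.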

3

Self-Healing

Automatic Remediation

When a sentinel detects an issue, the self-healing cycle activates:

1. **Detect**: Probe fails or returns degraded status

2. **Diagnose**: Sentinel determines the issue type

3. **Heal**: Execute auto-remediation if available:

- Restart failed services

- Retry failed connections

- Clear stale caches

- Re-establish WebSocket connections

4. **Escalate**: If healing fails after 3 attempts, escalate to the brain
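The detect, heal, and escalate steps above can be sketched as a bounded retry loop. Only the limit of 3 attempts comes from this guide; the `heal_or_escalate` function, its callback parameters, and the returned strings are hypothetical.

```python
from typing import Callable

MAX_HEAL_ATTEMPTS = 3  # from the guide: escalate after 3 failed healing attempts


def heal_or_escalate(
    probe: Callable[[], bool],
    remediate: Callable[[int], None],
    escalate: Callable[[], None],
    max_attempts: int = MAX_HEAL_ATTEMPTS,
) -> str:
    """Run one self-healing cycle and report how it ended."""
    # Detect: nothing to do if the probe passes.
    if probe():
        return "healthy"
    for attempt in range(1, max_attempts + 1):
        # Heal: run the tier's auto-remediation (restart, retry, cache clear...).
        remediate(attempt)
        # Re-probe to see whether the remediation worked.
        if probe():
            return "healed"
    # Escalate: healing failed on every attempt, hand off to the brain.
    escalate()
    return "escalated"
```

Passing the attempt number to `remediate` lets a caller vary the action per attempt, for example waiting longer before each retry.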

Healing Actions by Tier

**Platform**: Service restart, connection pool refresh

**Vendor**: Token refresh, retry with backoff

**AI**: Provider fallback, model swap

**Intelligence**: Brain cycle restart, buffer flush

**Fleet**: Agent reconnect, heartbeat reset

4

Status Reading

Sentinel Status Colors

**GREEN**: All probes passing, no issues

**YELLOW**: Degraded performance or transient errors

**RED**: Critical failure, service down

Grid Status Page

View the sentinel grid at **Brain > Observatory**:

Grid layout shows all 30 sentinels organized by tier

Each sentinel shows current status, last probe time, and trend

Click any sentinel for detailed probe history

Status Aggregation

The overall system health is computed from sentinel statuses:

All GREEN = System Healthy

Any YELLOW (and no RED) = System Degraded

Any RED = System Alert (notifications sent)
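The aggregation rule can be expressed as a worst-status-wins function. This is a sketch under two assumptions the guide does not state outright: RED takes precedence over YELLOW when both are present, and dormant sentinels (represented here by the hypothetical value `"DORMANT"`) are left out of the calculation.

```python
from typing import Iterable


def system_health(statuses: Iterable[str]) -> str:
    """Reduce per-sentinel statuses to one overall system health label."""
    # Assumption: dormant sentinels do not affect overall health.
    active = [s for s in statuses if s != "DORMANT"]
    if any(s == "RED" for s in active):
        return "System Alert"      # notifications would be sent here
    if any(s == "YELLOW" for s in active):
        return "System Degraded"
    return "System Healthy"
```

For example, a grid with 28 GREEN sentinels, one YELLOW, and one RED would report `System Alert`, because a single RED outweighs everything else.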

5

Dormant vs Active

When Sentinels Go Dormant

A sentinel enters dormant mode when:

The monitored service is not configured (e.g., no Grok API key)

The integration is disabled by the tenant

The feature is not available in the current plan

Dormant Behavior

Dormant sentinels are shown as gray in the grid

They do not run probes or consume resources

They automatically activate when the dependency is configured

Activating Dormant Sentinels

1. Configure the missing dependency (API key, integration, etc.)

2. The sentinel detects the configuration change

3. The first probe runs within one probe interval

4. Sentinel transitions from dormant to active (GREEN/YELLOW/RED)
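The dormancy conditions and the activation steps can be modeled as a small state check. The sketch below is illustrative only: the `SentinelConfig` fields mirror the three dormancy conditions listed in this section, while the `"GRAY"` and `"PENDING"` labels and the function names are assumptions (the guide says dormant sentinels appear gray but does not name a state for the gap between configuration and the first probe).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SentinelConfig:
    configured: bool  # dependency is set up (for example, an API key exists)
    enabled: bool     # integration is not disabled by the tenant
    in_plan: bool     # feature is available in the current plan


def is_dormant(cfg: SentinelConfig) -> bool:
    """A sentinel is dormant if any one of the three conditions is unmet."""
    return not (cfg.configured and cfg.enabled and cfg.in_plan)


def display_status(cfg: SentinelConfig, last_probe: Optional[str]) -> str:
    """Return the status to show in the grid for one sentinel."""
    if is_dormant(cfg):
        return "GRAY"      # dormant: no probes run
    if last_probe is None:
        return "PENDING"   # hypothetical: active, awaiting the first probe
    return last_probe      # "GREEN", "YELLOW", or "RED"
```

Once a missing API key is added, `is_dormant` flips to `False`, and the sentinel shows its first probe result as soon as one is recorded.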
