aitbc/docs/3_miners/6_monitoring.md

Monitoring & Alerts

Monitor your miner's performance and set up alerts.

Real-time Monitoring

Dashboard

aitbc miner dashboard

The dashboard shows:

  • GPU utilization
  • Memory usage
  • Temperature
  • Active jobs
  • Earnings rate

CLI Stats

aitbc miner stats

Prometheus Metrics

# Enable metrics endpoint
aitbc miner metrics --port 9090

Once enabled, metrics are available at: http://localhost:9090/metrics
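To check the endpoint from a script rather than a full Prometheus server, the response can be parsed directly. The following is a minimal sketch, assuming the endpoint serves the standard Prometheus text exposition format; the metric names in `SAMPLE` are hypothetical placeholders, not names confirmed by the miner.

```python
from urllib.request import urlopen

# Hypothetical sample payload; real metric names may differ.
SAMPLE = """\
# HELP miner_gpu_temperature_celsius GPU temperature
# TYPE miner_gpu_temperature_celsius gauge
miner_gpu_temperature_celsius 71.5
miner_jobs_active{gpu="0"} 2 1700000000000
"""


def parse_metrics(text):
    """Parse samples from Prometheus text format into {name: value}."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        if "}" in line:
            # Labeled sample: keep the label set as part of the key.
            head, _, tail = line.rpartition("}")
            name = head + "}"
        else:
            name, _, tail = line.partition(" ")
        fields = tail.split()  # value, then an optional timestamp
        if not fields:
            continue
        try:
            samples[name] = float(fields[0])
        except ValueError:
            continue  # ignore lines this simple parser cannot read
    return samples


def fetch_metrics(url="http://localhost:9090/metrics", timeout=5.0):
    """Fetch and parse the metrics endpoint (requires the miner to be running)."""
    with urlopen(url, timeout=timeout) as resp:
        return parse_metrics(resp.read().decode("utf-8"))


if __name__ == "__main__":
    print(parse_metrics(SAMPLE))
```

With the metrics endpoint running, `fetch_metrics()` returns the same kind of dictionary for the live values.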

Alert Configuration

Set Alerts

# GPU temperature alert
aitbc miner alert --metric temp --threshold 85 --action notify

# Memory usage alert
aitbc miner alert --metric memory --threshold 90 --action throttle

# Job failure alert
aitbc miner alert --metric failures --threshold 3 --action pause

Alert Types

| Type | Description |
|------|-------------|
| temp | GPU temperature |
| memory | GPU memory usage |
| utilization | GPU utilization |
| jobs | Job success/failure rate |
| earnings | Earnings below threshold |

Alert Actions

| Action | Description |
|--------|-------------|
| notify | Send notification |
| throttle | Reduce job acceptance |
| pause | Stop accepting jobs |
| restart | Restart miner |
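The relationship between a metric, its threshold, and the resulting action can be illustrated with a small model. This is a sketch only, not the miner's implementation: `AlertRule` and `triggered_actions` are hypothetical names, and it assumes a rule fires when the reading is at or above the threshold (an alert such as `earnings`, which fires below a threshold, would need the opposite comparison).

```python
from dataclasses import dataclass

# Action names taken from the Alert Actions table.
ACTIONS = ("notify", "throttle", "pause", "restart")


@dataclass(frozen=True)
class AlertRule:
    metric: str
    threshold: float
    action: str

    def __post_init__(self):
        # Reject actions that the documentation does not list.
        if self.action not in ACTIONS:
            raise ValueError(f"unknown action: {self.action!r}")


def triggered_actions(rules, readings):
    """Return (metric, action) pairs for rules whose reading meets the threshold."""
    fired = []
    for rule in rules:
        value = readings.get(rule.metric)
        # Assumption: an alert fires at or above its threshold.
        if value is not None and value >= rule.threshold:
            fired.append((rule.metric, rule.action))
    return fired


# Rules mirroring the three example commands under "Set Alerts".
RULES = [
    AlertRule("temp", 85, "notify"),
    AlertRule("memory", 90, "throttle"),
    AlertRule("failures", 3, "pause"),
]

if __name__ == "__main__":
    print(triggered_actions(RULES, {"temp": 88, "memory": 40, "failures": 3}))
```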

Log Management

View Logs

# Recent logs
aitbc miner logs --tail 100

# Filter by level
aitbc miner logs --level error

# Filter by job
aitbc miner logs --job-id <JOB_ID>

Log Rotation

# Configure log rotation
aitbc miner logs --rotate --max-size 100MB --keep 5
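For readers unfamiliar with size-based rotation, the effect of `--max-size` and `--keep` can be illustrated with Python's standard `RotatingFileHandler`. This is a sketch of the general technique, not the miner's own rotation code; it assumes `--keep 5` means five rotated backups next to the active file and treats 100MB as 100 * 1024 * 1024 bytes.

```python
import logging
import os
import tempfile
from logging.handlers import RotatingFileHandler


def make_rotating_logger(path, max_bytes=100 * 1024 * 1024, keep=5):
    """Create a logger that rotates `path` by size and keeps `keep` backups."""
    logger = logging.getLogger(f"miner-sketch:{path}")
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep the demo output out of the root logger
    handler = RotatingFileHandler(path, maxBytes=max_bytes, backupCount=keep)
    handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
    logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    # Tiny limits so that rotation is visible after a few lines.
    with tempfile.TemporaryDirectory() as tmp:
        log = make_rotating_logger(os.path.join(tmp, "miner.log"), max_bytes=200, keep=2)
        for i in range(50):
            log.info("job %d finished", i)
        for handler in list(log.handlers):
            handler.close()
            log.removeHandler(handler)
        print(sorted(os.listdir(tmp)))
```

Older backups beyond the `keep` count are deleted as new ones are created, so disk usage stays bounded by roughly `max_bytes * (keep + 1)`.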

Health Checks

# Run health check
aitbc miner health

# Detailed health report
aitbc miner health --detailed

The health report shows:

  • GPU health
  • Driver status
  • Network connectivity
  • Storage availability
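The health check can also be run from a watchdog script or a cron job. The sketch below wraps the command with `subprocess`; it assumes, without confirmation from this guide, that `aitbc miner health` exits with status 0 when the miner is healthy and non-zero otherwise. `run_health_check` is a hypothetical helper name.

```python
import subprocess


def run_health_check(command=("aitbc", "miner", "health"), timeout=30.0):
    """Run the health command and return (healthy, combined_output)."""
    try:
        result = subprocess.run(
            list(command),
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except (FileNotFoundError, subprocess.TimeoutExpired) as exc:
        # A missing binary or a hung check is treated as unhealthy.
        return False, f"health check could not complete: {exc}"
    # Assumption: exit status 0 means healthy.
    return result.returncode == 0, result.stdout + result.stderr
```

A caller might run `healthy, report = run_health_check()` on a schedule and send `report` to an operator whenever `healthy` is false.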

Next