Monitoring — Spotting Issues Before They're Crises
What to watch, where to watch it, and what ‘normal’ looks like. Building the situational awareness that catches problems early.
Monitoring is the layer that turns ‘something seems off’ from intuition into actionable signal. The Dashboard surfaces every metric you need, but you have to know what to watch and what ‘normal’ looks like to spot deviation. This page tells you what.
Is the TradingBot connected to every active venue? Is the database reachable? Are all containers healthy?First-priority because nothing else matters if connections are down.
2. Kill switch state
Is the kill switch in the expected state? Accidentally-flipped switches are a real source of “why isn’t the bot trading” confusion.
3. Trade frequency vs expectation
Is the bot producing trades at the rate your mode/symbol pairing predicted? Sudden silence or sudden flood are flags.
4. Win rate trend
Is the win rate trending stable, up, or down? Sustained downward trend over 2+ weeks suggests regime mismatch.
5. Drawdown character
Is the current drawdown within backtest expectations? Beyond expectations is a flag.
6. Per-pair P&L
Are some pairs dragging? Some pairs carrying the rest? Asymmetric performance is information.
7. Log error rate
Recurring errors in the Logs panel — rare = OK, frequent = investigate.
8. API key audit log
The venue’s audit log. Monthly review for unfamiliar activity.
Live trades, positions, modes, pairs, kill switch, log stream, analytics. The hub.Check daily for 2–5 minutes; weekly for 30 minutes structured review.
Telegram chat — the firehose
Every closed trade. Running totals. Quick scan during the day for “is everything closing as expected?”Mute notifications during sleep hours; review the chat in the morning.
Logs panel (in Dashboard)
Live stream of TradingBot output. Use during:
First hour after any change (new pair, mode, key rotation).
Investigations (why did this trade not fire?).
Confidence checks (is the bot actually doing anything?).
Venue's API audit log
Each exchange’s API management page has audit logs. Monthly review for anomalies.
VPS-level monitoring
Disk space, memory, CPU. The bot is robust to occasional resource pressure but monitoring catches issues early.Tools: standard VPS monitoring (Netdata, Prometheus + Grafana, or your provider’s built-in). Most operators use the simple solution: occasional top, df, free commands.
External monitoring (optional)
UptimeRobot, Better Uptime, or similar can ping your reverse-proxy endpoints periodically and alert if down.Worth setting up for dashboard and (if applicable) signals.yourdomain.com.
Something looks off. Could be unusual trade rate, drawdown, error pattern, connection issue.
2
Step 1: is it within historical range?
Compare to backtest expectations. If within range → likely just normal regime variation. Continue monitoring; don’t act.
3
Step 2: is the connection healthy?
If connection issues: investigate (network, exchange, IP allowlist). Repair before evaluating performance.
4
Step 3: is the configuration as expected?
Did you (or someone) accidentally change something? Review recent operator log; check Modes panel.
5
Step 4: is the regime shift the cause?
If yes: decide whether to ride through (BasicMode handles regime shifts OK), kill-switch off (Tsl2Sell may be hurting), or change modes (deliberate move).
6
Step 5: kill switch off if uncertain
When in doubt, kill switch ON. Investigate without active risk. Re-enable when you’re clear.
7
Step 6: document the incident
Operator log entry: what happened, what you observed, what you did, what the resolution was. Future-you benefits.
External monitoring catches things you might not see in your daily routine:
Dashboard down (TLS expired? Container crashed?).
SignalsBot down (webhooks failing).
VPS down (provider issue).
Push notifications when something is wrong = faster operator response.
UptimeRobot — free tier, simple setup
Sign up at UptimeRobot. Add monitors for:
https://dashboard.yourdomain.com/api/health (or appropriate health endpoint).
https://signals.yourdomain.com/health.
Configure email/SMS notifications. Most operators get an alert in under a minute when something goes down.
Better Uptime — paid, more features
More sophisticated alerting (escalation policies, on-call rotation, status pages). For operators running larger setups or wanting professional-grade monitoring.
Custom Telegram alerts
For operators comfortable with scripting: a small custom monitor that pings your endpoints and sends Telegram messages on failure.Cheap, no third-party dependency, integrates with your existing Telegram channel.