Replication Monitoring

Concept version: 2.0

As of 2023-01-01 00:00:00


Dashboard Preferences

design note1: this allows CS Team to tailor the page based on their process, no need to ask dev if process changed

Threshold (status count for Monitor, Warning, Critical)
JobJar # (Report)
Teams Channel Emailprogram_teamchannel@givex.com

REPLICATION RATE

CAL1 : 1 VXL4 to 4 CWS
design note 0: graph appears when clicking a cell grid

REPLICATION OVERVIEW

design note2: clicking on a cell provides a drill-down per data center vs db role

STATUS CRITICALITY
Cal1TorontoMiamiLondon
OMWCMWCM WCMWC
trans3310301035231164133
report3103103511648333
robot3310310154231164133
root310310832311641333
shadow310310352311641333
spare3103103251611641333

design note3: shaded cell--->escalation threshold has been reached.
When a cell is clicked, we show "REPLICATION DELAYS" table below

STATUS DURATION
Cal1TorontoMiamiLondon
DURATIONMWCMWCMWCMWC
over 45 minstrans1, 3, 200, 3002, 4, 245,11,261035231164133
report31031035116483
robot31031034231164133
root310310303231164133
shadow31031035231164133
spare310310325161164133
15 - 45 minstrans1, 3, 200, 3002, 4, 248,95,301035231164133
report31031035116483
robot31031034231164133
root310310303231164133
shadow31031035231164133
spare310310325161164133
1 - 15 minstrans1, 3, 200, 3002, 4, 24203,33,1601035231164133
report31031035116483
robot31031034231164133
root310310303231164133
shadow31031035231164133
spare310310325161164133
under 60 secondstrans1, 3, 200, 3002, 4, 245,11,261035231164133
report31031035116483
robot31031034231164133
root310310303231164133
shadow31031035231164133
spare310310325161164133

design note3B: do we use DB numbers or DB ref?


REPLICATION DELAYS

Data Center: Cal1 | DB Role = Trans

FromToStatus / DelayDurationSelect
1Vxl4CWSCRITICAL00:02:3600:50:00
1Vxl3AdminCRITICAL00:03:3900:50:00
60JSON1vxlCRITICAL00:03:5000:50:00
201(Admin, IVR, Seq2) 1vxlCRITICAL00:0:4500:32:00
201(Admin, IVR, Seq2) 1vxlCRITICAL00:0:4500:05:00
201(Admin, IVR, Seq2) 1vxlCRITICAL00:0:4500:01:00
201{Show all 49 DB's}1vxlCRITICAL00:0:4500:23:00

design note: "Duration" is elapsed time since DB is in that status

Escalate to Tech Team

design note 4: content can be populated based on selections from "REPLICATION DELAYS"

On-Call: Johnny Techy
Title
Content



View by DB Role

design note5: we can still present more details -- primary audience: tech team

Filter:
Cal1
From/To1
T1-1:(ssd)
13.8
2
T1-2:(ssd)
13.8
3
T1-3:(ssd)
13.8
4
T1-4:(ssd)
13.8
5
T1-5:(ssd)
13.8
6
T1-6:(ssd)
13.8
7
T1-7:(ssd)
13.8
8
T1-8:(ssd)
13.8
9
T1-9:(ssd)
13.8
10
T1-10:(ssd)
13.8
11
T1-11:(ssd)
13.8
12
T1-12:(ssd)
13.8
13
T1-13:(ssd)
13.8
trans1Vxl
2DCDial, DCVpn, LCBO, Mcrs, Sway, XML
3Admin
4CWS
5DC, GAPI, MapU, Trans, UATP
6DCBck, DCMulti, JSON, SUN
7JSON, XML
8DCBck2, JSON, Srvc, WPOS
9JSON
10Metro,Seq
11DC
12DC, SUN
13JSON
14DCDial, DCMulti, JSON, Seq
15DCBck2, JSON
16CWS
17 JSON, WPOS