Devops Case Study

The Client
The client is an American company that owns over 150 brands across 100 countries, mostly in media and Internet, headquartered in New York City. They design, develop, and market digital applications on the Web.

Business Need

To keep pace with their growth, our client was challenged with efficient monitoring and management of entire infrastructure spread across 3 different data centres.

They wanted 24×7 monitoring and management service to ensure effective control of their AWS, GCP, SNOWFLAKE, DATA CENTER infrastructure for alerts from around 9000+ servers.

DIASPARK SOLUTION

Diaspark’s DevOps Team employed a successful DevOps strategy to monitor entire applications infrastructure housing 3 physical data centers in the US. Having a server count of more than 9000+, the alerts were configured on Hyperic, VMware, Nagios, Catchpoint, NewRelic, and GCP’s Stackdriver.
Using New Relic, the alerts were monitored in real-time and escalated accordingly. 24×7 monitoring of operations was done and the restart of some of the critical servers was performed with the Rundeck tool.
1
 VMware environment across 3 Physical Data Centres in US
2
Housing each data centre
3
Alerts monitoring using Hyperic (VMware) & escalation to the relevant L2 Team
4
On call Escalation for critical and revenue impacting alerts
5
Monitoring Nagios for critical alerts and restarting servers permitted to the Tier 1 team
6
Auditing frequently occurring alerts and reporting them in PSR meetings
7
Real-time monitoring and escalations based on escalations matrix
Tools Used
nagios
download
vmw_logo_1
Work Area
  • Monitoring Alerts & Escalations
  • Monitoring Operations (24x7x365)