Cloud Site Reliability Engineer Job in Vmware

Cloud Site Reliability Engineer

Apply Now
Job Summary

Job Description

Job Summary

To be a member of the Command Center with the focal point for the success of our enterprise class SaaS service offerings across all of VMware. The Command Center increases overall confidence in the services that are being delivered and ensures proper communication to VMware customers during any disruption of normal operations. This position requires experience developing integrations between monitoring and collaboration tools with service owners and establish processes to exercise judgment within defined procedures and practices to determine appropriate action. This role offers an exciting opportunity to work across multiple technology domains and engage in the latest and greatest technology services being developed at a world class company.

Team Responsibility

The Command Center Engineer is a member of the highly visible Cloud Services Operations team and is a core member of the VMware Cloud Productivity and Engineering organization (CPE). As a member of the Command Center, you will be functioning in a world-class team respected for its innovation, execution and collaboration operating a world-wide organization. The team ensures continuity of VMware SaaS Services that impacts any significant disruption of normal operations of our enterprise service offerings and operates 24/7/365 days a year.

The Command Center is expected to provide a reliable service with an enterprise level SLA and must strive for 100% customer support satisfaction. The primary objective of this team is to oversee and ensure critical applications and services provided are available and working as expected for customers and subscribers. The secondary objective is to develop and improve existing service monitoring tools through additional integrations, automation and collaboration.

Role Responsibility

The ideal candidate serves as the focal point for the success of our enterprise class SaaS service offerings across all of VMware providing technical skill and knowledge to the command center. Along with working on complex issues where analysis of situations or data requires an in-depth evaluation of various factors this role will be required to help level up the technical skills of the team, develop tools and automation for VMware services, and assist services in automated problem resolution. The ideal candidate will have technical background with VMware vSphere, Linux/Windows, AWS or Other Cloud products and VMware Products. Should have development skills in modern scripting language like python, Ansible or Good in shell scripting.

Required Experience

  • Proficient in vSphere, vSAN, and other VMware products and platforms.
  • Proficient in Windows or Linux administration
  • Proactively identify potential problems, issues and actively communicate and manage issues to resolution.
  • Assist in troubleshooting and root cause analysis for environmental issues as they arise.
  • Proactively identify and communicate potential problems and issues to project team members/leaders.
  • Identify, receive, triage and act upon events and incidents coming from various SaaS services
  • Consistently meets or exceeds established Command Center key performance indicators (KPIs)
  • Working under pressure in production environments running production customer workloads and services
  • Ability to work global teams.
  • Job Responsibilities include providing24/7remote support.


Required Skills

  • Minimum 3 years of hands on experience in managing vSphere, vSAN, NSX environment
  • Minimum 3 years of experience with Unix/Linux/WindowsOperating system
  • Must have knowledge on Networking and Storage.
  • Experience in one of the following languages:Python, PowerShell, PowerCLI, Python, Shell Script.
  • Experience working with one of communication tools: Slack, Azendoo, Hipchat
  • Experience working with any of the ticketing tools Jira, ServiceNow, Remedy
  • Excellent written andverbal communicationskills
  • Experience working with internal or external notification tools: Statuspage.io, status.io
  • Knowledge of infrastructure configuration like Puppet, Ansible, etc.
  • Domain knowledge of systems management and ITIL is strongly desired.
  • Good working knowledge of at least one public cloud such as AWS or GCP.

Required Qualifications

  • BS Degree in Computer Science, or a related field

Experience Required :

Fresher

Vacancy :

2 - 4 Hires

Similar Jobs for you

See more recommended jobs