SRE Job at VMware
Job Description
Carbon Black is now part of VMware. As a standalone company, Carbon Black established itself as a leader in the endpoint security space. The product portfolio includes the rapidly growing Carbon Black Cloud platform that delivers next-generation endpoint protection capabilities from the cloud. Now with the full resources of VMware, you have the opportunity to make an impact and build upon Carbon Black's success.
Our Site Reliability Engineering team is tackling problems of and at scale, in a challenging and demanding production environment. You'll design, build, and maintain internally critical systems in the cloud, enabling the successful scale and management of systems by supporting teams, while ensuring uptime, scale, and performance are maintained appropriate to our end users' needs. Here, the engineering opportunities are endless. With this fast-paced, collaborative group, you'll be working together and across the organization to ensure customer success through enablement of teams within VMware Carbon Black.
What You'll Do
Ownership, architecture and management of AWS infrastructure as code (IaC) components such as VPCs, EC2, S3, Kinesis, DynamoDB, Route 53, KMS, OpsWorks, etc.
Own, maintain, and extend logging and monitoring tools, such as Splunk or Kibana and the TICK stack.
Working with configuration management tools in Linux and on AWS: Terraform, Salt, Ansible, Chef, CloudFormation
Provide continual enhancements to our security and operational posture.
Ensuring cloud-based architectures meet availability and recoverability requirements.
Architecture and implementation of cloud-based monitoring, alerting and reporting: CloudWatch, StatusCake, Grafana, Splunk Dashboarding
Develop Remediation as a Service, through self-healing automation, utilizing tools provided and building where needed.
What You'll Bring
B.S. in Computer Science or equivalent experience
Minimum 4 years of experience managing AWS infrastructure in a production environment
Minimum 4 years of experience with operations-based development
Minimum 4 years working with a preferred scripting language such as Python, Ruby, or Bash
Experience working with configuration management tools at scale, such as Salt or Ansible
Working knowledge of containerization platforms such as Docker
Solid knowledge of open logging and monitoring tools such as Splunk, InfluxDB, and CloudWatch
Solid understanding/experience of web services, databases and related infrastructure/architectures
Solid understanding of backup/restore best practices
Thorough understanding of networking protocols and concepts
Excellent troubleshooting skills
Experience supporting an enterprise-level cloud environment
Security experience a plus

