Lead System Reliability Engineer (loyal) Job in Fulcrum Digital

Lead System Reliability Engineer (loyal)

Fulcrum Digital 4+ weeks ago

Pune, Pune Division, Maharashtra
Not Disclosed
Full-time

Apply Now

Save Job

Job Summary

The Role Provide L2 support to production systems like application, database, middleware components, infrastructure and network components Manage productions incidents end-to-end within defined SLAs with focus on resolution rather than who caused it. Interact with various stake holders such as Release managers, program leads, service managers, development and test leads Review operational readiness requirements such as monitoring and alerting, log rotation and resilience of the components and report the gaps Provide pre-implementation support with activities such as release notes review and implementation dry runs. Protect production components by running health checks, monitoring latency and memory utilization. Automate day-to-day activities and propose changes that improve reliability Participate in CAB and provide feedback on change requests Support the DevOps team in testing the promote pipelines and suggest automation of configuration items. Practice incident management best practices and perform RCA. Participate in disaster recovery tests and operational acceptance tests Analyze the technology stack that makes up the product and optimize recovery time objective. Work with team members spread across and time zones Share knowledge, document improvements and mentor junior resources Use Jenkins to orchestrate builds as well as link to Sonar, Maven, etc. to build out the CI/CD pipeline. Support deployments of code into multiple lower environments. Supporting current processes needed with an emphasis on automating everything as soon as possible. Design, Implement, and enhance our deployment automation based on Chef. We need proven experience designing and implementing an overall release and deployment process. Design and implement a Git based code management strategy that will support multiple environment deployments in parallel. Experience with automation for Branch management, code promotions, and version management. Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement. Requirements Deployments MTF/Prod, Maintenance items (including stop/start, Disaster Recovery-related activities, etc.), CR for changes in MTF/Prod Tools - Log Monitoring Tool - Splunk Application Monitoring tool - DynaTrace Ticketing incident/problem management tool - Remedy Dev-ops Basics - CI-CD Basics, Overview of git, Bit-bucket, SonarQube, Ansible/ Chef, Artifactory Skills - Linux & Shell Scripting ITIL / ITSM PL/SQL Troubleshooting Jenkins - CI/CD, Groovy Scripting/Yaml

Experience Required :

Fresher

Vacancy :

2 - 4 Hires

Apply Now

Save Job

To Receive email alerts for similar jobs

Similar Jobs for you

Lead System Reliability Engineer (loyal)
Fulcrum Digital
- Pune, Pune Division, Maharashtra
4+ weeks ago
Lead System Reliabi...
Fulcrum Digital
- Pune, Pune Division, Maharashtra
4+ weeks ago
Lead Site Reliabili...
Epam Systems
- Pune, Pune Division, Maharashtra
4+ weeks ago
Lead Engineer Hardw...
Whirlpool Corporation
- Pune, Pune Division, Maharashtra
4+ weeks ago
Senior Site Reliabi...
Nvidia
- Pune, Pune Division, Maharashtra
4+ weeks ago
Senior Site Reliabi...
Integral Ad Science
- Pune, Pune Division, Maharashtra
4+ weeks ago
Senior Cloud Site R...
Zs Associates
- Pune, Pune Division, Maharashtra
4+ weeks ago

See more recommended jobs

Your 4 Step Guide to Career Success

Apply for jobs

Create Profile

Schedule Interview

Get Hired

Lead System Reliability Engineer (loyal) Job in Fulcrum Digital