Site Reliability Engineer Job in Sarva Labs Limited

Site Reliability Engineer

Apply Now
Job Summary

About the role As a Site Reliability Engineer you'll be managing the entire lifecycle of the infrastructure and more products related to web3.0 technology. You'll also design a state of the art CI-CD pipeline, bootstrap on-demand infrastructure to test software and help the team maintain a robust network. Responsibilities Collaborate as an integral part of a team dedicated to developing the MOI protocol using Agile development methodologies. Gain deep knowledge of the MOI protocol and its ecosystem. Perform follow-up items in areas for continuous improvement. Participating in infrastructure design consulting and capacity planning. Gauge the scope and criticality of the impact of issues to properly categorise and prioritise. Combining software and systems knowledge to engineer high-volume distributed systems in a reliable, scalable, and fault-tolerant manner. Troubleshoot problems, outages, and performance issues. Function well in a fast-paced and rapidly-changing environment. Own end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence. Coordinating security implementation work with our Cloud Infrastructure Team and other members of our Security Department. Building tooling that improves our efficiency and efficacy and allows us to move fast in a secure manner. Requirements Independent and self-directed work ethic when participating in a collaborative environment. A good understanding and hands on experience in large-scale distributed systems. Experience in designing and deploying high performance production services with extensive monitoring and logging practices CI/CD automation experience, including understanding of key open source technologies like Jenkins, Docker, and Hashicorp tools like Terraform. Strong experience with Linux, cloud infrastructure (AWS, GCP, AZURE, DO), and containerization (Docker, Kubernetes). Familiarity with at least one programming language (Python, Go, Javascript, etc.). Experience with monitoring and logging tools (Prometheus, Grafana, ELK, etc.). Understanding of networking and security principles. Able to identify, propose and assess solutions, workarounds and resolutions to enhance operational environment. Ability to quickly adapt and learn new technologies in response to changing requirements. Nice To Have Entrepreneurial mindset with a knack for solving complex problems and navigating uncertain situations. Hands on experience with Golang Exposure to blockchain technology and practical experience with distributed systems.

Experience Required :

Fresher

Vacancy :

2 - 4 Hires

Similar Jobs for you

See more recommended jobs