System Admin Kubernetes Job in Agreeya Solutions
System Admin Kubernetes
Agreeya Solutions
4+ weeks ago
- Hyderabad, Telangana
- Not Disclosed
- Full-time
Job Summary
- Solid understanding of LAN/WAN networks
- Knowledge of virtualization platforms (Hyper-V/VMware) and Windows and Linux server operating systems
- Patch management and software deployment
- Linux and Ubuntu OS administration
- Web application server (Tomcat, Apache, WebLogic, WebSphere) expertise, including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
- Strong knowledge of AWS Cloud and the EKS service
- Experience with Docker deployments (Docker Swarm, Kubernetes, OpenShift)
- Knowledge of installing Elasticsearch, Logstash, and Kibana, and of troubleshooting issues related to the ELK stack
- Responsible for implementation and ongoing administration and configuration of Hadoop data platform and Linux environments.
- Experience with New Relic, Grafana, Kibana, PuTTY, custom scripts, and other support tools.
- Implement and improve monitoring and alerting.
- Build and maintain highly available systems on Kubernetes.
- Scripting and automating system tasks to reduce manual repetitive work.
- Recommend and change system parameters or configuration variables to improve overall system performance and stability.
- Look for opportunities for continual improvements in infrastructure and processes.
- Ability to use a wide variety of open source technologies and tools.
- Ensure a scalable process and infrastructure with a focus on high availability.
- Ensure that infrastructure, operations, and application security standards are enforced.
- Willingness to pitch in and help others on the team as needed.
- Provide timely estimates and ongoing progress updates, keeping work transparent through tools like Jira and daily stand-ups.
- Keep abreast of technology trends and best practices, and share those findings with the team.
- Experience working with stateless and stateful workloads
- Experience with CronJobs, singletons, Kubernetes operators, and DaemonSets
- Deep understanding of, and hands-on experience in, monitoring application and infrastructure performance and reliability
- Experience with continuous integration and application delivery, including provisioning, deployment, testing, and version control
- Experience developing automation solutions utilizing Ansible, Jenkins, and Python
- Develop automation to improve our ability to rapidly deploy applications and to monitor them effectively and proactively in a large-scale environment.
- Employ deep troubleshooting and scripting skills to improve availability, capacity, and security.
- Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
- Experience with root cause analysis of critical business and production issues; proficient in incident triage and response; effective when working under pressure
- Design system support documents and production application service run books where needed
- Participate in the on-call support rotation