Monitoring Tools Prometheus Jobs in Bengaluru
449 Jobs Found
Gen AI Support Engineer-2
Exotel
Gen AI Support Engineer-2 Location: Bengaluru Experience: 4 7+ years Employment Type: Full-time About Us Exotel is the leading full-stack customer engagement platform and virtual telecom operator for emerging markets. Since its inception in 2011, Exotel has been powering 50 million daily engagements across voice, video, and messaging channels. We provide our unified customer engagement solutions to over 6000 companies globally, including industry leaders like Ola, Swiggy, Flipkart, GoJek, Byjus, Urban Company, HDFC Bank, Zomato, and Oyo. With $100 million in Series D funding and an ARR of $60 million, Exotel is a growth-stage company poised for massive impact. Overview We're seeking a Gen AI Support Engineer-2 to join our team. As an L2 Support Engineer, you will be the highest level of technical escalation within the support organization. Your role will encompass system reliability, platform integrity, troubleshooting mission-critical production issues, and collaborating with engineering teams for architecture feedback. Additionally, you'll help mentor junior engineers and improve operational processes and tools for large-scale environments. If you're passionate about writing clean code with Python and Django and want to contribute to a fast-paced, mission-driven company, this role is for you! Responsibilities Mission-Critical Issue Resolution: Own the resolution of high-priority, time-sensitive production issues. Root Cause Analysis (RCA): Lead RCA reviews and push for systemic improvements in system architecture and processes. Performance Optimization: Identify bottlenecks and propose architectural changes to improve system performance and scalability. Patch Management: Assist in configuring, deploying, and testing patches, releases, and application updates to production environments. SME for Production Systems: Serve as the Subject Matter Expert (SME) for Exotel's production systems and integrations. Cross-Team Collaboration: Work with Delivery, Product, and Engineering teams to influence system design, rollout strategies, and improvement plans. Mentorship: Lead and mentor L1/L2 engineers on troubleshooting best practices and continuous learning. Code Writing & Automation: Write clean, maintainable code for internal tools, scripts, and automation using Python and Django. Support Tooling: Automate recovery workflows and design support tools for proactive monitoring. Operational Excellence: Establish and improve SLAs, monitoring dashboards, alerting systems, and operational runbooks to ensure system reliability. Must Have Skills Backend Development Support: 3+ years of experience in backend development support, production support, or DevOps/SRE roles. Core Technologies: Proficiency in Python, Django, SQL, and troubleshooting in Linux. Web Technologies: Strong understanding of HTML, CSS, JavaScript, and other web technologies. Distributed Systems & Cloud: Experience working with distributed systems, cloud architecture (AWS), Docker, and Kubernetes. Automation: Strong scripting skills with Bash/Python for automation and operational support. CI/CD & Observability: Good understanding of CI/CD, observability tools, and release management workflows. Communication Skills: Excellent communication, leadership, and incident command skills for managing production issues and cross-functional collaboration. Nice to Have Experience with AI-powered systems and machine learning technologies. Familiarity with monitoring systems like Prometheus, Grafana, or Elasticsearch. Knowledge of microservices architectures and scaling distributed systems. Innovative Work: Be at the forefront of cloud-based communications technology and AI-driven customer engagement platforms. Impact: Play a key role in maintaining and optimizing systems that power millions of customer interactions daily. Growth Opportunities: Be part of a fast-growing company with ample learning opportunities and career development. Collaborative Environment: Work in a supportive, inclusive environment where your input and ideas matter. Competitive Benefits: Comprehensive benefits package including health insurance, mental wellness support, and more.
Senior Java Web Backend Engineer
Blueoptima
Position: Senior Java Web Backend Engineer Job Type: Full-time Location: Bengaluru Department: Engineering About BlueOptima: At BlueOptima, our vision is to become the global reference for optimizing the performance of software engineers across all industries. We provide industry-leading objective metrics in software development, enabling large organizations to deliver better software, faster, and at a lower cost through technology that pushes the limits of what has been done before. As a fast-growing global company, we ve consistently doubled our headcount and revenue year over year, without external investment. Our headquarters is in London, with additional offices in Mexico, India, and the US. Our diverse team consists of 210+ employees from 34+ nationalities and speaks over 25 languages. We foster an open-minded environment and encourage employees to create their own success stories within this high-performance atmosphere. Job Description: We are looking for a Senior Java Web Backend Engineer with extensive experience in designing, building, and maintaining scalable SaaS applications using Java/J2EE technologies. The ideal candidate will be a tech enthusiast, committed to excellence, and eager to take on a leadership role as a mentor to a team of talented engineers. You ll be part of a self-managed Agile team, where you will actively contribute to improving development processes, bringing new ideas to the table, and proposing improvements in methodology, management, and organization. Key Responsibilities: Application Development & Maintenance: Design, develop, implement, test, and maintain application software components. Requirements Analysis: Analyze client requirements and convert them into technical specifications, ensuring alignment with project goals. Feature Ownership: Take ownership of development for new features and continuous improvements to the platform. Performance Optimization: Identify and resolve performance bottlenecks, ensuring high scalability and efficiency of the system. Architecture Improvement: Identify architectural inefficiencies, and create and execute a roadmap to address and resolve them. Leadership & Mentorship: Lead and mentor junior developers, fostering their technical growth and career development. Client Interaction: Provide technical support to client-facing teams and occasionally interact with clients to resolve issues related to your component. What You Need to Succeed at BlueOptima: Education: Minimum Bachelor's degree in Computer Science or equivalent. Self-Sufficiency: Ability to work autonomously with minimal supervision. Problem-Solving Skills: Strong analytical and problem-solving capabilities, coupled with a can-do attitude. Agile Methodologies: Experience with Agile methodologies (e.g., SCRUM, Sprints) and leading small Scrum teams. Commitment to Excellence: Focused on completing tasks efficiently and reliably while identifying the best approach to solving complex problems. Must-Have Technical Skills: Java Expertise: 5+ years of experience with Java, J2EE/Java EE, Spring, and Spring Boot. Architectural Knowledge: Solid understanding of Monolithic, SOA, and Microservices architectures. Concurrency & Thread-Safety: Strong knowledge of Java concurrency patterns and experience building thread-safe applications. Database Skills: Expertise in relational databases, partitioning, indexing techniques, and SQL (PostgreSQL). System Design: Experience creating high and low-level design documents based on application architecture. Linux Proficiency: Familiarity with Linux shell and command-line tools. Testing Skills: Strong grasp of unit testing and integration testing frameworks. Cloud Platform Experience: Hands-on experience with cloud platforms like AWS, Azure, or Google Cloud (e.g., S3, EC2, Lambda). Message Queues & Streaming: Familiarity with message queues (e.g., Kafka, RabbitMQ, SQS) for high-performance, scalable systems. Monitoring & Logging: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, ELK Stack, Splunk). At BlueOptima, we believe in accelerating your career progression. You ll have the opportunity to strengthen your skills, take on diverse challenges, and quickly grow within the organization. We support your development every step of the way, with a clear path to leadership and technical expertise in a fast-paced, innovative environment. Qualification : Bachelor's degree in Computer Science or equivalent
Lead Platform Engineer
Team Vunet Systems
Lead Platform Engineer Observability Solutions Location: Bengaluru Experience: 6 10 Years Function: Observability Engineering | Platform Architecture | SRE Enablement Join VuNet Redefining Digital Observability at Scale VuNet is transforming the future of digital experiences through Business Journey Observability, combining Big Data and AI/ML to empower real-time visibility across payments, banking, and financial services. Monitoring 28+ billion transactions/month, our platform is trusted by top financial institutions and powers over 300 million users. Backed by Series B funding and recognized by Gartner, NASSCOM, and Forbes, we are leading the charge in building a new category of observability, proudly Made in India for global impact. Your Role: Lead Platform Engineer As the Lead Platform Engineer, you will architect and drive the development of packaged observability solutions across 100+ infrastructure and application technologies. You will define **golden signals**, build **data collection strategies**, and lead the standardization of alerts, dashboards, and RCA workflows for platforms like **Kubernetes, Oracle DB, and Tomcat**. This is a cross-functional leadership role that sits at the intersection of product, platform, DevOps, and SRE. You will **lead a team** and influence how observability is delivered, scaled, and adopted across complex environments. Key Responsibilities Observability Solution Development Design and lead the delivery of observability packages for databases, middleware, cloud-native, and legacy platforms. Define and implement data collection pipelines, including agents, APIs, logs, metrics, traces, and service discovery. Establish **golden signals, SLIs/SLOs**, and health KPIs for performance, availability, and anomaly detection. Dashboards, Alerts & RCA Develop standardized, reusable dashboards, alerts, reports, and troubleshooting playbooks. Automate **RCA workflows** to improve MTTR and reduce alert fatigue. Platform Enablement & Integration Work with engineering to enhance agent capabilities and support new data sources/formats. Guide implementation of platform features for better observability at scale. Team Leadership & Governance Lead and mentor a team of observability engineers and specialists. Define design patterns, reusable modules, and version-controlled libraries. Stakeholder Collaboration Partner with product managers, DevOps, SREs, and customer teams to gather requirements, align priorities, and validate use cases. Ensure deliverables are scalable, well-documented, and production-ready. What You Bring Must-Have Skills 6 10 years of experience in observability, platform engineering, or SRE roles. Hands-on with tools like Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, Splunk. Strong understanding of logs, metrics, traces, profiling, and collection strategies. Experience developing solutions for platforms like Kubernetes, Oracle, PostgreSQL, Tomcat, etc. Proficient in Python, Shell scripting, APIs, and automation tools (**Terraform**, etc.). Familiar with alert fatigue mitigation, anomaly detection, and RCA frameworks. Excellent communication, technical leadership, and documentation skills. Nice to Have Experience managing an observability marketplace or solution catalog. Contributions to open-source observability projects. Certifications in Kubernetes, Observability platforms, or cloud providers (AWS/GCP/Azure). Background in ITSM tools, CMDBs, or incident workflow automation. At VuNet, you ll help build a category-defining observability platform that s already transforming critical infrastructure for leading financial institutions. You ll work with passionate engineers, push technical boundaries, and grow in a high-trust, high-impact environment. What You ll Experience: Ownership of key observability initiatives impacting 300M+ users. Collaboration with SRE, DevOps, and product teams across real-time financial systems. Opportunity to experiment with and shape Gen AI, ML, and emerging telemetry trends. Perks & Benefits Health insurance for you, your parents, and dependents. 1:1 mental wellness support. Training programs, certifications, and career growth opportunities. Transparent, inclusive, and high-trust work culture. Access to cutting-edge technology and Gen AI-powered workspaces.
Principal Associate - Full Stack Engineering
Capital One
Principal Associate Full Stack Engineering (GenAI Observability) Location: Bangalore Company: Capital One India About Us At Capital One India, we re tackling some of the most complex problems in financial services using machine learning, advanced analytics, and cloud-first engineering. Our mission is to build cutting-edge, patentable solutions that transform customer experiences, enhance operational efficiency, and ensure robust risk and compliance standards. We re a team of makers, breakers, doers, and disruptors obsessed with turning data into real-world impact at scale. About the Team Machine Learning Experiences (MLX) The MLX team is pioneering the future of model governance, ML observability, and Generative AI infrastructure at Capital One. We re enabling teams to seamlessly deploy ML and GenAI models at scale, with full visibility into performance, health, compliance, and ethical usage. This is the platform powering the next generation of AI-driven financial products across the company. About the Role We re looking for a Principal Associate Full Stack Engineer to lead the development of observability platforms for Generative AI systems. You ll be part of a cross-functional team focused on governance automation, LLM monitoring, and intelligent diagnostics using telemetry data, metadata, and advanced analytics. You ll design systems to collect, analyze, and visualize performance data from our large-scale GenAI infrastructure, helping data scientists and engineers make faster, safer decisions. What You ll Do Lead architecture and development of observability tools and dashboards for monitoring GenAI models and platform health. Design and build core APIs and SDKs to instrument large language models (LLMs) and foundational models (training, fine-tuning, prompting stages). Integrate Generative AI to enable observability features like anomaly detection, predictive analytics, and copilot-assisted troubleshooting. Partner with platform, MLOps, and governance teams to ingest and analyze telemetry, metadata, and runtime metrics at scale. Drive development of tools to ensure compliance with AI ethics, data governance, and industry regulations. Collaborate with product, design, and research to turn complex requirements into scalable, cloud-native software solutions. Lead proof-of-concept initiatives to test and showcase how GenAI can improve platform observability and decision-making. Contribute to the open-source community and stay at the forefront of GenAI and ML infrastructure evolution. Basic Qualifications Bachelor s or Master s degree in Computer Science, Engineering, or related field 4+ years of experience building distributed, data-intensive systems using microservices architecture 4+ years of experience in backend development with Python, Go, or Java 4+ years of expertise with observability stacks (Prometheus, Grafana, ELK) and adapting them for AI systems Strong knowledge of OpenTelemetry, and experience building custom SDKs and APIs 5+ years of hands-on experience with Generative AI models, especially applied to observability, governance, or compliance 2+ years of experience with cloud platforms such as AWS, Azure, or GCP Preferred Qualifications 4+ years building and optimizing ML systems in production environments 3+ years of experience with MLOps tools like MLflow, Kubeflow, or commercial platforms Experience with GenAI frameworks and libraries like LangChain, Haystack, and vector databases (FAISS, Chroma, OpenSearch) Familiarity with emerging observability tools for LLMs such as Langfuse, Phoenix, Helicone, or OpenInference Contributor to open-source GenAI or ML infrastructure projects Author or co-author of published work in AI/ML observability, governance, or performance monitoring Experience with PyTorch, TensorFlow, Spark, or Dask Knowledge of NVIDIA GPU telemetry, CUDA programming, and performance optimization for AI workloads Understanding of AI ethics, data governance, and regulatory frameworks for machine learning systems Why Join Capital One India Work at the intersection of technology, AI, and compliance helping shape the future of responsible AI Join a team driving enterprise-wide adoption of Generative AI Collaborate with world-class engineers, data scientists, and product leaders Enjoy a high-performance culture that encourages innovation, learning, and mentorship Access to cutting-edge tools, open-source contributions, and cloud-native infrastructure Qualification : Bachelors or Masters degree in Computer Science, Engineering, or related field
Devops Engineer-2
Cashfree Payments India Private Limited
Position: DevOps Engineer-2 Location: Bengaluru Employment Type: Full-Time Department: Engineering Job Description: We are looking for a skilled DevOps Engineer-2 to design, implement, and maintain secure, scalable, and highly available infrastructure. You will play a key role in automating infrastructure provisioning, capacity planning, and building robust monitoring and CI/CD pipelines. Responsibilities: Design and implement secure, scalable infrastructure solutions. Automate infrastructure provisioning, demand forecasting, and capacity planning. Develop automation tools and frameworks to enhance system observability, availability, reliability, performance, and latency monitoring. Monitor system health, application performance, security controls, and cost optimization. Participate in sustainable incident response, peer reviews, and blameless postmortems. Lead the adoption and rollout of best DevOps tools and automation practices across services. Build and maintain continuous integration and continuous deployment (CI/CD) pipelines. Required Skills and Experience: Minimum 3 years of experience in DevOps and cloud technologies. Expertise in at least one major cloud platform: AWS, Azure, or GCP. Strong production experience with Kubernetes, including deployment, management, and troubleshooting. Proven ability to design scalable and resilient infrastructure architectures. Proficiency with infrastructure-as-code tools such as Terraform, Pulumi, or CloudFormation. Strong debugging and troubleshooting skills. Deep knowledge of Linux servers and networking fundamentals. Hands-on experience with scripting or programming languages like Python, Shell, Go, or Java. Familiarity with monitoring and observability tools such as DataDog, NewRelic, ELK stack, Prometheus, or Grafana. Understanding of modern cloud-native development practices including microservices architecture and RESTful APIs. Ability to thrive in a fast-paced, dynamic work environment.
Test Engineer
Acqueon
Test Engineer (Senior QA Engineer / SDET) Department: R&D - Engineering Location: Bangalore About Acqueon: Acqueon is a leading provider of Generative AI-powered Revenue Execution Platforms. We empower customer-centric brands to orchestrate multi-channel campaigns and proactively engage consumers through voice, messaging, and email. Trusted by over 200 clients globally, we help enterprises elevate their customer experience, improve revenue recovery, increase sales, and build lasting loyalty. At the heart of Acqueon is a relentless focus on creating delightful, friction-free, and referral-worthy customer experiences using cutting-edge AI and data-driven technology. Position Overview: We are seeking a talented and experienced Senior QA Engineer / SDET to join our growing engineering team. This role focuses heavily on performance testing and test automation, ensuring our applications meet the highest standards of scalability, reliability, and usability. You will work in a collaborative environment with developers, DevOps engineers, and product teams, taking ownership of designing and executing complex test strategies using tools such as JMeter, Gatling, k6.io, and Selenium WebDriver. Key Responsibilities: Design, develop, and execute performance tests for web applications and backend APIs using JMeter, Gatling, or k6.io. Create realistic test scenarios and simulate workloads to evaluate system behavior under varying conditions. Conduct performance tuning and optimization, identify system bottlenecks, and provide recommendations for improvement. Work closely with development teams to analyze test results, diagnose issues, and drive resolutions. Build and maintain automation frameworks using Selenium WebDriver with Java, Cucumber, JUnit, TestNG, or Playwright. Contribute to the integration of performance and functional tests into CI/CD pipelines. Participate in architectural and design discussions to ensure performance considerations are included from the outset. Document test strategies, metrics, and findings, and communicate them clearly across teams. Required Qualifications: Bachelor s degree in Computer Science, Engineering, or a related field. 6 7 years of experience in performance testing and test automation. Strong hands-on experience with JMeter, Gatling, or k6.io. Expertise in building and executing performance test plans for web applications and APIs. Deep understanding of performance metrics, system tuning, and capacity planning. Proficiency in automation using Selenium WebDriver with Java, and frameworks like Cucumber, JUnit, TestNG, or Playwright. Solid knowledge of web technologies, protocols (HTTP/S), and application architecture. Strong analytical skills, attention to detail, and ability to work in dynamic, fast-paced environments. Excellent communication and collaboration skills. Preferred Experience: Experience with Agile/Scrum methodologies. Familiarity with CI/CD pipelines and tools like Jenkins, GitHub Actions, or GitLab CI. Exposure to cloud-based performance testing environments. Experience with monitoring tools (e.g., Grafana, Prometheus, New Relic) is a plus. What We Offer: A fast-paced, high-growth environment working on next-gen customer engagement products. The opportunity to work with cutting-edge technologies and global enterprise clients. A collaborative, people-first culture that values curiosity, ownership, and excellence. If you re passionate about quality, performance, and automation and love solving complex challenges we d love to hear from you. Qualification : Bachelors degree in Computer Science, Engineering, or a related field
Site Reliability Engineer
Groww
Position: Site Reliability Engineer Location: Bengaluru About Groww At Groww, we re on a mission to make financial services simple, accessible, and transparent for every Indian. As one of India s fastest-growing financial platforms, we help millions take control of their financial future through a wide range of products. We re a team driven by ownership, radical customer-centricity, and a deep passion for challenging the status quo. From intuitive design to robust engineering, everything we build is grounded in what our customers need. If you re excited about building systems that power the future of finance in India, we d love to hear from you. Our Vision To empower every Indian with the knowledge, tools, and confidence to make sound financial decisions. Our goal is to be the most trusted financial partner for millions across the country. Our Core Values Customer Obsession We put our users first, always. Extreme Ownership We own everything we do, end-to-end. Simplicity We keep things simple, effective, and intuitive. Long-term Thinking We focus on sustainable, impactful decisions. Transparency We believe in open communication and collaboration. Role Overview: As a Site Reliability Engineer (SRE) at Groww, you will be responsible for ensuring our systems are highly available, performant, and secure. You will work closely with engineering and infrastructure teams to improve reliability, automate deployments, and manage mission-critical services that power our platform. Key Responsibilities: Monitor and troubleshoot issues related to system performance, availability, and security. Define and maintain SLIs, SLOs, and Error Budgets to improve system reliability. Use tools like Grafana to analyze and report on metrics and trace data. Participate in the on-call rotation for 24/7 support of production systems. Collaborate with developers to ensure scalability and reliability are built into new services. Roll out security and infrastructure features proactively. Manage automated deployments, version control, and release rollouts. Perform Root Cause Analysis (RCA) for incidents and implement long-term fixes. Optimize system performance, conduct capacity planning, and create recovery strategies. Identify and automate repetitive tasks to reduce toil. Leverage CI/CD tools such as Git, Jira, Jenkins to streamline development workflows. Requirements: 4 6 years of relevant experience in SRE, DevOps, or infrastructure engineering. Bachelor's or Master's degree in Computer Science or a related field. Strong background in Linux/Unix system administration and networking. Hands-on experience with cloud platforms like GCP or AWS. Proficiency in programming languages such as Python, Java, or Go. Experience with monitoring and alerting tools: Grafana, Prometheus, New Relic, etc. Familiarity with configuration management tools. Experience with Kubernetes, Docker, and container orchestration tools is a strong plus. Excellent problem-solving, communication, and team collaboration skills. Be a part of one of India s fastest-growing fintech startups. Build and scale systems that impact millions of users daily. Work with passionate, driven teammates who are redefining financial services. A culture that encourages continuous learning, ownership, and transparency. If you're ready to help shape the future of fintech infrastructure in India, Groww is the place for you. Let s build something extraordinary together. Qualification : Bachelor's or Master's degree in Computer Science or a related field
Technical Lead Devops
Subex Limited
Position: Technical Lead - DevOps Location: Bangalore Rural, Karnataka, India Department: Data Platform and DevOps Employment Type: Subexian Experience Required: 3 to 6 years Job Overview: We are seeking an experienced Kubernetes Administrator with a strong background in managing containerized environments. The ideal candidate will have 4+ years of hands-on experience in deploying, configuring, and optimizing Kubernetes clusters to drive scalability, reliability, and performance. This is an excellent opportunity to leverage your expertise in Kubernetes orchestration while contributing to the overall success of our platform. Key Responsibilities: Cluster Management: Deploy, configure, and manage Kubernetes clusters both on-premises and across cloud platforms such as AWS, Azure, and GCP. Security & Compliance: Implement best practices for cluster security, including role-based access control (RBAC), network policies, and data encryption at rest and in transit. Automation: Automate cluster provisioning and ongoing management using tools like Terraform, Ansible, or Helm charts, streamlining operations and reducing manual tasks by 40%. Monitoring & Performance: Continuously monitor cluster health and performance metrics using tools like Prometheus, Grafana, ensuring high availability and optimal performance. CI/CD Pipelines: Design and implement CI/CD pipelines for containerized applications using tools such as Jenkins, GitLab CI/CD, and CircleCI to enable smooth continuous delivery. Collaboration: Work closely with development teams to troubleshoot issues, optimize application performance, and ensure compatibility with Kubernetes environments. Security Audits: Conduct regular security audits to identify vulnerabilities and ensure compliance with industry standards. Documentation: Maintain clear and comprehensive documentation for deployment procedures, configuration settings, and troubleshooting guides to enhance knowledge sharing within the team. Infrastructure Management: Administer and maintain Linux/Unix servers and virtualization platforms such as VMware or KVM, ensuring seamless operations across the infrastructure. Backup & Recovery: Implement and manage robust backup and disaster recovery solutions to ensure data integrity and minimize system downtime. Technical Support: Provide expert-level technical support for server and network infrastructure-related issues. Required Skills & Qualifications: Proven experience in Kubernetes deployment, configuration, and administration. Strong command of containerization technologies, including Docker and containerd. Hands-on experience with cloud platforms such as AWS, Azure, and GCP. Proficiency in Infrastructure as Code (IAC) tools like Terraform and Ansible. Familiarity with CI/CD pipelines and automation tools like Jenkins and GitLab CI/CD. Excellent troubleshooting and problem-solving skills. Strong communication and collaboration abilities, with the capability to work effectively across cross-functional teams. If you re passionate about DevOps, Kubernetes, and driving the success of containerized environments, we d love to hear from you!
Senior Associate Infrastructure L1 (AWS)
Publicis Sapient
Senior Associate Infrastructure L1 (AWS) Location: Bengaluru, India Department: Infrastructure & Cloud Engineering Employment Type: Full-Time About the Role As a Senior Associate Infrastructure L1 (AWS), you will design, implement, and manage secure, scalable, and highly available cloud infrastructure for enterprise digital transformation initiatives. You ll collaborate with cross-functional teams to automate deployments, enable DevOps best practices, and ensure robust observability across systems. Your goal is to reduce time-to-market and optimize performance, cost, and compliance. Key Responsibilities Architect and build immutable infrastructure on AWS and/or other cloud platforms. Implement and maintain infrastructure as code using Terraform, CloudFormation, or similar. Manage containerized environments using Kubernetes (EKS/GKE), ECS, Docker, and Helm. Implement service mesh (e.g., Istio) for advanced traffic management, monitoring, and security. Develop and manage CI/CD pipelines using Jenkins, GitLab, CircleCI, or similar. Automate build/deployment processes using Groovy, Go, Python, Shell, or PowerShell. Integrate DevSecOps and security scanning into the software delivery lifecycle. Configure and maintain monitoring, logging, and observability using: Monitoring: Prometheus, Grafana, Datadog, New Relic Logging: ELK Stack, Fluentd, Splunk Observability: OpenTelemetry, Jaeger, Kiali, CloudTrail, Dynatrace Troubleshoot infrastructure, performance, and deployment issues. Collaborate with application teams and stakeholders to ensure high performance and availability of deployed services. Required Skills & Qualifications 4 to 12 years of experience in Cloud Infrastructure & DevOps roles. Bachelor's or Master s degree in Engineering, Computer Science, or related field. Hands-on experience with AWS (EC2, VPC, IAM, Lambda, RDS, CloudWatch, etc.) Solid experience in container orchestration using Kubernetes (EKS/GKE) and infrastructure management. Expert in IaC tools like Terraform (preferred), ARM templates, Pulumi, etc. Proficiency in CI/CD pipeline automation and scripting. Familiarity with cloud-native security practices and vulnerability scanning tools. Experience with DNS, Load Balancers, and high-volume application infrastructure setup. Hands-on experience with artifact repositories like Nexus or Artifactory. Preferred Certifications (Nice to Have) Associate-level certifications in AWS, Azure, or GCP HashiCorp Certified Terraform Associate Benefits Gender-neutral workplace policies 18 paid holidays per year Generous parental leave and new parent transition support Flexible work arrangements Comprehensive Employee Assistance Program (mental & physical wellness) About Publicis Sapient Publicis Sapient is a global digital transformation partner helping established organizations evolve into their future state through technology, data, consulting, and customer-first experiences. With over 20,000 employees across 53 offices, we combine deep domain knowledge with a start-up mindset and agile methods to solve complex business challenges.
Golang Developer
Team Vunet Systems
Golang Developer Location: Bengaluru, India Experience: 3 - 5 Years Job Type: Full-time About VuNet VuNet is a pioneer in Business Journey Observability, leveraging Big Data and Machine Learning to provide end-to-end visibility into customer journeys in financial services. Our platform monitors over 28 billion digital transactions monthly, impacting 300 million users, and powers major banks across India and MEA. Series B funded, recognized by Gartner, Forbes, and NASSCOM Work with cutting-edge observability technology in a fast-paced, innovative environment Collaborative, inclusive culture focused on learning, growth, and ownership Benefits include 100% health insurance (including dependents), mental wellness support, and career development programs Role Overview: Golang Developer You will develop and maintain backend microservices and APIs using Go, contributing to a scalable and robust product architecture. You will collaborate closely with the product and engineering teams, take ownership of deliverables, and help build features that differentiate VuNet s platform in the market. Key Responsibilities Develop backend services and RESTful APIs in Golang Work with product teams to design scalable and maintainable solutions Follow best practices: code reviews, unit testing, and coding standards Troubleshoot, debug, and optimize high-throughput distributed systems Collaborate in an agile startup environment, continuously learning and improving Mandatory Skills 3+ years of Golang development with deep understanding of concurrency and idioms Proficient in building REST APIs, microservices, gRPC, protobuf Experience with Docker, Kubernetes, Git, and CI/CD pipelines (GitLab CI, Jenkins) Familiarity with Kafka or NATS for messaging and streaming Knowledge of time-series/columnar databases (ClickHouse, Prometheus) Experience with metrics/logging/tracing tools (OpenTelemetry, Jaeger, ELK stack) Strong problem-solving and communication skills Good to Have Experience with Terraform, Ansible, Helm (IaC) Understanding Kubernetes and observability concepts (RED/USE metrics, SLOs, SLIs) Familiarity with PostgreSQL, Redis, Cassandra Background in observability, AIOps, or monitoring domains Agile and DevOps practices experience Benefits Comprehensive health insurance for you and family Mental wellness and 1:1 counseling Culture of innovation, growth, and ownership Access to next-gen AI and integrated technology workspace Supportive career growth and training programs
Performance Engineer
Cognite
Performance Engineer Location: Bengaluru (Whitefield) Team: Product Engineering Employment: Full-Time | Hybrid About Cognite Cognite is a global SaaS leader driving industrial digital transformation through AI and data. Our flagship products include Cognite Atlas AI and Cognite Data Fusion (CDF), empowering industries such as Oil & Gas, Chemicals, Pharma, and Manufacturing to harness data at scale. Recognized with multiple industry awards, including 2022 Technology Innovation Leader and 2024 Microsoft Energy & Resources Partner of the Year, we lead the way in innovative industrial solutions. Our Values Impact: Deliver meaningful outcomes with focus and purpose. Ownership: Take initiative, embrace responsibility, and collaborate inclusively. Relentless: Innovate persistently, learn from challenges, and improve continuously. Role & Responsibilities Design, develop, and execute performance and load tests to ensure system scalability, stability, and reliability of Cognite SaaS products. Identify performance bottlenecks and provide actionable insights for improvement. Build and maintain testing frameworks, scripts, and tools to support performance testing initiatives. Collaborate closely with engineering teams to align testing strategies with system architecture. Monitor production system performance and assist in root cause analysis of performance issues. Share performance optimization best practices via documentation, training, and team discussions. Qualifications Bachelor s or Master s degree in Computer Science, IT, or related fields. 3-5 years of experience in performance testing and engineering, preferably in SaaS environments. Proficiency with performance testing tools such as JMeter, Gatling, LoadRunner, BlazeMeter, or equivalents. Strong understanding of CI/CD pipelines and container technologies like Kubernetes and Docker. Solid programming skills in Java, Python, or similar languages. Experience with databases like PostgreSQL. Familiarity with performance monitoring and analysis tools such as Grafana and Prometheus. Preferred Skills Agile methodology experience and working in globally distributed teams. Expertise testing large-scale systems and handling high-volume data loads. Knowledge of React and JSON for test data creation and API performance testing. Diverse global community with 70+ nationalities and strong DEI focus. Modern, vibrant office in Whitefield, Bengaluru with hybrid work culture. Flat organizational structure with direct access to leadership and minimal bureaucracy. Collaborate with world-class talent on ambitious and impactful industrial tech projects. Engage with the wider Cognite community through HUB conversations and partnerships. Make an Impact Join Cognite to help build scalable, high-performing SaaS solutions that empower industrial enterprises globally. We welcome candidates from all backgrounds to apply. Qualification : Bachelors or Masters degree in Computer Science, IT, or related fields.
Lead Devops Engineer
Neuron7.ai
Lead DevOps Engineer Location: Bengaluru, India Employment Type: Full-time, Hybrid About Neuron7.ai Neuron7.ai is a rapidly growing AI-first SaaS company that is revolutionizing the world of service intelligence. Backed by top-tier venture capitalists in Silicon Valley and a distinguished group of angel investors, we are recognized as a startup to watch. Our platform enables enterprises to make accurate service decisions at scale by delivering service predictions in seconds through the analysis of both structured and unstructured data. At Neuron7.ai, you will be part of a dynamic and innovative team that is pushing the boundaries of service intelligence. We value creativity, collaboration, and a relentless commitment to innovation. This is your opportunity to make a meaningful impact on cutting-edge products at scale, in a fast-growing startup environment. About the Team Join a passionate team of professionals focused on optimizing our infrastructure, deployment processes, and overall system performance. We foster a culture of continuous improvement, where every team member is encouraged to contribute ideas and drive impactful projects. As a Lead DevOps Engineer, you will play a pivotal role in shaping the evolution of our infrastructure and operational efficiency. What You ll Do: CI/CD Pipelines: Lead the design, implementation, and management of CI/CD pipelines to automate and streamline deployment processes. Collaboration: Work closely with software development and IT teams to enhance workflows and ensure efficient release cycles. System Monitoring: Monitor and troubleshoot system performance to ensure high availability and reliability of applications across environments. Cloud Infrastructure: Architect and manage cloud infrastructure (AWS, Azure, GCP) for scalable, secure, and performant application environments. Automation: Automate infrastructure provisioning and configuration management using tools like Terraform, Ansible, or similar technologies. Security & Compliance: Conduct regular system audits, implement security best practices, and ensure compliance with industry standards. Mentorship: Mentor and guide junior DevOps engineers, fostering a collaborative, knowledge-sharing, and growth-focused environment. Documentation: Document processes, configurations, and standard operating procedures to enhance team efficiency and maintain operational excellence. What We re Looking For: Experience: 8+ years of experience in DevOps engineering or a related field. Cloud Expertise: Extensive knowledge and hands-on experience with cloud platforms (AWS, Azure, GCP) and associated services (EC2, S3, Lambda, etc.). Containerization: Strong experience with containerization technologies such as Docker and Kubernetes for managing microservices. Automation Skills: Proficiency in scripting languages (Python, Bash, Ruby) for automation tasks and infrastructure-as-code management. Monitoring & Logging: Familiarity with monitoring and logging tools such as Prometheus, Grafana, and the ELK stack. Problem-Solving: Excellent problem-solving skills with a proactive, solutions-oriented mindset for resolving operational challenges. Collaboration: Strong communication skills with the ability to work collaboratively across teams and influence operational best practices. What We Do and Value: At Neuron7.ai, we prioritize integrity, innovation, and a customer-centric approach. Our mission is to use advanced AI technology to improve service decision-making and we are committed to delivering excellence in all aspects of our work. Company Perks & Benefits: Competitive salary, equity, and spot bonuses Paid sick leave Latest MacBook Pro for your work Comprehensive health insurance Paid parental leave Work from home or from our vibrant Bengaluru office with flexible work arrangements Our Commitment to Diversity and Inclusion: Neuron7.ai is committed to fostering a diverse and inclusive workplace. We ensure equal employment opportunities without discrimination based on race, color, religion, sex, sexual orientation, gender identity, age, disability, national origin, marital status, or any other characteristic protected by law. If you re passionate about optimizing deployment processes, improving infrastructure, and driving operational excellence, we d love to hear from you!
Azure Infrastructure Architect
Mathco (themathcompany)
Job Title: Azure Infrastructure Architect Location: Bengaluru, Karnataka, India Department: Engineering About CloudSEK CloudSEK is one of India s leading cybersecurity product companies, focused on leveraging Artificial Intelligence and Machine Learning to identify and resolve digital threats in real-time. With a strong portfolio of products like CloudSEK XVigil, BeVigil, and SVigil, we provide advanced solutions for attack surface monitoring, cybersecurity risk management, and software supply chain protection. We are headquartered in Singapore and have expanded rapidly across India, Southeast Asia, and the Americas. At CloudSEK, we prioritize innovation, agility, and building impactful products that keep our customers ahead of emerging cybersecurity threats. About the Role: Azure Infrastructure Architect CloudSEK is looking for an Azure Infrastructure Architect to design, implement, and optimize cloud infrastructure solutions on Microsoft Azure. The ideal candidate will have deep expertise in Azure services, infrastructure automation, and security best practices, with a strong background in cloud architecture patterns. You will work closely with various stakeholders to align cloud strategies with business objectives, ensuring high availability, security, and scalability of enterprise applications. Key Responsibilities Cloud Architecture & Design: Design and implement highly scalable, resilient, and secure cloud infrastructure solutions on Microsoft Azure. Develop cloud adoption strategies, migration plans, and hybrid cloud architectures to meet business needs. Create architecture blueprints, reference architectures, and best practices tailored to Azure environments. Ensure compliance with the Azure Well-Architected Framework and other industry standards. Infrastructure & Automation: Architect and automate Infrastructure as Code (IaC) using tools like Terraform, ARM templates, Bicep, and Ansible. Implement CI/CD pipelines for infrastructure deployment using Azure DevOps, GitHub Actions, or Jenkins. Automate cloud operations, monitoring, and governance using Azure Automation, PowerShell, and Python. Security & Compliance: Design and enforce Azure security best practices including Azure Policy, RBAC, NSGs, Key Vault, Defender for Cloud, and Sentinel. Ensure adherence to regulatory frameworks like ISO 27001, SOC 2, GDPR, and HIPAA. Perform threat modeling, risk assessments, and define zero-trust architectures. Networking & Hybrid Cloud: Design and optimize Azure Virtual Networks (VNet), ExpressRoute, VPN Gateway, and Load Balancers for seamless operations. Implement hybrid cloud architectures, integrating on-premises data centers with Azure via Azure Arc. Optimize network performance, DNS, CDN, and traffic routing in cloud-native environments. Performance Optimization & Cost Management: Leverage Azure Cost Management to implement cost-effective cloud solutions and manage reserved instances. Optimize cloud resource utilization using Autoscaling, VM sizing, and Serverless computing models. Define and implement cloud governance policies to control costs and improve operational efficiency. Collaboration & Stakeholder Engagement: Collaborate with DevOps, Security, and Application teams to align infrastructure solutions with business needs. Provide technical leadership and guidance to engineering teams, ensuring alignment with industry trends and best practices. Act as a trusted advisor to leadership on cloud strategies, emerging technologies, and technical decision-making. Required Skills & Qualifications Technical Expertise: Azure Services: Experience with Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, Azure Storage, Azure SQL, CosmosDB, and Azure Networking (VNet, NSG, VPN, ExpressRoute). Automation & IaC: Expertise in Terraform, ARM Templates, Bicep, Ansible, PowerShell, Python. DevOps & CI/CD: Knowledge of Azure DevOps, GitHub Actions, Jenkins, Kubernetes, Docker. Security & Compliance: Familiarity with Azure Security Center, Azure AD, Key Vault, Microsoft Defender for Cloud, Sentinel, and Zero Trust Model. Networking & Hybrid Cloud: Experience with ExpressRoute, Load Balancers, Private Link, Virtual WAN, Azure Arc. Monitoring & Logging: Proficiency with Azure Monitor, Log Analytics, Prometheus, and Grafana. Cloud Migration: Hands-on experience in cloud migrations using Azure Migrate, including planning and execution. Soft Skills: Strong problem-solving and analytical abilities. Excellent communication and stakeholder management skills. Ability to work in agile environments and across cross-functional teams. A passion for continuous learning and staying updated with the latest Azure technologies. Preferred Qualifications: Azure Certifications such as Azure Solutions Architect Expert (AZ-305), Azure Security Engineer (AZ-500), or Azure DevOps Engineer (AZ-400). Experience with multi-cloud environments (AWS, GCP). Familiarity with database technologies (SQL, NoSQL, PostgreSQL, MySQL). Location Bengaluru, Karnataka, India Education/Qualification Bachelor s degree in Engineering or Technology (B.Tech/BE). Years of Experience 10 to 15 years of professional experience in cloud infrastructure and architecture design. Be part of a fast-growing startup where you can make a direct impact. Competitive salary and a comprehensive benefits package. Access to cutting-edge technologies and continuous learning opportunities. Flexible working hours and a collaborative work culture focused on innovation and growth. If you're an experienced Azure Infrastructure Architect passionate about designing and optimizing cloud solutions with Microsoft Azure, we would love to hear from you! Join CloudSEK and be part of a forward-thinking team revolutionizing the cybersecurity landscape. Qualification : Bachelors degree in Engineering or Technology (B.Tech/BE).
Devops Engineer
Sarvam
DevOps Engineer Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a cutting-edge generative AI startup headquartered in Bengaluru, India, with a mission to make generative AI accessible and impactful for Bharat. Founded by AI experts, we are dedicated to developing high-performance, cost-effective AI agents tailored for the Indian market. We enable enterprises to tap into new opportunities, build deeper customer connections, and reshape the future of AI for India and beyond. Role Overview We are looking for a DevOps Engineer to join our team and help build and manage scalable, secure, and high-performance infrastructure. In this role, you will be a key contributor to automating deployments, managing cloud infrastructure, optimizing CI/CD workflows, and ensuring system reliability. You will work with cutting-edge technologies, including cloud platforms, containerization, and infrastructure as code (IaC), to deliver impactful solutions for AI-driven products. Key Responsibilities CI/CD Pipelines: Design, implement, and manage CI/CD pipelines for seamless software deployment and integration. Cloud Infrastructure: Deploy and manage cloud infrastructure using Terraform, Kubernetes, and Docker for scalability and high performance. Automation & Scaling: Automate infrastructure provisioning, scaling, and security compliance to support high-availability environments. Monitoring & Optimization: Implement logging, monitoring, and alerting solutions using tools like Prometheus, Grafana, ELK Stack, or CloudWatch to monitor system performance and optimize resource utilization. Security & Compliance: Enhance security and compliance by managing IAM policies, encryption, and vulnerability scanning. Troubleshooting & Root Cause Analysis: Troubleshoot system failures, perform root cause analysis, and implement improvements to ensure reliability and uptime. Collaboration: Work closely with development teams to ensure smooth deployment and operation of AI models and applications. Must-Have Skills & Qualifications Educational Background: Bachelor s degree in Computer Science, Engineering, or related field (2024/2025 graduates). Cloud Expertise: Strong experience with AWS, Azure, or GCP for deploying and managing cloud-based applications. Containerization: Proficiency in Docker and Kubernetes for building and managing containerized applications. Infrastructure as Code (IaC): Experience with Terraform, Ansible, or CloudFormation to automate infrastructure management. CI/CD Pipelines: Experience in setting up automated workflows using tools like GitHub Actions, Jenkins, or GitLab CI/CD for smooth deployments. Monitoring & Logging: Experience with Prometheus, Grafana, ELK, or similar tools to implement effective monitoring and logging solutions. Networking & Security: Strong understanding of firewalls, VPNs, SSL, and cloud security best practices for secure infrastructure. Version Control: Proficiency with Git for managing code repositories and version control workflows. Problem Solving: Strong debugging, troubleshooting, and analytical skills to resolve complex system issues. Good to Have (Preferred Experience) Serverless Computing: Exposure to serverless computing models such as AWS Lambda or Azure Functions. Message Queues: Experience with message queues like Kafka, RabbitMQ, or SQS. Site Reliability Engineering (SRE): Familiarity with SRE practices to ensure the reliability and availability of large-scale systems. Open Source Contributions: Contributions to open-source projects or a strong GitHub portfolio showcasing DevOps expertise and best practices. Impactful Work: Work on AI-driven products that are reshaping the future of technology in India. Innovative Team: Collaborate with a team of AI experts and engineers pushing the boundaries of technology. Career Growth: Opportunity to grow in a fast-growing startup at the forefront of the generative AI revolution. Cutting-edge Technologies: Work with cloud technologies, automation, and AI infrastructure to create high-impact products. Qualification : Bachelors degree in Computer Science, Engineering, or related field
Senior Software Engineer - Devops
Finbox
DevOps Engineer | FinBox Location: India (Specific location not mentioned) Experience: 5+ Years About FinBox: Where Fintech Meets Fun! Welcome to FinBox, the buzzing hive of tech innovation and creativity! Since our inception in 2017, FinBox has built some of the most advanced technologies in the financial services space that help lenders like Banks, NBFCs and large enterprises build and launch credit products within a matter of days, not months or years. FinBox is a Series A funded company which is expanding globally with offices in India, Vietnam, Indonesia and Philippines. Our vision is to build the best-in-class infrastructure for lending products and help Banks & Financial Services companies across the world scale and launch credit programs that set a new standard in the era of digital finance. So far, we ve helped our customers disburse Billions of Dollars in credit across unsecured and secured credit including personal loans, working capital loans, business loans, mortgage and education loans. FinBox solutions are already being used by over 100+ companies to deliver credit to over 5 million customers every month. Why Should You be a FinBoxer: Innovative Environment: At FinBox, we foster a culture of creativity and experimentation, encouraging our team to push the boundaries of what's possible in fintech. Impactful Work: Your contributions will directly impact the lives of millions, helping to provide fair and accessible credit to individuals and businesses alike. Growth Opportunities: We are a Series A funded startup and have ample opportunities for growth, professional development and career advancement. Collaborative Culture: Join a diverse and inclusive team of experts who are passionate about making a difference and supporting one another. Who s a Great FinBoxer: At FinBox, we re on the lookout for exceptional folks who are all about innovation and impact. If you re excited to shake things up in the banking & financial services world, keep reading! Creative Thinkers: If your brain is always bubbling with out-of-the-box ideas and wild solutions, you re our kind of person. We love disruptors who challenge the norm and bring fresh perspectives to the table. Customer Heroes: Our customers are our champions, and we need heroes who can understand their needs, deliver magical experiences, and go above and beyond to keep them happy. Team Players: We believe in the power of we. If you thrive in a collaborative environment, value different viewpoints, and enjoy being part of a spirited, supportive team, you ll fit right in. About the Job As a DevOps Engineer at FinBox, you will also impart your strong understanding of cloud computing and infrastructure-as-a-service tools. You will be given ownership for taking new initiatives to implement industry best practices in process improvements, implementing automation and orchestration, and ensuring that software applications are delivered quickly, reliably, and at scale. How You'll Contribute Build and set up new development tools and infrastructure which can be helpful in scaling the product Strong understanding of CI/CD to implement and maintain automated build, test, and deployment pipelines to ensure continuous delivery of high-quality software Design, build and maintain an automated secure & resilient cloud infrastructure Collaborate with other developers to ensure that development follows established processes & practices Implement and maintain a practice of monitoring and logging to ensure the availability, performance and security of the application Enhance the development and delivery process by implementing best practices and tools in the industry Troubleshoot and resolve issues related to software development, deployment, and infrastructure by identifying the root cause Provide Day 2 technical support with products and the deployments and document the phases involved throughout the process Train and guide other members on the best DevOps practices and tools Who You Are An exceptional 5+ years of experience in handling CI/CD and implementing the best development practices In-depth knowledge of scripting languages, preferred are Go/Python/Bash In-depth working knowledge in Kubernetes,Terraform, Docker, Jenkins, etc Strong knowledge of system design & algorithms Hands-on experience on the cloud provider, AWS Strong knowledge of automating existing systems, repetitive tasks and deployments Carrying a knack for promoting high-quality coding practices with clean and reusable code Fair understanding of monitoring frameworks such as Grafana, Prometheus, Datadog, etc
Oracle Cloud Operation Engineer
Oracle
Job Description: SaaS Cloud Ops Specialist We are looking for SaaS Cloud Ops specialists involved in managing and supporting cloud-based applications, databases, and services. These roles can include: Designing, planning, implementing, onboarding, configuring, and managing cloud environments and applications; troubleshooting and resolving cloud services issues; maintaining, monitoring, planning, and documenting; and infrastructure-level automation experience. Career Level - IC3 Responsibilities As part of the Oracle Finance GIU - Banking-Application Management Support team, SaaSOps will be taking complete responsibility for supporting & maintaining OCI cloud-based applications, environments, and databases on OCI (Oracle Cloud). The new hire is expected to support 24x7 Production Operations for SaaS customers, associated banking cloud services, and products. Candidate should have expertise in the below (at least 3-4 from below): Kubernetes administration (Mandatory) Oracle Database administrator (Mandatory) OCI administration / or any other cloud administration (Mandatory) Linux (Mandatory) Excellent Communication Skills (Mandatory) 24*7 Production Operations (Mandatory) Expertise in Autonomous Database Automation experience CI/CD Pipelines Knowledge in GIT Repository Disaster Recovery (DR) SaaSOps is expected to possess strong troubleshooting skills and will need to work on a ticketing-based system to resolve issues and monitor various aspects of the cloud services as part of the day-to-day job. Also, he/she will work on critical and non-critical issues from the queues, escalation channels, and other modes of assignments. The candidate would be expected to update Service Requests with technical and non-technical solutions, meet SLA requirements, and interact with other functional teams, customers, customer management teams, and Product engineering teams as and when required.
Site Reliability Developer 2/3
Oracle
Job Description: Site Reliability Engineer - OCI Cloud Engineering Team Role: Site Reliability Engineer (SRE) Team: OCI OLTP (Online Transaction Processing) Location: Kiev Career Level: IC2 Experience: 5+ years Overview: Oracle Cloud Infrastructure s (OCI) OLTP organization is seeking a Site Reliability Engineer (SRE) to join our dynamic and fast-paced Cloud engineering team. The team is responsible for mission-critical distributed systems and cloud services, and we are looking for an engineer who is deeply interested in databases, distributed systems, and cloud services. If you thrive in an environment where innovation, problem-solving, and operational excellence intersect, this is an exciting opportunity for you! As a member of the SRE services, you will focus on Cloud Services, building deployments, operations, security vulnerability mitigation, and automation. You will be instrumental in fostering a culture of Site Reliability Engineering (SRE) within the team, and your work will directly contribute to ensuring the stability, performance, and reliability of Oracle s global cloud service infrastructure. This role requires someone who is adaptable, highly motivated, and capable of managing large-scale cloud environments with a focus on continuous improvement. Key Responsibilities: Cloud Service Operations & Reliability: Deploy, operate, and maintain large-scale cloud service products in a highly available, fault-tolerant, and scalable environment. Collaborate with internal teams to identify and mitigate cross-team issues that pose operational risks to cloud services. Focus on systems reliability and ensure the continuous availability of cloud services by automating tasks and eliminating manual interventions. Automation & Improvements: Automate operational tasks and improve service deployments, focusing on scaling, performance, and uptime. Contribute to CI/CD systems, ensuring seamless integration and continuous delivery for cloud-based services. Leverage automation tools such as Terraform, Grafana, and Bitbucket to streamline operations. Security & Incident Response: Mitigate security vulnerabilities within cloud services and ensure compliance with Oracle's security standards. Participate in on-call rotations to provide immediate troubleshooting support and ensure rapid issue resolution. Perform deep analysis of service performance and collaborate with team members to diagnose and resolve issues that affect service availability or performance. Collaborative Problem-Solving: Work closely with cross-functional teams, including development, database, networking, and storage experts, to ensure the reliability and performance of services. Identify systemic issues and potential risks, develop solutions, and ensure proper documentation and communication with stakeholders. Documentation & Knowledge Sharing: Contribute to documentation such as runbooks, operational guides, and troubleshooting manuals. Mentor junior engineers and share knowledge on best practices for site reliability engineering and cloud service operations. Continuous Learning: Stay up to date with new cloud technologies, trends, and best practices, and actively implement them in your day-to-day work. Technical and Professional Requirements: Cloud Services & Infrastructure: 5+ years of experience in SRE, DevOps, or Automation roles with a focus on large-scale infrastructure and cloud services. Hands-on experience with cloud platforms (e.g., OCI, AWS, Azure) and expertise in compute, database, networking, and storage services within cloud environments. Automation & Tooling: Proficiency with automation tools such as Terraform, Grafana, LumberJack, and Shepherd. Solid experience in using CI/CD tools and processes for cloud service deployments and operations. Scripting & Systems: Strong knowledge of scripting languages, particularly Python and Java. Familiarity with Linux systems, docker containers, virtualized infrastructure, and orchestration (e.g., Kubernetes). Performance & Troubleshooting: Excellent troubleshooting skills with a focus on performance, availability, reliability, and scalability of distributed systems. Experience in operating fault-tolerant, highly available, high-throughput distributed systems. Security & Incident Management: Familiarity with security practices and mitigating security vulnerabilities in cloud services. Proven ability to handle incident response and provide efficient troubleshooting during on-call rotations. Collaboration & Communication: Strong verbal and written communication skills, capable of working effectively with diverse teams across multiple geographies. Ability to work in a highly collaborative environment, driving operational excellence and customer satisfaction. Preferred Qualifications: Experience in operating and maintaining multi-tenant, cloud-based infrastructure with a focus on scalability and high availability. Familiarity with tools and platforms like Grafana, Prometheus, and other observability and monitoring tools. Experience in networking and storage technologies in a cloud environment. Joining OCI s OLTP team as an SRE gives you the opportunity to work with cutting-edge technologies and contribute to the operational excellence of Oracle s global cloud infrastructure. This is a chance to grow your skills in a highly dynamic environment and to solve complex problems that directly impact mission-critical cloud services. With a focus on automation, scalability, and high performance, you will be an essential part of a team that powers Oracle s leading cloud services. If you are an experienced engineer passionate about cloud technologies, automation, and ensuring the reliability of large-scale systems, we encourage you to apply and join us in this exciting journey!
Software Development Engineer - 2
Locus
Job Title: Software Development Engineer - 2 Location: Bangalore (On-site; full-time) About Locus: At Locus, we are redefining logistics decision-making with deep-tech solutions that drive efficiency, consistency, and transparency across industries like retail and FMCG/CPG. Founded in 2015 by Nishith Rastogi and Geet Garg, Locus has evolved from a women s safety geo-tracking app into a globally recognized logistics optimization platform. Our technology has empowered enterprises such as Unilever and Nestl to execute over a billion deliveries across 30+ countries. Guided by our commitment to innovation and sustainable growth, we transform complex supply chains into strategic growth enablers. Join us at Locus and be part of a team shaping the future of global logistics. Job Overview: About the Role: As an Software Development Engineer -2, Backend Engineer at Locus, you will play a pivotal role in building robust, scalable, and high-performance backend systems. You will be at the forefront of designing solutions that can handle millions of transactions, ensuring reliability, security, and innovation across our products. Key Responsibilities: System Design: Architect scalable backend services and APIs, focusing on low-latency and high-throughput systems. Core Development: Build, test, and deploy features using Java, ensuring code quality and maintainability. Performance Optimization: Analyze and optimize application performance and scalability by addressing bottlenecks and implementing efficient algorithms. Database Management: Design, query, and maintain complex databases (relational and NoSQL), ensuring data consistency and availability. Integration: Collaborate with frontend and data teams to integrate backend services seamlessly. Ownership: Take end-to-end responsibility for assigned modules or features, from requirements gathering to production deployment and monitoring. Security: Implement robust security practices to safeguard systems and user data. Code Reviews: Conduct thorough peer reviews to maintain coding standards and share knowledge within the team. Mentorship: Guide junior engineers, fostering a culture of learning and innovation. Skills and Qualifications: Core Expertise: Proficiency in Java and frameworks like Spring Boot. Database Knowledge: Experience with MySQL, PostgreSQL, or similar, along with hands-on knowledge of NoSQL solutions like MongoDB or Cassandra. Cloud Experience: Familiarity with AWS, Azure, or GCP for deployment and infrastructure management. Tooling: Experience with CI/CD pipelines, version control systems (Git), and monitoring tools like Prometheus or Grafana. Problem-Solving: Strong analytical skills with a focus on algorithms, data structures, and system design. Collaboration: Ability to work closely with cross-functional teams and adapt to a fast-paced environment. Education: Bachelor's or Master s degree in Computer Science, Engineering, or a related field. Join Locus and become part of a visionary team that is redefining logistics through innovation and smart distribution. We provide competitive compensation, comprehensive benefits, and a collaborative environment where your expertise will drive both your growth and that of the organization. Locus is an equal opportunity employer dedicated to creating a diverse and inclusive workplace.
Application Developer: Cloud Fullstack
International Business Machines Corporation
Job Title: Application Developer - IBM Consulting Introduction: As an Application Developer in one of IBM's Consulting Client Innovation Centers (Delivery Centers), you'll be at the forefront of delivering deep technical expertise to both public and private sector clients worldwide. Our delivery centers offer locally based skills that drive innovation and facilitate the adoption of new technologies. At IBM, your role will involve transforming business requirements into code, contributing to the development of customized systems in an agile environment. By leveraging the latest tools, technologies, and education, you will help accelerate IBM's and its clients' digital transformations globally. Your work will integrate seamlessly with enterprise systems, creating solutions that drive innovation and business success. This is an exciting opportunity to make a global impact while advancing your career in one of the world's leading technology companies. Your Role and Responsibilities: Solution Design & Development: Address functional needs by designing and developing solutions using multiple technologies, converting high-level designs into functional and technical specifications. Application Support & Performance: Provide functional support services to ensure applications meet customer performance, availability, service level agreements (SLAs), and satisfaction targets. Project Management & Governance: Ensure adherence to project management practices and processes such as software application development, testing, service management, change management, and root cause analysis (RCA). Project Execution: Plan and manage medium to large-scale, complex, integrated application or platform projects, ensuring they are executed effectively within scope, timeline, budget, and quality parameters. Best Practices & Design Reviews: Define and promote coding best practices within your team. Perform design reviews to ensure the robustness and quality of the developed solutions. Required Technical and Professional Expertise: Languages & Frameworks: Proficiency in Java 8 and above, Spring Boot, and REST API Design & Development. Database & ORM: Experience with Spring Data/JPA/Hibernate and working knowledge of databases such as SQL Server/DB2. Containerization & Orchestration: Expertise in Docker, container orchestration platforms like OpenShift or Kubernetes. Messaging Systems: Familiarity with messaging platforms such as RabbitMQ and Kafka. CI/CD: Experience with continuous integration and continuous deployment tools such as Azure DevOps and Drone.io. Monitoring & Alerting: Knowledge of monitoring and alerting systems like AppDynamics and Prometheus. Preferred Technical and Professional Expertise: Log Management & Visualization: Experience with the ELK Stack (Elasticsearch, Logstash, Kibana) for log management and analysis. Data Visualization & Monitoring: Familiarity with Grafana for data visualization. Cloud Computing: Experience with AWS and cloud-based application deployments. Innovative Culture: At IBM, you ll work with cutting-edge technologies that are transforming industries globally. Global Impact: Your work will directly contribute to the success of clients worldwide by delivering impactful solutions. Career Growth: Gain access to professional development programs, training, and mentorship that will accelerate your career. Dynamic Work Environment: Join a diverse, collaborative team where creativity and new ideas are highly encouraged. If you're passionate about developing innovative solutions, transforming business needs into code, and working with the latest technologies, apply today to join IBM Consulting and make a significant global impact.
Senior Engineer - IT Software Development & Operations
Sasken Technologies
Job Title: Senior Engineer - IT Software Development & Operations Location: Bengaluru Job Summary The Senior Engineer will be responsible for applying their technical expertise in various aspects of software development and operations, including design, coding, testing, documentation, and technical support. This role requires the ability to handle complex issues, adapt existing methods to solve problems, and deliver results with minimal supervision. The ideal candidate will have strong collaboration skills, consistently seek to improve their technical capabilities, and actively participate in technical initiatives to enhance organizational success. Roles & Responsibilities Design & Development: Responsible for the design, coding, testing, bug fixing, documentation, and technical support within the assigned area. Ensure timely delivery of solutions while meeting quality and productivity goals. Collaboration & Customer Interaction: Regularly collaborate with customer teams to clarify technical issues, resolve queries, and ensure smooth project execution. Participate in key project and work-related activities, providing input on identifying important issues and risks. Process Improvement: Actively seek opportunities to enhance existing skills and acquire new complex technical skills. Participate in technical initiatives related to the project and organization, delivering training and contributing to process improvements. Project Execution: Adhere to organizational guidelines and checklists during deliverable reviews. Provide regular status reports to the Team Lead and ensure that relevant organizational processes are followed. Skill Development: Enhance technical capabilities by attending training sessions, engaging in self-study, and undergoing periodic technical assessments. Education and Experience Education: Engineering Graduate, MCA, or equivalent. Experience: 2-5 years of relevant experience. Competencies Description Digital Automation Engineer: Experienced in designing and implementing engineering processes and automation across phases of the DevOps-based SDLC, including Configuration Management, Build & Release, Test Automation, Deployment, Infrastructure Automation, and Continuous Operations. Configuration Management Specialist: Design, configure, and implement version control, branching, and configuration strategies using source code and version control systems like GIT, GitLab, BitBucket, SVN, CVS, Clearcase. Build Automation Specialist: Experience in Continuous Integration (CI) and Build Automation tools like Jenkins, Bamboo, ANT, Maven, Gradle. Test Automation Specialist: Experience in designing and authoring Test Automation scripts for Mobile, Web, Cross-platform, Web Services, Microservices, and infrastructure testing. Proficient in Black Box, White Box, Functional, Performance, UI, Security, and Regression testing, along with experience in BDD frameworks and device test clouds like Sauce Labs and Xamarin Test Cloud. Deployment Specialist: Expertise in release management strategies, managing package repositories, AMIs, and deploying applications and service packages across cloud and container-based infrastructure. Infrastructure Automation Specialist: Expertise in designing and implementing programmable infrastructure on virtualized and cloud-based environments. Ability to manage IaaS, Configuration Management, Container Management, and Environment Management across cloud platforms (AWS, Azure, etc.). Continuous Operations Specialist: Design, implement, and operate elastic infrastructure, manage application and service monitoring, failover scenarios, scalability, SLAs, and operational dashboards across cloud and virtualized environments. Platforms Linux, Windows, Android, iOS, VMware, OpenStack, Hyper-V Technology Standards AWS, Azure, RESTful APIs, SOAP, Test-Driven Development (TDD), Microservices patterns, Service Mesh, CloudFormation templates. Tools Configuration Management: GIT, GitLab, BitBucket, SVN, Clearcase, Perforce. Build Tools: GNU Make, NMake, ANT, Maven, Gradle, Ivy. CI Tools: Jenkins, Bamboo, CircleCI, AWS DevOps tools, Azure DevOps. Requirement Management: Bugzilla, Jira. Code Review: Gerrit, GitLab, ReviewBoard. Containers: Docker, Docker Swarm, Kubernetes, ECS (Amazon), AKS (Azure). Automation & Configuration Management: Ansible, Chef, Puppet. Cloud-Native DevOps Services (AWS, Azure): Cloud-Native DevOps Services. Testing Tools: Appium, Visual Studio App Center, SauceLabs, Selenium, Black Duck, SOAP UI, Protractor, JUnit, NUnit, LoadRunner, JMeter. Monitoring & Dashboarding: Prometheus, ELK Stack, Grafana. Languages Scripting Languages: Perl, Python, Groovy, Shell Script, PowerShell, YAML, Ansible. Other Programming Languages: Java, C#, XML. Test Automation Languages: Java, Python (for Appium and Sauce Labs). Specialization Key Areas: Configuration Management, Test Automation, Build and Release Automation, Infrastructure Automation, Continuous Operations, Deployment, RPA (Robotic Process Automation). Desired Skills Strong collaboration and communication skills. Ability to manage multiple projects and tasks while ensuring quality delivery. Experience working in an agile development environment. Proactive in identifying and resolving technical challenges. Strong analytical and problem-solving abilities. This is an exciting opportunity for a skilled Senior Engineer to advance their career in the IT Software Development and Operations domain, work on innovative projects, and gain experience across cutting-edge technologies. Qualification : Engineering Graduate, MCA, or equivalent.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted