AI Incident Response Jobs in Bengaluru

1455 Jobs Found

AS

Staff Engineer - Software Development

Aviatrix Systems

7+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Staff Engineer - Software Development (Cloud AI & Network Security) Location: Bengaluru Company: Aviatrix Experience Required: 7+ Years About Aviatrix: Aviatrix is a global leader in cloud network security, trusted by over 500 enterprises. We provide a specialized platform for securing multi-cloud environments, giving organizations the control and visibility needed to modernize their cloud strategies. Architectural Focus & Impact As a Staff Engineer, you will architect and deliver advanced AI-driven network security solutions. This role bridges the gap between Distributed Systems (Python/Go), Real-time Telemetry, and LLM-integrated automation to build self-learning, adaptive security infrastructures. Technical Expertise Core Software Engineering: Languages: Deep proficiency in Python and Go (Golang). Distributed Systems: Mastery of Kubernetes, Microservices, and high-scale observability (Prometheus, ELK). Data Pipelines: Experience with real-time stream processing using Kafka, Flink, Kinesis, or Pub/Sub. Networking & Security Domain: Cloud Infrastructure: Expert knowledge of VPC/VNet design, Routing, Load Balancers, and Overlays. Firewall Technologies: Hands-on with Deep Packet Inspection (DPI), NGFW/IDS/IPS, and Cloud-native firewalls (AWS, Azure, GCP). Security Frameworks: Alignment with Zero Trust, NIST CSF, and CIS Benchmarks. AI & Machine Learning Integration: Model Serving: Experience serving ML models via REST or gRPC. Generative AI: Familiarity with LLM integration, RAG (Retrieval-Augmented Generation), LangChain, and vector databases. Key Responsibilities System Architecture: Lead the design of cloud-native microservices for security control planes. AI-Driven Features: Integrate LLMs for Natural Language-to-Firewall Rule translation and automated incident summarization. Technical Leadership: Mentor junior engineers and set high standards through rigorous Design and Code Reviews. Cross-Functional Collaboration: Partner with Data Scientists and Cloud Networking teams to deliver production-grade AI features. Benefits & Why Join Us Regional Package: Comprehensive pension, private medical coverage, and life assurance. Wellbeing: Annual wellbeing stipend and generous holiday allowance. Growth Culture: We value unique career paths and prioritize candidates who are passionate about the intersection of AI and Security.

Engineer Staff Engineer Software Engineer software Software Engineer
CA

Senior Manager, Security Operations Center (soc)

Calix

8+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Senior Manager, Security Operations Center (SOC) Location: Bangalore Type: Full-Time Experience Required: 8+ Years (3+ in Leadership) Role Overview: Strategic Cyber Defense We are seeking a Senior Manager to lead and modernize our SOC operations across enterprise and product environments. You will oversee a high-performance team dedicated to threat detection, advanced detection engineering, and incident response. This role is a strategic blend of technical mastery leveraging AI and SOAR and people leadership, focused on building a resilient, automation-first security culture. Core SOC Service Offerings & Expertise Advanced Defense & Detection: Detection Engineering: Implement Detection-as-Code practices and prioritize backlogs based on the evolving threat landscape. Threat Intelligence & Hunting: Deliver actionable intel and execute structured threat hunting hypotheses to proactively identify stealthy adversaries. Deception & Validation: Manage deception strategies (honeypots/tokens) and use attack emulation tools to validate detection logic effectiveness. Forensics: Lead digital forensic investigations, evidence acquisition, and post-incident analysis. Automation & Technology Stack: Azure Ecosystem: Advanced proficiency with Microsoft Sentinel, Defender XDR, and Defender for Cloud using KQL. Cloud Operations: Strong knowledge of security operations across Azure, AWS, and preferably GCP. SOAR & AI: Champion the integration of Security Orchestration, Automation, and Response (SOAR) and AI to drive SOC efficiency. Key Responsibilities Leadership & Strategy: Team Development: Coach and mentor the SOC team, conducting regular 1-on-1s and fostering a growth-oriented culture to prevent burnout. Roadmap Execution: Help define a comprehensive SOC strategy and maturity framework aligned with organizational risk management. Stakeholder Liaison: Act as a trusted advisor to Product, IT, and Development leaders to integrate security into cross-functional workflows. Metrics & Operational Excellence: Data-Driven Reporting: Develop dashboards (e.g., Power BI) to track KPIs, KRIs, and detection coverage. Incident Lifecycle: Lead the lifecycle of escalated incidents, conduct root cause analysis, and execute tabletop exercises. 24/7 MDR Strategy: Define operational procedures for Managed Detection and Response (MDR) and sustainable on-call rotations. Qualifications for Success Proven Leadership: 8+ years in InfoSec with specific experience leading SOC or MDR functions. Azure Mastery: Deep technical expertise in the Microsoft security stack. Framework Knowledge: Familiarity with MITRE ATT&CK, Purple Teaming, and cloud-native detection. Soft Skills: Exceptional ability to simplify complex technical content for executive-level communication.

Senior Manager Senior manager Security Manager security
ON

Infrastructure Security Leader

Observe.ai Networks Private Limited

9+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Infrastructure Security Leader Location: Bengaluru About Us: Observe.AI Observe.AI is the leading AI-powered platform for customer experience, enabling enterprises to automate customer interactions using AI agents. Our platform ensures natural conversations, delivering predictable outcomes, and is trusted by top companies like DoorDash, Affordable Care, Signify Health, and Verida. Observe.AI blends advanced speech understanding, workflow automation, and enterprise-grade governance to deliver end-to-end AI solutions that optimize both human and AI interactions, providing insights for coaching and quality management. At Observe.AI, we re on a mission to transform customer experiences through AI. As a founding member of our Infrastructure/Cloud Security team, you will have the opportunity to shape and design cloud security from the ground up for a platform trusted by over 80 million users. Reporting directly to the VP of Information Security, you will drive a defense-in-depth approach across infrastructure, IAM, and networks. This is a unique, zero-to-one role where you ll define security strategy, mentor the team, and make a long-lasting impact in a fast-growing AI company. What You ll Be Doing: Security Strategy Development: Design and document security policies, reference architectures, design patterns, and roadmaps to protect our platform. Secure Access & Network Design: Lead efforts to design secure access controls and networks for production environments. Cross-Department Leadership: Collaborate with Corporate IT to implement security measures within the corporate environment. Defense-in-Depth: Implement network segmentation, firewall configurations, VPNs, and deep packet inspection to minimize impact from security incidents. AWS Infrastructure Security: Re-architect AWS infrastructure to enhance security, ensuring that networks, VPCs, and security configurations are optimized. Vulnerability Management: Identify tools and technologies to scan networks, OS, and infrastructure for vulnerabilities, and work with SRE teams to remediate identified risks. Security Compliance: Represent Infrastructure Security in PCI, SOC, ISO, HITRUST, and other regulatory audits, ensuring compliance. Collaborative Design: Partner with engineering teams and architects to ensure infrastructure designs meet both business and security requirements. Stakeholder Collaboration: Work with other teams to integrate up-to-date security features and infrastructure designs across the organization. What You ll Bring to the Role: 9+ years of experience in Software Engineering, Network Security, and AWS Security. Proven track record in designing and implementing secure Cloud Infrastructure, Network Security, and Corporate IT Security. Experience at a SaaS product company with hands-on knowledge of cloud security. Leadership experience in managing Infrastructure Security teams or Security-Focused SRE teams. Strong understanding of network designs, protocols, and certifications like CCNA (or similar). Ability to handle multiple, high-priority projects simultaneously while maintaining focus and quality. Comfort with working off-hours to handle security incidents in a dynamic, fast-paced environment. First-hand experience with major cloud providers, specifically AWS. Deep understanding of large-scale systems and N-tier architectures. Excellent communication skills, able to effectively influence and collaborate with stakeholders across the organization. Perks & Benefits: Medical Insurance: Comprehensive options, including free online doctor consultations. Leave Policies: Yearly privilege and sick leaves as per Karnataka S&E Act, along with generous national, festive, and parental leave. Learning & Development: Access to a fund that supports continuous learning and professional growth. Flexible Benefits: Tax exemptions for meals, PF, etc., along with other flexible benefit plans. Team Culture: Fun events to foster collaboration and culture across the organization.

Infrastructure Security Infrastructure Security Security infrastructure Leader
TV

Lead Software Engineer - Scale & Performance

Team Vunet Systems

6-12 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Lead Software Engineer - Scale & Performance Location: Bengaluru Experience: 6 12 years About VuNet VuNet is a pioneer in Business Journey Observability, using Big Data and Machine Learning to revolutionize digital experiences in the financial services industry. Our platform delivers end-to-end visibility into customer journeys, helping organizations proactively resolve issues, ensure operational resilience, and deliver superior user satisfaction. With over 28 billion digital transactions monitored every month and serving more than 300 million users globally, VuNet is shaping the future of observability for some of the largest banks and financial institutions. We are Series B funded, part of NASSCOM s DeepTech Club, and recognized by global analysts such as Gartner and Omdia. Your Role: Lead Software Engineer - Scale & Performance As a Lead Software Engineer for Scale & Performance, you ll own the performance and scalability benchmarks for VuNet s observability platform. You will work with cutting-edge technologies, design robust test frameworks, and ensure that our platform scales seamlessly to meet the demands of millions of users. Roles & Responsibilities Own performance and scalability benchmarking for key platform components (ingestion pipelines, data storage, and query services). Design and execute load, stress, soak, and capacity tests across microservices, agents, and ingestion layers. Identify and resolve performance bottlenecks in both infrastructure (CPU/memory/IO) and application layers (API latency, throughput, GC behavior). Develop and maintain performance test frameworks, preferably using Kubernetes-based environments. Collaborate with DevOps and SRE teams to optimize system configurations (Kubernetes, Postgres/TimescaleDB, ClickHouse, Kafka) for scale. Implement OpenTelemetry for service instrumentation to monitor system health and latency (p50/p95/p99 metrics). Contribute to capacity planning, scaling strategies (horizontal/vertical), and resource optimization. Analyze production incidents related to scaling issues and drive permanent fixes. Work with engineering teams to design scalable architecture patterns and define SLIs/SLOs for system performance. Document performance baselines, tuning guides, and scalability best practices for internal use. What You Bring Mandatory Skills: Strong background in performance engineering for large-scale distributed systems or SaaS platforms. Expertise in Kubernetes, container runtimes (containerd/Docker), and resource profiling in containerized environments. Solid understanding of Linux internals, CPU/memory profiling, and network stack tuning. Hands-on experience with observability tools (Prometheus, Grafana, OpenTelemetry, Jaeger, Loki, Tempo, etc.). Familiarity with observability platform datastores like ClickHouse, PostgreSQL/TimescaleDB, Elasticsearch, or Cassandra. Experience with performance benchmarking tools such as k6, Locust, JMeter, or custom Golang/Python scripts. Ability to interpret system metrics (CPU usage, memory, GC, latency) and correlate across different layers. Nice-to-Have Skills: Experience with agent benchmarking (OpenTelemetry Collector, custom data shippers). Exposure to streaming systems like Kafka, NATS, or Pulsar. Familiarity with CI/CD pipelines for performance testing and regression tracking. Knowledge of cost optimization and capacity forecasting in cloud environments (AWS/GCP/Azure). Proficiency in Go, Python, or Bash scripting for automation and data analysis. Life at VuNet: At VuNet, we're building a world-class observability platform, and we re just getting started. You ll be part of a passionate, problem-solving team that embraces collaboration, fast learning, and staying ahead of emerging technologies like Gen AI. We foster a high-trust, inclusive culture where collaboration, ownership, and innovation are central to our success. If you're looking to work on cutting-edge tech, make a real impact, and grow with a supportive team you ll fit right in at VuNet. Benefits: Comprehensive health insurance coverage for you, your parents, and dependents. Mental wellness and 1:1 counseling support. A culture that promotes continuous learning, innovation, and career growth. Transparent, inclusive, and high-trust workplace. Opportunities for skill enhancement with training programs focused on new Gen AI technologies.

Lead Software Software lead Engineer Lead Engineer
TV

Mobile App And Observability Sdk Engineer

Team Vunet Systems

3-6 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Mobile App and Observability SDK Engineer Experience: 3 6 Years Location: Bengaluru About VuNet VuNet is a pioneer in Business Journey Observability, revolutionizing the financial services industry with Big Data and Machine Learning. Our cutting-edge platform offers end-to-end visibility into customer journeys, driving proactive issue resolution, operational resilience, and superior user satisfaction. With over 28 billion digital transactions monitored monthly touching 400 million users worldwide we re already powering leading banks and financial institutions across India and MEA. VuNet is Series B funded, part of NASSCOM DeepTech Club, and recognized globally by analysts like Gartner and Omdia. Your Role: Mobile App and Observability SDK Engineer At VuNet, the Product Development Team is dedicated to delivering exceptional customer experiences through scalable products. We are looking for a Mobile App and Observability SDK Engineer to join this team. In this role, you ll be at the forefront of building high-quality mobile applications and advancing our Mobile Real User Monitoring (MRUM) initiatives. You ll capture and translate mobile performance data into actionable insights, helping improve the performance and user experience of mobile apps across various platforms. If you re passionate about mobile engineering, user experience, and observability this role offers a unique opportunity to merge these interests into a groundbreaking solution. Roles & Responsibilities Mobile Application Development: Design, develop, and maintain robust, high-performance mobile applications for iOS and Android using Swift, Kotlin, Flutter, or React Native. Testing & Quality Assurance: Implement unit, integration, and UI testing strategies to ensure the app s quality, stability, and regression coverage. Debugging & Profiling: Identify and resolve performance bottlenecks, ANRs, crashes, and memory leaks using tools like Android Studio Profiler, Xcode Instruments, or Flipper. Crash Analysis & Reporting: Integrate crash analytics tools and develop efficient incident tracking and resolution workflows. Performance Monitoring & Insights: Leverage telemetry, profiling, and analytics data to enhance app performance, responsiveness, and overall user experience. Observability Collaboration: Work with SRE and backend teams to export performance metrics, logs, and traces from mobile clients into centralized observability platforms. Code Quality: Write clean, modular, and well-documented code, adhering to best practices in mobile development and SDK maintenance. What You Bring Mandatory Skills: Mobile App Development: 3+ years of hands-on experience in mobile app development using Flutter, React Native, Swift, or Kotlin (experience in at least two of these). App Lifecycle & Performance: Strong understanding of mobile app lifecycle, UI rendering, asynchronous processing, state management, and performance optimization (ANRs, memory management, network latency). Debugging & Profiling Tools: Proficiency in debugging, profiling, and testing mobile applications using tools like Android Studio Profiler, Xcode Instruments, or Flipper. Crash Analytics: Experience integrating and using crash analytics and reporting tools. CI/CD & SDK Versioning: Familiarity with CI/CD pipelines, automated testing, and SDK versioning. Performance Instrumentation: Interest in observability, monitoring, and performance instrumentation with a willingness to learn OpenTelemetry and RUM concepts. Problem-Solving Mindset: Strong analytical and debugging skills, focused on enhancing performance and reliability. Nice-to-Have Skills: OpenTelemetry & SDKs: Exposure to OpenTelemetry SDKs or other instrumentation frameworks for capturing telemetry data (e.g., traces, metrics, logs). Mobile Observability: Familiarity with mobile observability backends. Session Replay & Mobile Analytics: Knowledge of session replay, user behavior tracking, or mobile analytics SDKs. SRE & Monitoring Practices: Understanding of SRE principles, monitoring best practices, and golden signals. Open Source Contributions: Contributions to open-source SDKs or mobile performance tools. Life at VuNet: At VuNet, we re building a world-class observability platform proudly Made in India. We re just getting started, and we re looking for people like you to join us in tackling some of the most complex challenges in the digital world. Our team is filled with passionate problem-solvers who thrive in a collaborative, fast-paced environment. We embrace continuous learning, adapt quickly, and stay ahead of emerging technologies like Gen AI. If you re looking to work on cutting-edge technology, make a real impact, and grow with a supportive team, you ll feel right at home here at VuNet. Benefits: Comprehensive health insurance coverage for you, your parents, and dependents. Mental wellness and 1:1 counseling support. A learning culture that promotes growth, innovation, and ownership. A transparent, inclusive, and high-trust workplace culture. Access to Gen AI and integrated technology workspaces. Supportive career development programs to expand your skills with various training opportunities.

Mobile Mobile app Observability SDK Mobile sdk
EX

Gen AI Support Engineer-2

Exotel

4-7 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Gen AI Support Engineer-2 Location: Bengaluru Experience: 4 7+ years Employment Type: Full-time About Us Exotel is the leading full-stack customer engagement platform and virtual telecom operator for emerging markets. Since its inception in 2011, Exotel has been powering 50 million daily engagements across voice, video, and messaging channels. We provide our unified customer engagement solutions to over 6000 companies globally, including industry leaders like Ola, Swiggy, Flipkart, GoJek, Byjus, Urban Company, HDFC Bank, Zomato, and Oyo. With $100 million in Series D funding and an ARR of $60 million, Exotel is a growth-stage company poised for massive impact. Overview We're seeking a Gen AI Support Engineer-2 to join our team. As an L2 Support Engineer, you will be the highest level of technical escalation within the support organization. Your role will encompass system reliability, platform integrity, troubleshooting mission-critical production issues, and collaborating with engineering teams for architecture feedback. Additionally, you'll help mentor junior engineers and improve operational processes and tools for large-scale environments. If you're passionate about writing clean code with Python and Django and want to contribute to a fast-paced, mission-driven company, this role is for you! Responsibilities Mission-Critical Issue Resolution: Own the resolution of high-priority, time-sensitive production issues. Root Cause Analysis (RCA): Lead RCA reviews and push for systemic improvements in system architecture and processes. Performance Optimization: Identify bottlenecks and propose architectural changes to improve system performance and scalability. Patch Management: Assist in configuring, deploying, and testing patches, releases, and application updates to production environments. SME for Production Systems: Serve as the Subject Matter Expert (SME) for Exotel's production systems and integrations. Cross-Team Collaboration: Work with Delivery, Product, and Engineering teams to influence system design, rollout strategies, and improvement plans. Mentorship: Lead and mentor L1/L2 engineers on troubleshooting best practices and continuous learning. Code Writing & Automation: Write clean, maintainable code for internal tools, scripts, and automation using Python and Django. Support Tooling: Automate recovery workflows and design support tools for proactive monitoring. Operational Excellence: Establish and improve SLAs, monitoring dashboards, alerting systems, and operational runbooks to ensure system reliability. Must Have Skills Backend Development Support: 3+ years of experience in backend development support, production support, or DevOps/SRE roles. Core Technologies: Proficiency in Python, Django, SQL, and troubleshooting in Linux. Web Technologies: Strong understanding of HTML, CSS, JavaScript, and other web technologies. Distributed Systems & Cloud: Experience working with distributed systems, cloud architecture (AWS), Docker, and Kubernetes. Automation: Strong scripting skills with Bash/Python for automation and operational support. CI/CD & Observability: Good understanding of CI/CD, observability tools, and release management workflows. Communication Skills: Excellent communication, leadership, and incident command skills for managing production issues and cross-functional collaboration. Nice to Have Experience with AI-powered systems and machine learning technologies. Familiarity with monitoring systems like Prometheus, Grafana, or Elasticsearch. Knowledge of microservices architectures and scaling distributed systems. Innovative Work: Be at the forefront of cloud-based communications technology and AI-driven customer engagement platforms. Impact: Play a key role in maintaining and optimizing systems that power millions of customer interactions daily. Growth Opportunities: Be part of a fast-growing company with ample learning opportunities and career development. Collaborative Environment: Work in a supportive, inclusive environment where your input and ideas matter. Competitive Benefits: Comprehensive benefits package including health insurance, mental wellness support, and more.

Ai Gen Ai Support Engineer Ai engineer
OK

Manager, Go-to-market Technology - Support Operations

Okta

7+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Manager, Go-To-Market Technology Support Operations Location: Bengaluru Department: Business Technology Experience: 7+ Years (3+ Years in Team Management) Employment Type: Full-Time About Okta Okta is The World s Identity Company. We empower everyone to securely use any technology, on any device, from anywhere. Our Okta and Auth0 platforms offer secure access, authentication, and automation placing identity at the center of digital transformation and enterprise growth. At Okta, we celebrate diverse backgrounds and experiences. We aren t looking for a perfect fit we re looking for lifelong learners and collaborative builders who bring unique value to our mission. The Team You ll join the Go-To-Market Technology (GTM) group, a core part of Okta s Technology, Data & Intelligence (TDI) organization. Our vision: drive clarity, collaboration, and accountability across the business while enabling Okta s scale and growth. The Opportunity We re seeking a Manager to lead the Support Operations team within GTM Technology. This role is responsible for managing a team of Business Application Administrators who oversee and support Okta s GTM systems primarily Salesforce and integrated applications such as ServiceNow. You ll drive operational excellence, oversee capacity and team development, collaborate cross-functionally, and shape how we support and optimize business applications across global teams. This role requires strong technical know-how of the Salesforce ecosystem, a mindset for process improvement, and a passion for team leadership. Key Responsibilities Leadership & Talent Development Build, motivate, and lead a high-performing team of application administrators. Hire, mentor, and retain top talent through coaching and career planning. Provide direction and remove roadblocks to help your team succeed. Foster a culture of learning, ownership, and continuous improvement. Performance Management Define and track KPIs and team SLAs with a data-driven approach. Manage team resource allocation and adjust capacity as business needs shift. Identify skill gaps and build plans to address them through training and hiring. Cross-Functional Collaboration Partner with Technology, Data & Intelligence, Security, and Compliance teams to align on goals and incident handling. Refine escalation processes for a smooth support experience across teams. Enable seamless knowledge transfer and system supportability. Documentation & Automation Lead the Knowledge Centered Service (KCS) program to scale AI-driven incident resolution. Standardize and document team operational processes to ensure consistency. Security & Compliance Ensure all Salesforce and GTM-related systems adhere to compliance standards such as SOX. Collaborate with security teams on audits and mitigation of any vulnerabilities. Innovation Culture Encourage your team to explore new Salesforce, AI, and automation features. Promote participation in hackathons, Fix-It Days, and other internal innovation initiatives. Required Skills & Experience 7+ years of experience in IT or Business Systems, with 3+ years in people management. Strong expertise in the Salesforce ecosystem and enterprise SaaS tools like ServiceNow, Jira, Confluence, GitHub, etc. Experience in a global or multi-location work environment. Deep understanding of compliance (e.g., SOX) and security standards for enterprise applications. Proven track record of driving team innovation and embedding modern tools or practices. Excellent interpersonal and executive-level communication skills. Strong organizational, time management, and stakeholder alignment capabilities. Ability to remain resilient under pressure and maintain focus on team and business outcomes. High Impact: Drive global support operations for a critical business tech stack. Empowered Leadership: Build and lead a team in a dynamic, growing organization. Growth & Learning: Opportunities for continuous development in technology, leadership, and innovation. Collaborative Culture: Join a purpose-driven company with a human-centered, inclusive team culture. Join Us Become a part of a company that s transforming how identity is secured and scaled in the modern world. At Okta, you belong.

Manager Go Market Market manager Technology
TV

Lead Platform Engineer

Team Vunet Systems

6-10 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Lead Platform Engineer Observability Solutions Location: Bengaluru Experience: 6 10 Years Function: Observability Engineering | Platform Architecture | SRE Enablement Join VuNet Redefining Digital Observability at Scale VuNet is transforming the future of digital experiences through Business Journey Observability, combining Big Data and AI/ML to empower real-time visibility across payments, banking, and financial services. Monitoring 28+ billion transactions/month, our platform is trusted by top financial institutions and powers over 300 million users. Backed by Series B funding and recognized by Gartner, NASSCOM, and Forbes, we are leading the charge in building a new category of observability, proudly Made in India for global impact. Your Role: Lead Platform Engineer As the Lead Platform Engineer, you will architect and drive the development of packaged observability solutions across 100+ infrastructure and application technologies. You will define **golden signals**, build **data collection strategies**, and lead the standardization of alerts, dashboards, and RCA workflows for platforms like **Kubernetes, Oracle DB, and Tomcat**. This is a cross-functional leadership role that sits at the intersection of product, platform, DevOps, and SRE. You will **lead a team** and influence how observability is delivered, scaled, and adopted across complex environments. Key Responsibilities Observability Solution Development Design and lead the delivery of observability packages for databases, middleware, cloud-native, and legacy platforms. Define and implement data collection pipelines, including agents, APIs, logs, metrics, traces, and service discovery. Establish **golden signals, SLIs/SLOs**, and health KPIs for performance, availability, and anomaly detection. Dashboards, Alerts & RCA Develop standardized, reusable dashboards, alerts, reports, and troubleshooting playbooks. Automate **RCA workflows** to improve MTTR and reduce alert fatigue. Platform Enablement & Integration Work with engineering to enhance agent capabilities and support new data sources/formats. Guide implementation of platform features for better observability at scale. Team Leadership & Governance Lead and mentor a team of observability engineers and specialists. Define design patterns, reusable modules, and version-controlled libraries. Stakeholder Collaboration Partner with product managers, DevOps, SREs, and customer teams to gather requirements, align priorities, and validate use cases. Ensure deliverables are scalable, well-documented, and production-ready. What You Bring Must-Have Skills 6 10 years of experience in observability, platform engineering, or SRE roles. Hands-on with tools like Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, Splunk. Strong understanding of logs, metrics, traces, profiling, and collection strategies. Experience developing solutions for platforms like Kubernetes, Oracle, PostgreSQL, Tomcat, etc. Proficient in Python, Shell scripting, APIs, and automation tools (**Terraform**, etc.). Familiar with alert fatigue mitigation, anomaly detection, and RCA frameworks. Excellent communication, technical leadership, and documentation skills. Nice to Have Experience managing an observability marketplace or solution catalog. Contributions to open-source observability projects. Certifications in Kubernetes, Observability platforms, or cloud providers (AWS/GCP/Azure). Background in ITSM tools, CMDBs, or incident workflow automation. At VuNet, you ll help build a category-defining observability platform that s already transforming critical infrastructure for leading financial institutions. You ll work with passionate engineers, push technical boundaries, and grow in a high-trust, high-impact environment. What You ll Experience: Ownership of key observability initiatives impacting 300M+ users. Collaboration with SRE, DevOps, and product teams across real-time financial systems. Opportunity to experiment with and shape Gen AI, ML, and emerging telemetry trends. Perks & Benefits Health insurance for you, your parents, and dependents. 1:1 mental wellness support. Training programs, certifications, and career growth opportunities. Transparent, inclusive, and high-trust work culture. Access to cutting-edge technology and Gen AI-powered workspaces.

Lead Platform Engineer Lead Engineer Engineer lead
AL

Information Security Engineer

Altisource

3-5 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Job Title: Information Security Engineer Location: Bengaluru Company: Altisource (NASDAQ: ASPS) About Altisource At Altisource, we develop cutting-edge technologies and services for the mortgage and real estate industry. We re a trusted partner to 7 of the top 10 U.S. mortgage servicers, operate one of the leading real estate auction platforms, and support a cooperative with over 15% market share in the $1.8 trillion U.S. originations market. If you're passionate about cybersecurity and want to make an impact in a high-growth, tech-driven environment this is the role for you. Position Summary We re looking for a highly motivated Information Security Engineer to support our growing security operations. You will play a vital role in identifying and mitigating security risks across applications, systems, and networks. This role involves vulnerability assessments, code reviews, and automation of security tasks ensuring Altisource remains secure and compliant in a fast-paced environment. Key Responsibilities Conduct vulnerability assessments on applications, networks, and systems. Perform manual verification to reduce false positives and validate security fixes. Communicate identified vulnerabilities and recommend remediation steps to internal teams. Perform secure code reviews and assist development teams in fixing identified issues. Identify and mitigate risks throughout the software development lifecycle. Leverage commercial and open-source tools for vulnerability detection (e.g., Qualys, Nessus, Burp Suite). Assist in internal penetration testing initiatives. Develop internal tools and automate security tasks, leveraging AI where applicable. Stay updated on the latest threats, tools, and best practices in cybersecurity. Create detailed assessment reports and present findings to technical and non-technical stakeholders. Train and mentor team members on vulnerability management processes and tools. Required Qualifications Bachelor s degree in Computer Science, Engineering, or a related field. 3 to 5 years of hands-on experience in information security or related roles. Relevant certifications such as CEH, GIAC, or similar. Solid experience in: Network vulnerability assessments Application scanning and secure code review Windows, Linux, and Unix operating systems Familiarity with OWASP tools, methodologies, and security best practices. Strong communication skills both written and verbal. Preferred Skills Experience with tools like: Qualys, Nessus, Nexpose, SAINT Burp Suite Pro, HP WebInspect Static analysis tools (e.g., IBM AppScan Source, Fortify) Proficiency in one or more programming languages: Java, C, C++, .NET (C#, VB). Experience delivering training or presenting technical content to teams. Background in technical writing or web development is a plus. Be part of a team securing technologies used by top players in the mortgage and real estate space. Work with modern tools and frameworks. Enjoy a collaborative environment that supports innovation, growth, and learning. Qualification : Bachelors degree in Computer Science, Engineering, or a related field

Information Security Information security Engineer Security engineer
SI

Senior Manager, Salesforce Operations

Samsara Inc

3+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Position: Senior Manager, Salesforce Operations Location: Bengaluru, India (Hybrid 3 days onsite) Company: Samsara Technologies India Pvt. Ltd. About Samsara Samsara (NYSE: IOT) leads the Connected Operations Cloud, empowering industries like transportation, agriculture, and manufacturing to harness IoT data for smarter, safer, and more sustainable operations. With a global impact and a fast-scaling culture, Samsara offers unique opportunities to solve real-world challenges with cutting-edge technology. Role Overview Samsara is seeking a Senior Manager, GTMS (Go-to-Market Systems) Operations to lead the Salesforce operations team in Bangalore. Reporting to the Sr. Director of Sales Systems, this role is pivotal in building Samsara s India-based GTMS operations from the ground up, ensuring performance, scalability, and alignment across Sales, Finance, Product, and Business Technology functions. The ideal candidate is an experienced Salesforce operations leader with a passion for systems stability, stakeholder alignment, and continuous process improvement, coupled with strong people leadership and cross-functional collaboration skills. Key Responsibilities Operational Excellence & Governance Lead end-to-end incident and problem management across the Salesforce and GTMS ecosystem. Drive operational stability, reliability, and proactive issue resolution across sales systems. Manage system releases, updates, and quality control processes. Cross-Functional Collaboration Act as a bridge between Sales, Finance, Product, and IT to align systems strategy with business outcomes. Ensure seamless data flow and process integration across enterprise systems. Maintain transparent, regular communication with senior stakeholders. Strategic Planning & Cost Management Build operational strategies that support scale and growth in GTM functions. Optimize resource allocation and control budget and cost efficiency. Support and execute on long-term product and process roadmaps. Team Leadership & Development Build, mentor, and manage L2/L3 operations teams based in India. Foster an inclusive, high-performing team culture with strong talent development practices. Define KPIs and continuously improve team performance through coaching and process optimization. Vendor, Compliance & Risk Management Manage third-party vendor relationships and evaluate tools to enhance operational delivery. Enforce compliance, data security, and privacy standards within the systems landscape. Minimum Qualifications Bachelor s degree in IT, Business, or a related field (Master s preferred). 3+ years experience in a Salesforce-focused operations leadership role. Proven expertise in Salesforce Sales Cloud, CPQ, and GTM systems integration. Deep operational experience in system support, QA, and incident management. Strong executive presence, communication, and stakeholder influencing skills. High level of business acumen and ability to align tech strategy with business goals. Ideal Traits Strategic thinker with a passion for customer-centric system design. Strong collaborator across technical and non-technical teams. Agile leader ready to scale operations in a hyper-growth, data-driven environment. Curious about using AI and automation to elevate system reliability and performance. Qualification : Bachelors degree in IT, Business, or a related field (Masters preferred).

Senior Manager Senior manager Salesforce Salesforce manager
CO

Senior Software Engineer, Customer Solutions

Commure

3+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Job Title: Senior Software Engineer Customer Solutions Location: Bengaluru, India Employment Type: Full-time Department: Engineering About Commure Commure is revolutionizing healthcare with AI-powered technologies designed to eliminate administrative overhead and give clinicians more time with patients. Our platform combines advanced LLM AI, RTLS, and workflow automation to streamline clinical operations, improve patient engagement, and enhance care delivery. We support 250,000+ clinicians across hundreds of care sites nationwide and we re just getting started. If you're passionate about building life-changing solutions in one of the world s most vital industries, now is the time to join. About the Role As a Senior Software Engineer on the Customer Solutions team, you ll be instrumental in building and customizing applications on top of our Patient Experience Platform to address client-specific needs. Your work will directly impact how healthcare providers interact with our technology and serve patients better. Key Responsibilities Translate business and client requirements into scalable, maintainable technical solutions. Design, develop, and integrate customized applications and services using our core platform. Collaborate with internal teams and customers to prioritize features and maintain a customer-focused development backlog. Build long-term client relationships through technical leadership and delivery excellence. Implement and maintain observability through logging, monitoring, and alerting systems. Apply SRE and DevOps practices to improve stability and incident response. Coordinate testing and quality assurance activities in collaboration with QA teams. Stay informed on healthcare tech trends and integrate innovations into the platform. Participate in client-facing meetings to advise on feasibility, risks, and technical trade-offs. Mentor junior engineers and contribute to a strong engineering culture. Required Qualifications Bachelor's or Master s degree in Computer Science, Engineering, or a related field. 3+ years of professional software development experience. Frontend: React, Next.js, TypeScript Backend: Python, Node.js Cloud: Proficiency in AWS, Azure, or GCP with experience in cloud-native architectures CI/CD: Familiarity with tools like GitHub Actions, Google Cloud Build, etc. Infrastructure: Experience with Docker, Kubernetes, and IaC principles Monitoring & Observability: Implemented logging, tracing, and alerting systems Production Support: Experience with on-call rotations and incident response Strong communication and collaboration skills with cross-functional teams Experience working directly with clients to deliver technical solutions Understanding of APIs, webhooks, and third-party system integrations in healthcare Preferred Skills Familiarity with HIPAA, FHIR, HL7, and other healthcare standards Understanding of data privacy, compliance, and security best practices Strong problem-solving abilities and adaptability in dynamic environments Experience in client support, customization, or professional services engineering is a plus Why You ll Love Working at Commure + Athelas Mission-Driven Work Help transform healthcare through meaningful technology. Elite Backing Backed by General Catalyst, Sequoia, Y Combinator, and more. Explosive Growth 500%+ YoY growth pre-merger and Series D funded. Competitive Benefits Flexible PTO, health insurance, parental leave, and more (location-specific). Be part of the future of healthcare. Join Commure and help build intelligent, scalable systems that truly matter. Qualification : Bachelor's or Masters degree in Computer Science, Engineering, or a related field.

Senior Software Senior software Engineer Senior engineer
TV

Devops Engineer

Team Vunet Systems

3+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

DevOps Engineer Location: Bengaluru, India Experience: 3 - 5 Years Job Type: Full-time About VuNet VuNet is a deep-tech leader in Business Journey Observability, leveraging Big Data and Machine Learning to deliver end-to-end digital experience monitoring for major financial institutions. The platform monitors over 28 billion transactions monthly, powering top banks and enterprises in India and MEA. Work on cutting-edge observability technology Join a Series B funded, award-winning startup recognized by Gartner, Forbes, and NASSCOM Collaborate in a fast-paced, innovative environment focused on learning and growth Access to mental wellness support, health insurance (covering family), and career development programs Role Overview: DevOps Engineer Design, develop, and maintain VuSmartMaps deployments across on-premises, cloud, and hybrid environments Automate deployments using Infrastructure-as-Code (IaC) and CI/CD pipelines Manage cybersecurity assessments and remediations for deployments Collaborate with development teams to improve deployment processes and infrastructure support Publish VuSmartMaps in cloud marketplaces (AWS, Azure, GCP) Stay current on DevOps, CI/CD, infrastructure orchestration, cybersecurity, AI workflows, and big data technologies Key Responsibilities Develop and maintain IaC frameworks enabling flexible VuSmartMaps deployment Build and manage CI/CD pipelines using GitHub Actions, Jenkins Monitor infrastructure, conduct cybersecurity testing, and manage patching Improve deployment efficiency and customer experience Collaborate cross-functionally for seamless integration and rollout Must-Have Skills 3+ years building/managing CI/CD pipelines (GitHub Actions, Jenkins) Certified/experienced in Kubernetes, Docker, Terraform, Helm, YAML Hands-on experience with GitOps workflows Knowledge of web servers (Nginx, Django), identity providers (Active Directory, LDAP), load balancers (Traefik) Experience with databases (PostgreSQL, Elasticsearch, Hadoop stack) and secrets management (Key Vault) Familiarity with cloud services (AWS, Azure, GCP) across IaaS, PaaS, SaaS layers Strong Linux and scripting skills (Bash, Python) Excellent communication skills for cross-team collaboration Good-to-Have Skills Exposure to Red Hat OpenShift, VMware, Ansible, Chef, Puppet Familiarity with container orchestration tools (Podman, Docker Swarm, Nomad) Experience optimizing dockerized microservices and container images Benefits Comprehensive health insurance covering you and your family Mental health and 1:1 counseling support Learning culture focused on innovation and career growth Inclusive, transparent workplace culture Access to new Gen AI tools and integrated tech workspace Career development and skill enhancement programs

DevOps Engineer Devops engineer Full-Time Continuous Integration (CI)
CS

Principal Cloud Development Engineer

Cloud Software Group

14+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Job Title: Principal Cloud Development Engineer Location: Bengaluru, India About Cloud Software Group: Cloud Software Group (CSG), home to Citrix and TIBCO, is one of the largest global providers of cloud-based technologies, empowering over 100 million users worldwide. As a Principal Cloud Development Engineer, you will play a pivotal role in shaping the future of Desktop-as-a-Service (DaaS) solutions helping deliver secure, scalable, and intelligent platforms that drive modern work experiences from anywhere. We re entering an era of accelerated innovation and transformation now is the perfect time to bring your technical leadership, cloud expertise, and mentorship mindset to the forefront. About This Team: The DaaS team at CSG is responsible for designing and building scalable and resilient cloud-native microservices that power Citrix s core virtualization offerings. This team collaborates across product, architecture, operations, and customer success groups to build next-gen capabilities on Azure, AWS, and other hybrid environments. Your Role and Responsibilities: As a Principal Cloud Development Engineer, you will be expected to: Lead design and architecture discussions for cloud-native solutions within the Citrix DaaS product line. Drive the development of scalable and secure backend features, with emphasis on business logic, cloud security, and performance. Mentor junior and senior engineers, guiding them in coding best practices, design decisions, and technical growth. Collaborate with Product Managers, UX Designers, Support, and Site Reliability Engineers to build customer-centric features and maintain high service uptime. Contribute to strategic technical initiatives, including the adoption of Gen AI tools, DevSecOps automation, and performance tuning of production systems. Participate in on-call escalation support, helping debug complex issues and lead incident resolution. Promote a culture of continuous learning and improvement through code reviews, technical sessions, and post-incident analysis. Required Experience and Skills: 14+ years of experience in cloud software development using .NET (C#), Java, or equivalent Object-Oriented Programming languages. Strong computer science fundamentals (algorithms, data structures, systems design). Proven track record in building and leading cloud-native microservices with modern deployment practices (CI/CD, IaC, Kubernetes, Docker). Strong cloud platform expertise, especially in Microsoft Azure or Amazon EC2. Deep understanding of cloud security, including identity/access management, encryption, compliance, and incident response. Advanced knowledge in automation scripting (Python, PowerShell). Familiarity with troubleshooting tools like Sumo Logic, Splunk, or equivalent observability platforms. Experience with Terraform, CI/CD pipelines, and managing Kubernetes-based deployments. Strong communication, collaboration, and mentoring abilities. Preferred Qualifications: Prior experience building secure services in the DaaS, VDI, or enterprise SaaS domain. Hands-on experience with Azure Active Directory, Microsoft AD, or other identity solutions. Moderate understanding of cryptographic protocols and encryption standards. Familiarity with Agile/SAFe development methodologies. Contributions to open-source or technical publications are a plus. Impact: Influence the architecture and direction of mission-critical cloud platforms used globally. Mentorship: Be a technical leader shaping the next generation of engineers. Innovation: Work with a company at the edge of a "Cambrian leap" in cloud evolution. Culture: Inclusive, forward-thinking, and driven by curiosity and collaboration. Flexibility & Benefits: Competitive salary, performance bonus, flexible work model, health insurance, wellness programs, and more. Equal Opportunity Statement: Cloud Software Group is committed to Equal Employment Opportunity and prohibits unlawful discrimination of any kind. All qualified applicants will receive consideration without regard to race, color, religion, gender, gender identity or expression, national origin, age, disability, veteran status, or any other characteristic protected by law.

Principal Cloud Development Cloud development Engineer
IB

Infrastructure Specialist-cloud Application Operations

International Business Machines

Fresher | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Infrastructure Specialist Cloud Application Operations Location: Bangalore, Karnataka, India Job Type: Full-Time Experience Level: Mid to Senior-Level Industry: IT Consulting / Cloud Infrastructure Company: IBM Consulting Client Innovation Center Introduction: At IBM Consulting, your career is powered by collaboration, innovation, and the opportunity to work with visionary clients across industries. You'll be part of a global team committed to driving transformation across hybrid cloud and AI. Backed by our cutting-edge technology and strong ecosystem of strategic partners, you'll help shape the future of cloud operations. In this role, you will be based out of one of our IBM Client Innovation Centers in Bangalore, delivering localized skills and deep technical expertise to clients in both the public and private sectors. Your work will help clients adopt next-gen technologies and innovate faster. Your Role & Responsibilities: Provide technical operations support for cloud-based applications, middleware, DevOps processes, security systems, and infrastructure components. Manage Application ID provisioning and access control in accordance with client standards. Enable infrastructure elasticity by implementing auto-scaling mechanisms to optimize resources based on business needs. Collaborate with global teams to ensure seamless incident management, change control, and service delivery. Share expertise and assist in training peers on technical and procedural workflows. Support business continuity by managing Disaster Recovery (DR) protocols and executing manual failovers when needed. Prepare and present daily, weekly, and monthly integrated service management reports summarizing infrastructure health and operations. Required Skills & Experience: Bachelor's degree in Computer Science, Information Technology, or a related field. Strong communication, collaboration, and teamwork skills. Experience working in technical support or cloud operations environments. Familiarity with application support, DevOps workflows, middleware, and security in cloud ecosystems. Ability to train team members on both procedural and technical topics. Preferred Qualifications: Master s degree in a relevant field is a plus. In-depth understanding of Platform-as-a-Service (PaaS) environments, high availability (HA) infrastructures, and load balancer configurations. Experience with service reporting, performance monitoring tools, and integrated ITSM frameworks. Be a part of a global innovation leader. Work on challenging and impactful projects that influence industries. Collaborate in a culture of growth, continuous learning, and mentorship. Enjoy a dynamic work environment with a strong emphasis on client success and personal development. Apply now and become part of IBM s journey to reshape the future of infrastructure and application support. Qualification : Bachelor's degree in Computer Science, Information Technology, or a related field.

Infrastructure Specialist Infrastructure specialist Cloud Cloud Infrastructure
IN

Staff Data Engineer

Intuit

12+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Intuit is a global leader in financial technology, dedicated to helping individuals and businesses thrive. Our suite of products, including TurboTax, Credit Karma, QuickBooks, and Mailchimp, serves approximately 100 million customers worldwide. At Intuit, we believe in providing everyone with the tools and resources they need to achieve financial success. We are constantly innovating to make financial empowerment a reality for all. Job Overview Join the Intuit Data Platform (IDP) team as a Staff Engineer and help us transform the way we handle big data! The IDP team is responsible for the Intuit Analytics Platform, which powers real-time data ingestion, cataloging, analytics, and machine learning across the entire organization. As Intuit s customer base grows, so does the volume of data we process. Our engineering excellence ensures that we can scale and leverage this data to drive machine learning and product innovations. We re in the process of building the next-generation real-time and batch ingestion engine, capable of indexing, cataloging, and organizing data and metadata. We are passionate about using open-source technologies to solve challenges and contributing back to the community. If you're excited about building a platform that will directly impact data scientists and analysts and have a desire to shape the future of data at Intuit, then come join us! Key Responsibilities Architect & Design: Build fault-tolerant and scalable big-data platforms using open-source technologies to handle massive datasets. Data Solutions: Create architecture solutions that address complex use cases like data normalization, lineage, governance, ontology, and discoverability. Cross-Team Collaboration: Work with analysts and data scientists to understand data requirements for building operational propensity models and gaining deep customer insights. Hands-On Coding: Lead development efforts within the Hadoop ecosystem using technologies such as Java MapReduce, Spark, Scala, HBase, and Hive to build and optimize data pipelines for both real-time and batch applications. Database Management: Work with NoSQL, SQL, and in-memory databases to design high-performance data systems. Code Reviews: Ensure code quality, consistency, and adherence to best practices through regular code reviews. Architectural Alignment: Ensure alignment between enterprise architecture and business requirements. Prove Feasibility: Conduct proof-of-concept (POC) experiments for new technologies or approaches and drive them to production. Collaboration with Data Cataloging Team: Work closely with data catalog teams and architects to index and catalog all data sources at Intuit. Agile Leadership: Lead fast-paced development teams using agile methodologies and promote best practices in software development, testing, and incident response. Design & Model: Build dimension models suited for customer business use cases and ensure seamless integration of business and technical requirements. Qualifications Experience: 12+ years of relevant experience, with at least 5+ years specializing in the big data domain. Big Data Architecture: Proven experience in architecting end-to-end ecosystems for big data and analytics platforms. Expert Knowledge: Deep expertise in building fault-tolerant, scalable big data solutions, especially using the Hadoop ecosystem (Hive, HBase, Spark, Kafka, MapReduce, etc.). Programming Expertise: Mastery of Java and Scala, with a focus on building high-throughput data services. Machine Learning: Knowledge of machine learning principles and AI applications in big data. Big-Data Technologies: Familiarity with tools such as HDFS, Storm, Zookeeper, Cassandra, Redshift, GraphDB, and others. Understanding both real-time and batch processing in the Hadoop ecosystem. Communication: Strong communication skills, with an ability to explain complex technical topics to both technical and non-technical audiences. Programming Skills: Intermediate experience in Python or R for data processing. Education: BE/BTech/MS in Computer Science or a related field (or equivalent experience). Collaboration: Demonstrated ability to work cross-functionally and lead change through influence and example. At Intuit, you ll be part of a talented, passionate team working on innovative solutions that shape the future of data analytics and machine learning. As a Staff Engineer, you ll have the chance to work with cutting-edge technologies, build scalable systems, and help revolutionize how Intuit leverages data to drive product innovation. If you're looking for a dynamic environment where you can have a meaningful impact, come join us at Intuit! Qualification : BE/BTech/MS in Computer Science (or equivalent)

Data Engineer Staff Engineer Data Engineer Full-Time
CO

Senior Site Reliability Engineer

Couchbase

5+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Job Title: Site Reliability Engineer (SRE) Cloud Platform & Production Pipeline Initiatives Location: Bangalore, India (Office-based role) About Couchbase: As industries race to embrace AI, traditional database solutions fall short of rising demands for versatility, performance, and affordability. Couchbase is leading the way with Capella, the developer data platform for critical applications in our AI-driven world. By uniting transactional, analytical, mobile, and AI workloads into a seamless, fully managed solution, Couchbase empowers developers and enterprises to build and scale applications with unmatched flexibility, performance, and cost-efficiency from cloud to edge. Trusted by over 30% of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission! Job Overview: As a Site Reliability Engineer (SRE), you will play a pivotal role in managing, optimizing, and maintaining Couchbase s cloud infrastructure for Capella, our Database as a Service (DBaaS) platform. You will be responsible for ensuring the reliability and performance of our cloud service while collaborating closely with engineering teams to improve deployment pipelines, security practices, and overall system health. You will work across cloud platforms and multiple tools to provide guidance, mentorship, and contribute to the strategic direction of cloud operations. Responsibilities: Infrastructure Management: Manage, monitor, and maintain the infrastructure for Capella to ensure reliable operations. Security & Compliance: Implement and manage cloud environments in accordance with company security guidelines, including vulnerability management, penetration testing, and compliance requirements (SOC 2, PCI-DSS, GDPR, HIPAA, etc.). CI/CD & Release Pipeline: Collaborate with engineering teams to optimize CI/CD processes, aiming for a highly resilient deployment strategy, ideally with zero downtime. Cloud Optimization: Stay up-to-date with new technologies and industry trends to continuously improve cloud platform architecture and meet the evolving needs of the business. Security Integration: Work with development teams to integrate security scanners within the DevOps lifecycle, enhancing security posture. Leadership & Mentorship: Provide guidance on architecture, code reviews, and technical feedback to improve service reliability, security, cost, and performance. Incident Management: Demonstrate exceptional problem-solving skills, proactively identifying and addressing potential issues before they affect business operations. Collaboration: Partner with development teams, application owners, and stakeholders to integrate best practices and ensure seamless service delivery. Requirements: Experience: 5+ years in Site Reliability Engineering (SRE), DevSecOps, or similar roles, with significant experience working in public cloud environments. Programming & Scripting: Proficiency in languages such as Go, Python, Java, or Ruby. Linux Expertise: High proficiency with Linux operating systems. Kubernetes Management: Experience in managing and maintaining Kubernetes clusters (both self-managed and managed platforms like AWS EKS). Security & Vulnerability Management: In-depth knowledge of security tools and practices (vulnerability management, pen testing, SCA, DAST, SAST), with hands-on experience using tools like Sysdig, Synk, and Blackduck. Cloud Platforms & Tools: Strong experience with cloud platforms (AWS, GCP, Azure) and open-source tools like Artifactory, Jira, Jenkins, Grafana, Prometheus, Datadog, Thanos, etc. Configuration Management: Proficiency with Terraform, Git, and CI/CD platforms (e.g., CircleCI, GitHub, Spinnaker). Networking Security: Solid understanding of TCP/IP, DNS, HTTP, Firewalls, VPNs, and other networking security concepts. Preferred Skills: Availability & Reliability: Knowledge of SLO/SLA, availability, reliability, and performance concepts. Incident Management: Experience with on-call rotations and incident management. Database Experience: Familiarity with databases, particularly Couchbase. Security Certifications: Relevant certifications in security or cloud technologies are a plus. Couchbase reimagines database technology to deliver a fast, flexible, and affordable cloud database platform, empowering developers to build applications with exceptional customer experiences. Trusted by over 30% of the Fortune 100, Couchbase drives innovation and customer success through its Capella platform. Benefits at Couchbase: Generous Time Off Program: Flexibility to care for yourself and your family. Wellness Benefits: Access to world-class medical plans, dental, vision, life insurance, and employee assistance programs. Financial Planning: RSU equity program, ESPP, retirement planning, and business travel insurance. Career Growth: Focused on your career development and success. Fun Perks: Ergonomic and comfortable office setup, food & snacks for in-office employees, and more!

Senior Site Reliability Site reliability Engineer
DA

Incident Manager

Databricks

8+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

CSQ124R98 At Databricks, an Incident Manager utilizes their technical experience and resourcefulness to lead urgent customer situations to resolution. Responsible for managing frequent, high-quality updates to all internal and external stakeholders, Incident Managers advocate with engineering and leadership, on behalf of their customers, to ensure that escalations are handled with the appropriate level of urgency from stakeholders. The impact you will have: Drive critical customer escalations or widespread outages to conclusion and resolution. Escalate to on-call resources in support and engineering and establish checkpoint calls and action items to ensure that progress is made and status updates are delivered on time. Demonstrate cross-functional leadership while establishing ownership of escalations and outages. Compile and deliver frequent high-quality communications to internal and external stakeholders, including executive staff. Candidate should be comfortable creating concise and effective messaging that is tailored to a technical or executive audience with minimal assistance from others. Commence and lead war rooms while establishing other temporary communication channels as warranted for the duration of an outage. Ability to multi-task on several incidents and/or projects at once. Be a leader who identifies product and process improvements from every incident and submits necessary feedback for improvements. Participate in on-call rotations. What we look for: Minimum 8+ years of experience in customer support, support escalation and incident management is required. Excellent contextual interpretation and writing skill with an effective ability to summarize and communicate to technical and business audiences is required. Demonstrates strong ability to make timely decisions for both business and technical perspectives. Excellent analytical and troubleshooting skills are required. Candidate should be able to demonstrate technical excellence by applying engineering principles to solve complex problems. Hands-on experience developing any two or more of the following: Big Data, Hadoop, Spark, Machine Learning, Artificial Intelligence, Streaming, Kafka, Data Science, ElasticSearch related industry use cases at the production scale. Hands-on experience in the performance tuning/troubleshooting of Spark-based applications at production scale. Proven and real-time experience in JVM and Memory Management techniques such as Garbage collection and Heap/Thread Dump Analysis is required. Working knowledge in Data Lakes and preferably on the SCD types use cases at production scale. Working and hands-on experience with any SQL-based databases, Data Warehousing/ETL technologies like Informatica, DataStage, Oracle, Teradata, SQL Server and MySQL Linux/Unix administration skills and hands-on experience with AWS or Azure or GCP is required. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Incident Manager Incident manager Full-Time Incident management
DA

Manager - Technical Solutions (spark)

Databricks

10-12 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

As a Manager of the Spark Technical Solutions team, you will lead & manage a team of Technical solution engineers and be responsible for driving deep dive technical solutions for any issues reported by Databricks customers. We expect the manager to resolve challenges with comprehensive technical and customer communication skills. You will assist our customers in their Databricks journey and provide them with the guidance, knowledge, and expertise that they need to realise value and achieve their strategic objectives using our products. The impact you will have: As a manager and member of the leadership team, you will be directly responsible for the management of Technical solution engineers, team leads and operations personnel Responsible for directly monitoring, reporting, and driving improvements to team-level metrics and KPIs, acting as an escalation point with customers and internal teams, and optimising and developing support processes and tools Responsible for working across multiple cross functional teams that include Engineering, product management, sales and customer success; manage Hiring, mentoring and onboarding new support engineers Regularly meet one-on-one with your direct reports, conducting annual reviews and career development discussions throughout the year Be a hands on manager to assist the team members in resolving issues related to Spark core internals, Spark SQL, Structured Streaming, Delta, Lakehouse and other databricks runtime features Manage and drive best practices guidance around Spark runtime performance and usage of Spark core libraries and APIs for custom-built solutions developed by Databricks customers; contribute in the development of tools/automation initiatives Own Engineering JIRA tickets and proactively work to bring quicker resolutions to customer reported issues; participate in creation of knowledge base articles Participate in weekend and weekday on-call rotation and run escalations during databricks runtime outages, incident situations, ability to multitask and plan day 2 day activities and provide escalated level of support for critical customer operational issues, etc What we look for: Min 10-12 years of experience in designing, building, testing, and maintaining Python/Java/Scala/Spark based applications in a typical project delivery and consulting environments with 4+ years working as a Manager 5+ years of hands-on experience in developing and leading any two or more of the Big Data, Hadoop, Spark,Machine Learning, Artificial Intelligence, Streaming, Kafka, Data Science, ElasticSearch related industry use cases at the production scale. Spark experience is mandatory Hands on experience in the performance tuning/troubleshooting of Hive and Spark based applications at production scale. Real time experience in JVM and Memory Management techniques such as Garbage collections, Heap/Thread Dump Analysis is preferred Working and hands-on experience with Data lakes and any SQL-based databases, Data Warehousing/ETL technologies like Informatica, DataStage, Oracle, Teradata, SQL Server, MySQL is preferred Hands-on experience with AWS or Azure or GCP is preferred Experience in implementing CI/CD, Monitoring/alerting for Production Systems Technical lead in design, implementation and support of large scale data and analytics solutions that are highly reliable, flexible, and scalable Experience in leading and managing end-to-end projects and have reported and escalated to top levels Experience in managing and leading teams in an organisation involving multiple reporting lines Strong written and verbal communication skills; very good analytical, organisational, multi-tasking skills About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Manager Technical Manager technical Technical manager Solutions
DA

Senior Manager - Technical Solutions (spark)

Databricks

10-12 Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

As a Senior Manager of the Spark Technical Solutions team, you will lead & manage a team of Technical Solution Engineers (Spark) and be responsible for driving deep dive technical solutions for any issues reported by Databricks customers. We expect the manager to resolve challenges with comprehensive technical and customer communication skills. You will assist our customers in their Databricks journey and provide them with the guidance, knowledge, and expertise that they need to realise value and achieve their strategic objectives using our products. The impact you will have: As a manager and member of the leadership team, you will be directly responsible for the management of Technical solution engineers, team leads and operations personnel Responsible for directly monitoring, reporting, and driving improvements to team-level metrics and KPIs, acting as an escalation point with customers and internal teams, and optimising and developing support processes and tools Responsible for working across multiple cross functional teams that include Engineering, product management, sales and customer success; manage Hiring, mentoring and onboarding new support engineers Regularly meet one-on-one with your direct reports, conducting annual reviews and career development discussions throughout the year Be a hands on manager to assist the team members in resolving issues related to Spark core internals, Spark SQL, Structured Streaming, Delta, Lakehouse and other databricks runtime features Manage and drive best practices guidance around Spark runtime performance and usage of Spark core libraries and APIs for custom-built solutions developed by Databricks customers; contribute in the development of tools/automation initiatives Own Engineering JIRA tickets and proactively work to bring quicker resolutions to customer reported issues; participate in creation of knowledge base articles Participate in weekend and weekday on-call rotation and run escalations during databricks runtime outages, incident situations, ability to multitask and plan day 2 day activities and provide escalated level of support for critical customer operational issues, etc What we look for: Min 10-12 years of experience in designing, building, testing, and maintaining Python/Java/Scala/Spark based applications in a typical project delivery and consulting environments with 4+ years working as a Manager 5+ years of hands-on experience in developing and leading any two or more of the Big Data, Hadoop, Spark,Machine Learning, Artificial Intelligence, Streaming, Kafka, Data Science, ElasticSearch related industry use cases at the production scale. Big Data / Spark hands on-experience is mandatory Hands on experience in the performance tuning/troubleshooting of Hive and Spark based applications at production scale. Real time experience in JVM and Memory Management techniques such as Garbage collections, Heap/Thread Dump Analysis is preferred Working and hands-on experience with Data lakes and any SQL-based databases, Data Warehousing/ETL technologies like Informatica, DataStage, Oracle, Teradata, SQL Server, MySQL is preferred Hands-on experience with AWS or Azure or GCP is preferred Experience in implementing CI/CD, Monitoring/alerting for Production Systems Technical lead in design, implementation and support of large scale data and analytics solutions that are highly reliable, flexible, and scalable Experience in leading and managing end-to-end projects and have reported and escalated to top levels Experience in managing and leading teams in an organisation involving multiple reporting lines About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Senior Manager Senior manager Technical Senior technical
DA

Senior Technical Solutions Engineer (platform)

Databricks

5+ Years | Not Disclosed | Bengaluru, Karnataka, India | Full-time

Job Overview: We are seeking a highly skilled Frontline Senior Technical Solutions Engineer with over 5 years of experience to join our Platform Support team. This role is pivotal in delivering exceptional support for our Databricks Data Intelligence platform, addressing complex technical challenges, and ensuring the seamless operation of our data solutions. As a frontline engineer, you will be the primary point of contact for critical issues, working closely with both internal teams and customers to resolve high-impact problems and drive platform improvements. Key Responsibilities: Frontline Support: Serve as the primary technical point of contact for escalated issues related to the Databricks Data Intelligence platform. Provide expert-level troubleshooting, diagnostics, and resolution for complex problems affecting system performance and reliability. Customer Interaction: Engage with customers directly to understand their technical issues and requirements. Provide timely, clear, and actionable solutions to ensure high levels of customer satisfaction. Incident Management: Lead the resolution of high-priority incidents, coordinating with various teams to address and mitigate issues swiftly. Conduct thorough root cause analyses and develop preventive measures to avoid recurrence. Collaboration: Work closely with engineering, product management, and DevOps teams to share insights, identify recurring issues, and drive improvements to the Databricks Data Intelligence platform. Documentation and Knowledge Sharing: Create and maintain detailed documentation on support procedures, known issues, and solutions. Contribute to internal knowledge bases and create training materials to assist other support engineers. Performance Monitoring: Monitor and analyze platform performance metrics to identify potential issues before they impact customers. Implement optimizations and enhancements to improve platform stability and efficiency. Platform Upgrades: Manage and oversee the deployment of Databricks Data Intelligence platform upgrades and patches, ensuring minimal disruption to services and maintaining system integrity. Innovation and Improvement: Stay abreast of industry trends and advancements in Databricks technology. Propose and drive initiatives to enhance platform capabilities and support processes. Customer Feedback: Collect and analyze customer feedback to drive continuous improvement in support processes and platform features. Qualifications: Experience: Minimum of 5 years of hands-on experience in a technical support or engineering role related to Databricks Data Intelligence platform, cloud data platforms, or big data technologies. Technical Skills: A deep understanding of Databricks architecture and Apache Spark, along with experience in cloud platforms like AWS, Azure, or GCP, is essential. Strong capabilities in designing and managing data pipelines, distributed computing are required. Proficiency in Unix/Linux administration, familiarity with DevOps practices, and skills in log analysis and monitoring tools are also crucial for effective troubleshooting and system optimization. Problem-Solving: Demonstrated ability to diagnose and resolve complex technical issues with a strong analytical and methodical approach. Communication: Exceptional verbal and written communication skills, with the ability to effectively convey technical information to both technical and non-technical stakeholders. Customer Focus: Proven experience in managing high-impact customer interactions and ensuring a positive customer experience. Collaboration: Ability to work effectively in a team environment, collaborating with engineering, product, and customer-facing teams. Education: Bachelor s degree in Computer Science, Engineering, or a related field. Advanced degree or relevant certifications are highly desirable. Preferred Skills: Experience with additional big data tools and technologies such as Hadoop, Kafka, or NoSQL databases. Familiarity with automation tools and CI/CD pipelines. Understanding of data governance and compliance requirements. Innovative Environment: Work with cutting-edge technology in a fast-paced, innovative company. Career Growth: Opportunities for professional development and career advancement. Team Culture: Collaborate with a talented and motivated team dedicated to excellence and continuous improvement. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to pr...

Senior Technical Senior technical Solutions Technical solutions

1 - 20 of 0 jobs

* No exact matches found. Showing closest results instead
Sort by:

No results found

Modify search criteria or create an alert to get relevant jobs as soon as they’re posted

Create an alert

Continue to Save

Please login to your jobseeker account, or create a new one to save this job.

Feedback

Share Feedback