Infrastructure Scaling Jobs in Bengaluru
531 Jobs Found
Lead Software Engineer - Scale & Performance
Team Vunet Systems
Lead Software Engineer - Scale & Performance Location: Bengaluru Experience: 6-12 years About VuNet VuNet is a pioneer in Business Journey Observability, using Big Data and Machine Learning to revolutionize digital experiences in the financial services industry. Our platform delivers end-to-end visibility into customer journeys, helping organizations proactively resolve issues, ensure operational resilience, and deliver superior user satisfaction. With over 28 billion digital transactions monitored every month and serving more than 300 million users globally, VuNet is shaping the future of observability for some of the largest banks and financial institutions. We are Series B funded, part of NASSCOM's DeepTech Club, and recognized by global analysts such as Gartner and Omdia. Your Role: Lead Software Engineer - Scale & Performance As a Lead Software Engineer for Scale & Performance, you'll own the performance and scalability benchmarks for VuNet's observability platform. You will work with cutting-edge technologies, design robust test frameworks, and ensure that our platform scales seamlessly to meet the demands of millions of users. Roles & Responsibilities Own performance and scalability benchmarking for key platform components (ingestion pipelines, data storage, and query services). Design and execute load, stress, soak, and capacity tests across microservices, agents, and ingestion layers. Identify and resolve performance bottlenecks in both infrastructure (CPU/memory/IO) and application layers (API latency, throughput, GC behavior). Develop and maintain performance test frameworks, preferably using Kubernetes-based environments. Collaborate with DevOps and SRE teams to optimize system configurations (Kubernetes, Postgres/TimescaleDB, ClickHouse, Kafka) for scale. Implement OpenTelemetry for service instrumentation to monitor system health and latency (p50/p95/p99 metrics).
Contribute to capacity planning, scaling strategies (horizontal/vertical), and resource optimization. Analyze production incidents related to scaling issues and drive permanent fixes. Work with engineering teams to design scalable architecture patterns and define SLIs/SLOs for system performance. Document performance baselines, tuning guides, and scalability best practices for internal use. What You Bring Mandatory Skills: Strong background in performance engineering for large-scale distributed systems or SaaS platforms. Expertise in Kubernetes, container runtimes (containerd/Docker), and resource profiling in containerized environments. Solid understanding of Linux internals, CPU/memory profiling, and network stack tuning. Hands-on experience with observability tools (Prometheus, Grafana, OpenTelemetry, Jaeger, Loki, Tempo, etc.). Familiarity with observability platform datastores like ClickHouse, PostgreSQL/TimescaleDB, Elasticsearch, or Cassandra. Experience with performance benchmarking tools such as k6, Locust, JMeter, or custom Golang/Python scripts. Ability to interpret system metrics (CPU usage, memory, GC, latency) and correlate across different layers. Nice-to-Have Skills: Experience with agent benchmarking (OpenTelemetry Collector, custom data shippers). Exposure to streaming systems like Kafka, NATS, or Pulsar. Familiarity with CI/CD pipelines for performance testing and regression tracking. Knowledge of cost optimization and capacity forecasting in cloud environments (AWS/GCP/Azure). Proficiency in Go, Python, or Bash scripting for automation and data analysis. Life at VuNet: At VuNet, we're building a world-class observability platform, and we're just getting started. You'll be part of a passionate, problem-solving team that embraces collaboration, fast learning, and staying ahead of emerging technologies like Gen AI. We foster a high-trust, inclusive culture where collaboration, ownership, and innovation are central to our success.
If you're looking to work on cutting-edge tech, make a real impact, and grow with a supportive team, you'll fit right in at VuNet. Benefits: Comprehensive health insurance coverage for you, your parents, and dependents. Mental wellness and 1:1 counseling support. A culture that promotes continuous learning, innovation, and career growth. Transparent, inclusive, and high-trust workplace. Opportunities for skill enhancement with training programs focused on new Gen AI technologies.
Finance Associate
Falconx
Job Title: Finance Associate Location: Bangalore Department: Finance Employment Type: Full-Time About FalconX At FalconX, we are a pioneering team of operators, investors, and builders committed to transforming institutional access to the cryptocurrency markets. By blending traditional finance with cutting-edge technology, we are solving the industry's most pressing challenges. As the leading solution provider for all digital asset strategies, FalconX empowers clients to navigate the rapidly evolving world of cryptocurrency with confidence, clarity, and ease. Our clients range from large financial institutions to innovative startups, and we are building the connective infrastructure that bridges conventional financial markets and the world of digital assets. What You'll Do As a Finance Associate at FalconX, you will play a critical role in ensuring accurate financial operations and reporting within the organization. You will support the team with key accounting tasks, reconciliations, reporting, and audits while helping streamline processes for optimal financial outcomes. Key Responsibilities Maintain & Reconcile General Ledger Accounts: Prepare and post journal entries with appropriate supporting documentation. Reconcile balance sheet accounts such as cash, prepaid expenses, accruals, and intercompany accounts using NetSuite. Support crypto wallet reconciliations, ensuring that on-chain balances align with internal records. Month-End & Year-End Close: Assist with the timely and accurate month-end and year-end closing processes, including completion of checklist items in FloQast. Prepare monthly schedules and ensure they tie to the general ledger. Identify and correct posting errors during the close process. Prepare Financial Reports: Run monthly financial reports and trial balances from ERP systems (e.g., NetSuite, Oracle). Compile supporting schedules for balance sheets and income statements, assist with variance analysis and provide account-level explanations.
Cross-Functional Collaboration: Collaborate with Operations, Platform, and FP&A teams to confirm data accuracy for financial transactions and journal entries. Follow standardized coding rules for vendors, departments, and accounts to ensure consistent reporting. Internal Controls and Compliance: Adhere to internal controls over financial reporting, following established approval and documentation procedures for all journal entries. Support audit and control reviews, assisting with PBC documentation and responding to auditor inquiries. Payroll Reconciliation: Reconcile payroll reports from systems like Rippling to the ERP (NetSuite) GL entries. Record recurring payroll and benefit journal entries and support the team in managing payroll-related liabilities. Prepaid and Accrual Management: Update and amortize prepaid expense schedules using ERP templates. Record standard accrual entries for open invoices or unbilled expenses, ensuring all balances are reconciled. Bank Reconciliation: Conduct weekly reconciliations for fiat bank accounts, investigate unmatched transactions, and coordinate with the Treasury and Opex teams for settlement confirmation. Budgeting and Forecasting Support: Provide historical data and expense trends to support the FP&A team with planning and budgeting. Help track recurring vs. non-recurring items during budget-to-actual reviews and maintain allocation files. Financial Analysis & Reporting: Assist in analyzing monthly account fluctuations and identify significant variances. Build reconciliations and basic dashboards for reporting purposes. Success in the Role Own the general ledger reconciliations to enable smooth and on-time month-end close. Partner with Trading, Treasury, and Operations teams to validate data and ensure proper GL treatment. Demonstrate a passion for working in a fast-paced, dynamic environment with a strong initiative to learn and grow. 
Maintain high levels of accuracy and attention to detail, ensuring all tasks are executed with precision. Exhibit the ability to multitask efficiently under pressure while meeting deadlines and achieving departmental goals. Required Qualifications Educational Background: Bachelor's degree in Accounting, Finance, or related field. Professional certifications (e.g., Chartered Accountant (CA), CPA) preferred. Experience: 3-5 years of relevant experience in accounting and finance. Experience working in financial institutions or financial services start-ups is preferred. Familiarity with IFRS and US GAAP reporting standards. Technical Skills: Proficiency in Microsoft Excel, Word, and PowerPoint. Familiarity with NetSuite or other ERP systems. Strong analytical skills and the ability to interpret complex financial data. Communication & Interpersonal Skills: Strong verbal and written communication skills, with the ability to present complex concepts clearly and concisely. A collaborative mindset, with the ability to work across multiple teams and interact with stakeholders at various levels. Other Skills: Detail-oriented with a focus on accuracy in financial data management. Ability to work independently with great initiative. Prior experience in cryptocurrency markets is advantageous but not required. Innovative Environment: Join a dynamic team at the intersection of traditional finance and the emerging crypto market. High-Growth Opportunity: Be part of a rapidly scaling organization with access to cutting-edge technology and the evolving landscape of digital assets. Collaborative Culture: Work alongside industry leaders and innovators who share a commitment to making crypto markets accessible and transparent. Competitive Compensation: Enjoy a comprehensive salary and benefits package with opportunities for career growth and development. If you are eager to be a part of an industry-defining company at the forefront of the crypto revolution, we want to hear from you.
Join FalconX and help shape the future of digital asset trading and institutional access! Qualif...
Manager - Data Analytics, Credit Card Portfolios
Zeta
Job Title: Manager - Data Analytics, Credit Card Portfolios Location: Bangalore Employment Type: Full-time About Zeta: Zeta is a next-gen banking technology company empowering banks and fintechs to build the future of financial products. Founded in 2015 by Bhavin Turakhia and Ramki Gaddipati, Zeta's flagship platform, Zeta Tachyon, is a cloud-native, fully API-enabled banking stack powering issuance, processing, lending, core banking, fraud & risk, and more. Over 20 million cards have been issued globally through our platform. With 1,700+ employees across the US, EMEA, and Asia and 70%+ in R&D, Zeta is backed by SoftBank, Mastercard, and others, having raised $330M at a $2B valuation in 2025. We work with leading banks and fintechs worldwide to transform multi-million card portfolios. Role Overview: We are looking for a strategic and experienced Manager - Data Analytics to lead business intelligence and enterprise reporting for global fintech portfolios including Credit Cards, Deposits, and other financial products. This role involves managing a team of analysts, leveraging multiple data lakes and warehouses, and building a scalable, comprehensive reporting framework for diverse markets including the US, UK, and India. Key Responsibilities: Enterprise Reporting & Data Architecture: Design and maintain end-to-end reporting across the customer lifecycle: acquisition, activation, usage, delinquency, collections, retention, operations, and support. Deliver accurate analysis of key financial KPIs: revenue, profitability, credit risk, defaults, acquisition cost. Build dashboards, self-service BI tools, and automated pipelines using Apache Superset, Metabase, Tableau. Optimize data storage and reporting for scalability and cost-efficiency.
Data Integration & Analytics Execution: Collaborate with vendors and internal engineering to integrate data from credit bureaus, open banking, core banking, card and payment processors, loan origination, CCaaS, and aggregators into a centralized Data Lake. Business Intelligence & Growth: Lead analytics projects to uncover user behavior, optimize acquisition channels, underwriting, and portfolio performance via segmentation, cohort, and funnel analyses. Partner with Product and Marketing teams to evaluate experiments (A/B testing) and guide roadmap decisions. Leadership: Build, mentor, and lead a high-performing team of BI analysts and data visualization experts. Data Governance: Establish and enforce data governance best practices, ensuring compliance and data security. Skills & Experience: Expert in BI tools such as Apache Superset, Metabase, Tableau; strong SQL skills. Familiarity with cloud data platforms like Snowflake, Redshift, BigQuery. Deep knowledge of credit and fintech KPIs: acquisition, credit decisioning, delinquency, repayment, charge-offs, profitability, RoA, CLTV, etc. Proven leadership experience managing analytics teams and scaling reporting infrastructures. Excellent communication skills with the ability to translate complex data into business strategies. Knowledge of data governance, privacy, and security in financial services. Qualifications: 10+ years in Business Intelligence/Analytics with 3+ years in the credit card industry. 3+ years managing teams of analysts or data professionals. Bachelor's degree in Computer Science, Engineering, Statistics, or a related field. Equal Opportunity: Zeta celebrates diversity and is an equal opportunity employer. We are committed to fostering an inclusive environment and encourage candidates from all backgrounds to apply. Qualification: Bachelor's degree in Computer Science, Engineering, Statistics, or a related field
Engineering Manager - Active Directory
Rubrik
Engineering Manager - Active Directory Location: Bangalore, India About the Team The Active Directory team is part of Rubrik's Enterprise Data Protection (EDP) organization. They develop data protection solutions specifically for Active Directory, including backup, restore, and integration of AD as an Identity Provider within Rubrik's security platform. About the Role Rubrik is seeking an experienced Engineering Manager to lead the Active Directory development team. This role focuses on guiding the design and delivery of AD data protection solutions, scaling the team, and driving innovation. The ideal candidate combines strong software development expertise, especially with Active Directory and identity technologies, with proven leadership skills. What You'll Do Team Leadership: Mentor and lead developers and engineers, foster innovation, collaboration, and technical excellence. Development Lifecycle: Manage sprint planning, code reviews, and adherence to standards; prioritize workload and resource allocation. Software Development: Oversee design, development, and testing of Active Directory data protection solutions and integrations with Rubrik's security platform. Customer & Growth Management: Engage with customers to support adoption and scale the team accordingly. Strategic Planning: Collaborate on roadmap definition with product managers and architects aligned to business goals. Operational Excellence: Provide technical leadership on escalations, maintain system health, and minimize regressions. Documentation & Collaboration: Develop thorough documentation and work closely with engineering, security, infrastructure teams, and stakeholders. Communication: Effectively communicate project status, risks, and technical details to diverse audiences, including senior leadership. Experience & Qualifications Education & Experience: Bachelor's or Master's degree in Computer Science, Software Engineering, IT, or related field.
8-10 years in software development and IT, with at least 2-3 years in technical leadership or engineering management roles. Technical Expertise: Strong skills in distributed systems and data storage. Solid knowledge of Windows Server OS and Active Directory (AD/Entra-ID) concepts. Experience with Microsoft Windows ecosystem preferred. Understanding of Identity and Access Management (IAM) concepts; familiarity with IAM services like Okta or AWS IAM is a plus. Knowledge of identity security (users, groups, roles, NHI) is advantageous. Leadership & Management: Proven ability to lead, mentor, and develop software engineering teams. Strong grasp of software development methodologies and project management. Experience collaborating with customers, sales, and support teams. Excellent organizational, communication, interpersonal, and presentation skills. Rubrik is on a mission to secure the world's data with Zero Trust Data Security. We empower organizations to defend against cyber threats and ensure data resilience through innovative cloud and SaaS security technologies. Qualification: Bachelor's or Master's degree in Computer Science, Software Engineering, IT, or related field.
Systems Development Engineer, Google Cloud
Google Careers
Systems Development Engineer, Google Cloud Location: Bengaluru, Karnataka, India Company: Google Minimum Qualifications Bachelor's degree in Computer Science, Information Technology, or a related field; or equivalent practical experience. 2+ years of experience with systems automation. 2+ years of experience in technical infrastructure (e.g., deployment, maintenance, troubleshooting). Preferred Qualifications 3+ years of experience in systems design and implementation. About the Role As a Systems Development Engineer (SDE) at Google Cloud, you will be part of a team responsible for managing and scaling critical services and infrastructure. This role emphasizes automation, reliability, and observability, using engineering practices to eliminate manual toil and improve system efficiency. Google SDEs design and build the tools and systems that power the infrastructure for Google's services, transforming telemetry into actionable insights and proactively solving operational challenges. You'll have the opportunity to work on impactful, large-scale projects in an environment that fosters learning, collaboration, and growth. Key Responsibilities Participate in on-call rotations and incident response, managing services within your domain. Troubleshoot infrastructure and system issues, evaluate diagnostic data, and recommend solutions. Resolve tickets and bugs within defined service-level objectives (SLOs). Collaborate with primary responders to maintain high availability and reliability of systems. Contribute to the design and implementation of systems and services in related domains. Work directly with customers to gather requirements, define distributed system needs, and propose solutions. Develop automation tools and systems to improve efficiency and reduce operational overhead. About Google Cloud Google Cloud helps organizations transform their business with advanced technologies and enterprise-grade solutions.
With a focus on sustainability, innovation, and scalability, Google Cloud serves customers in over 200 countries and territories, providing the tools and infrastructure necessary to solve the world's most complex business challenges. Qualification: Bachelor's degree in Computer Science or IT-related field, or equivalent practical experience.
Director / Sr Manager - Platforms
Eightfold
Job Title: Director / Sr Manager - Platforms Location: Bangalore, Karnataka, India Job Type: Full-Time (Hybrid Work Model) Experience Level: 10+ Years About Eightfold.ai: At Eightfold.ai, we are revolutionizing how organizations manage talent by leveraging the power of artificial intelligence. Our cutting-edge AI platform is transforming the way businesses hire, develop, and retain talent. By utilizing AI to understand individual skills and potential, we're solving the fundamental problem of matching people with the right opportunities. We are looking for a visionary engineering leader to drive the growth of our Core Infrastructure Team in India, shaping the foundation of our AI platform. About the Core Infrastructure Team: The Core Infrastructure Team at Eightfold is the backbone of the organization, responsible for the architecture, maintenance, and enhancement of critical infrastructure elements that support our entire technology stack. Our team builds and maintains systems for Search, Databases, Machine Learning Infrastructure, Data Warehouses, Developer Platforms, and Application Infrastructure. We ensure the scalability, security, and reliability of these services, which underpin every product that we offer to our users and customers. What You'll Own & Drive: As the Director / Sr Manager - Platforms, you will lead the technical direction for Eightfold's infrastructure, security, and analytics platforms, ensuring they meet the needs of our growing enterprise-scale business. Vision & Roadmap: Lead the strategy, roadmap, and execution of the Infrastructure, Security, and Analytics platforms. Team Building: Hire, mentor, and lead a high-performing engineering team, fostering a culture of innovation, excellence, and autonomy. Cross-Functional Collaboration: Partner with Product, Data, and DevOps teams to build secure, scalable systems that support business growth.
Infrastructure Scaling: Ensure reliability, availability, and performance across both cloud (AWS, GCP) and on-prem environments. Security Leadership: Define and enforce security protocols, including threat modeling, vulnerability management, and compliance frameworks (SOC2, ISO27001, etc.). Operational Excellence: Champion modern engineering practices, including CI/CD, observability, and cost optimization. Analytics Platform Development: Lead the creation and scaling of an end-to-end Analytics Product stack including data warehouse, query engine, and dashboards. Ownership & Impact: Take ownership of the full product/technology lifecycle, from vision and architecture through deployment, ensuring long-term impact and success. What You Bring: Required Skills & Experience: 10+ Years of Engineering Experience: Significant experience in engineering, with 3+ years in a leadership role leading teams at scale. Expertise in Cloud Infrastructure: Deep expertise in cloud-native infrastructure (AWS, GCP, etc.) and DevSecOps principles. Proven Success in Platform Scaling: A track record of building and scaling secure, reliable platforms at an enterprise level. Security Expertise: Leadership in security initiatives, including threat modeling, vulnerability management, and compliance. Excellent Communication: Strong communication skills, with the ability to influence and collaborate across engineering and business teams. Bonus Experience: Exposure to scaling analytics stacks (Snowflake, dbt, Airflow, Looker, etc.) is a plus. Leadership & Culture Building: Demonstrated success in building high-caliber teams and cultivating a thriving engineering culture. Impactful Leadership: Take on a high-leverage leadership role that shapes the foundation of Eightfold's AI platform and directly impacts the company's growth and success. Innovative Environment: Work with cutting-edge technologies and collaborate with brilliant minds to solve complex engineering challenges.
Career Growth: As a leader at Eightfold, you will have the autonomy to drive strategic initiatives while building and scaling high-performing teams. Hybrid Work Model: Enjoy a flexible hybrid work model with the ability to work remotely while maintaining a strong in-office presence for team collaboration starting February 1, 2024. Comprehensive Benefits: Competitive salary, comprehensive family medical coverage, and eligibility for equity awards and discretionary bonuses or commissions. How to Apply: If you're a visionary engineering leader with a passion for building scalable, secure platforms and leading high-performing teams, we want to hear from you. Join Eightfold.ai and help us redefine how companies build, hire, and retain their workforce using AI-powered talent intelligence. Equal Opportunity Employer: Eightfold.ai is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, age, or disability.
Staff Engineer - Core Infrastructure
Eightfold
Staff Engineer - Core Infrastructure Location: Bangalore, Karnataka, India Employment Type: Full-Time | Hybrid Work Model About Eightfold.ai At Eightfold.ai, we're transforming the future of work by leveraging artificial intelligence to connect individuals with career opportunities based on their skills and potential, not just their network. Our Talent Intelligence Platform powers a more diverse, inclusive workforce by helping organizations plan, hire, develop, and retain top talent. With $410M+ in funding and a $2B+ valuation, we are revolutionizing how the world thinks about skills, potential, and careers. If you're passionate about cutting-edge technology, infrastructure, and creating scalable solutions that impact the world, we want you to join us. The Opportunity We're looking for a Staff Engineer to join our Core Infrastructure Team and help scale the backbone of Eightfold's platform. This high-impact role will involve designing, building, and optimizing foundational systems that power everything from search and machine learning infrastructure to developer platforms and observability tools. You will drive system design across our stack and mentor engineering teams to build scalable, resilient systems that enable Eightfold to grow and deliver AI-powered solutions for our customers. What You'll Own & Drive Architect & Scale Core Systems: Design and build scalable infrastructure systems that support Eightfold's AI-driven products, including search, compute, storage, and machine learning infrastructure. Cross-Functional Leadership: Lead cross-team technical initiatives, collaborating with Product, Security, Data, and Platform teams to align with company-wide goals. Hands-On Development: Contribute directly to system design, code reviews, and incident response, ensuring best practices are followed. Mentorship & Leadership: Guide and mentor engineers to help them grow into future leaders, fostering a culture of technical excellence across teams.
Advocate for Engineering Excellence: Champion best practices across areas such as cloud architecture, CI/CD, security, and observability. Solve Complex Infrastructure Challenges: Tackle problems around reliability, scalability, and infrastructure performance, ensuring the systems are robust and perform well at scale. Bring Emerging Tech to Life: Stay on top of the latest trends and technologies, incorporating new scalable design patterns into our architecture. What You Bring 10+ years of experience in backend or infrastructure engineering, with a strong background in building distributed, cloud-native systems. Proven track record in designing and delivering reliable, high-scale services (ideally in AWS, GCP, or Azure environments). Expertise in Infrastructure Technologies: Deep knowledge of containerization, orchestration (Kubernetes), and infrastructure-as-code. Experience with one or more of the following: search infrastructure, ML/AI infrastructure, databases/data warehouses, developer tooling, or platform security. Leadership Experience: A passion for mentoring and guiding engineers, influencing teams and peers, and driving excellence across projects. Strong communication skills, able to translate complex technical challenges into strategic business impact. (Bonus) Experience with SRE principles, cloud security, and compliance for enterprise/government environments. Our Engineering Culture At Eightfold, we believe in ownership over tasks. You won't just be given directions; you'll be trusted to take responsibility and make a measurable impact. We have a growth mindset and continuously improve in all aspects of our work. Collaboration, transparency, and speed are core to everything we do. You'll work in a dynamic, supportive environment where your work directly influences the success of the company and its mission. Meaningful Work: Help shape the future of work by building products that impact careers and businesses globally.
Growth Opportunities: Be part of a rapidly scaling company where your contributions are highly valued. Competitive Compensation: Attractive salary, equity, and comprehensive benefits package (including medical, vision, and dental coverage). Hybrid Work Model: Work from our Bangalore office twice a week, with flexibility for remote work. Inclusive Culture: We are committed to fostering a diverse and inclusive work environment where everyone feels valued. Equal Opportunity Employer Eightfold.ai is an Equal Opportunity Employer. We do not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, or disability. If you're a hands-on, innovative engineer with a passion for building scalable systems and tackling infrastructure challenges, we want to hear from you.
DevOps Engineer
Sarvam
DevOps Engineer Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a cutting-edge generative AI startup headquartered in Bengaluru, India, with a mission to make generative AI accessible and impactful for Bharat. Founded by AI experts, we are dedicated to developing high-performance, cost-effective AI agents tailored for the Indian market. We enable enterprises to tap into new opportunities, build deeper customer connections, and reshape the future of AI for India and beyond. Role Overview We are looking for a DevOps Engineer to join our team and help build and manage scalable, secure, and high-performance infrastructure. In this role, you will be a key contributor to automating deployments, managing cloud infrastructure, optimizing CI/CD workflows, and ensuring system reliability. You will work with cutting-edge technologies, including cloud platforms, containerization, and infrastructure as code (IaC), to deliver impactful solutions for AI-driven products. Key Responsibilities CI/CD Pipelines: Design, implement, and manage CI/CD pipelines for seamless software deployment and integration. Cloud Infrastructure: Deploy and manage cloud infrastructure using Terraform, Kubernetes, and Docker for scalability and high performance. Automation & Scaling: Automate infrastructure provisioning, scaling, and security compliance to support high-availability environments. Monitoring & Optimization: Implement logging, monitoring, and alerting solutions using tools like Prometheus, Grafana, ELK Stack, or CloudWatch to monitor system performance and optimize resource utilization. Security & Compliance: Enhance security and compliance by managing IAM policies, encryption, and vulnerability scanning. Troubleshooting & Root Cause Analysis: Troubleshoot system failures, perform root cause analysis, and implement improvements to ensure reliability and uptime. 
Collaboration: Work closely with development teams to ensure smooth deployment and operation of AI models and applications. Must-Have Skills & Qualifications Educational Background: Bachelor's degree in Computer Science, Engineering, or related field (2024/2025 graduates). Cloud Expertise: Strong experience with AWS, Azure, or GCP for deploying and managing cloud-based applications. Containerization: Proficiency in Docker and Kubernetes for building and managing containerized applications. Infrastructure as Code (IaC): Experience with Terraform, Ansible, or CloudFormation to automate infrastructure management. CI/CD Pipelines: Experience in setting up automated workflows using tools like GitHub Actions, Jenkins, or GitLab CI/CD for smooth deployments. Monitoring & Logging: Experience with Prometheus, Grafana, ELK, or similar tools to implement effective monitoring and logging solutions. Networking & Security: Strong understanding of firewalls, VPNs, SSL, and cloud security best practices for secure infrastructure. Version Control: Proficiency with Git for managing code repositories and version control workflows. Problem Solving: Strong debugging, troubleshooting, and analytical skills to resolve complex system issues. Good to Have (Preferred Experience) Serverless Computing: Exposure to serverless computing models such as AWS Lambda or Azure Functions. Message Queues: Experience with message queues like Kafka, RabbitMQ, or SQS. Site Reliability Engineering (SRE): Familiarity with SRE practices to ensure the reliability and availability of large-scale systems. Open Source Contributions: Contributions to open-source projects or a strong GitHub portfolio showcasing DevOps expertise and best practices. Impactful Work: Work on AI-driven products that are reshaping the future of technology in India. Innovative Team: Collaborate with a team of AI experts and engineers pushing the boundaries of technology.
Career Growth: Opportunity to grow in a fast-growing startup at the forefront of the generative AI revolution. Cutting-edge Technologies: Work with cloud technologies, automation, and AI infrastructure to create high-impact products. Qualification: Bachelor's degree in Computer Science, Engineering, or related field
Engineering Manager - Platform Engineering
Meesho
Engineering Manager - Platform Engineering Location: Bangalore, Karnataka | Department: Tech About the Team At Meesho, we support 5% of Indian households with high-scale e-commerce solutions, and we do it with zero downtime. We value speed over perfection, embrace failures as learning opportunities, and empower teams with a Founder's Mindset. As part of the Platform Engineering team, you'll be building resilient, low-latency, high-throughput systems that serve millions of users daily. We invest in the growth of every engineer through continuous feedback, open communication, and a supportive culture. And yes, we know how to party as hard as we code. About the Role We are looking for a skilled Engineering Manager - Platform Engineering to lead a team responsible for designing, scaling, and optimizing our core infrastructure. This role involves managing large-scale distributed systems, fostering engineering excellence, and collaborating across teams to drive innovation. You'll ensure technical quality, delivery speed, and scalable architecture for all projects under your ownership. What You Will Do Design and allocate technical tasks while maintaining Meesho's engineering standards. Own execution of platform projects from inception to deployment, ensuring scalability and reliability. Conduct regular 1:1s, drive feedback cycles, and support career growth of engineers. Partner closely with Product and Design teams to develop new platform capabilities. Coach engineers on best practices for architecture, performance, and scalability. Monitor project health, sprint progress, and engineering KPIs. Foster a high-performing team culture with strong engineering ownership. What You Will Need Bachelor's or Master's degree in Computer Science or a related technical field. 8+ years of professional software development experience, including 1+ year in team management. Proven experience building large-scale distributed systems.
Strong coding skills in Java, Python, or Go, and multithreading expertise. Deep understanding of messaging systems (Kafka, etc.), transactional and NoSQL databases. Experience working on cloud platforms like GCP or AWS. Exceptional communication, leadership, and stakeholder management skills. Good to have: Exposure to Elasticsearch, data pipelines, or stream processing systems. About Us Meesho is India's leading e-commerce platform built for the next billion users. With 1.75M+ sellers and a customer base spread across every serviceable pin code, we are democratizing internet commerce by enabling small businesses to sell online at zero commission and with the lowest logistics costs in the industry. From affordable products that reflect local demand to a robust pan-India tech infrastructure, Meesho is transforming how India shops and sells online. Our Culture & Total Rewards At Meesho, we believe in creating a culture of impact, inclusion, and innovation. Our values, reflected in 11 guiding principles or "Mantras", shape how we work, collaborate, and grow together. Why You'll Love Working Here: Compensation: Competitive salary with equity-based rewards tailored to your experience and impact. Wellness: Extensive health insurance for you and your family through our MeeCare Program, mental wellness support, gym discounts, and more. Flexibility & Leave: Generous time off, parental benefits, and relocation support. Growth & Learning: Continuous learning through workshops, internal mobility, and performance coaching. Culture of Recognition: Personalized gifts, fun rituals, and regular engagement programs celebrating wins big and small. Join us to build the platform powering the future of digital commerce in India. Apply now and be part of a tech-first, people-driven journey at Meesho. Qualification: Bachelor's or Master's degree in Computer Science or a related technical field.
AI Platform Architect
Adobe
AI Platform Architect Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is changing the world through digital experiences. Whether you're an emerging artist or a global brand, our tools empower creativity and innovation across every screen. From powerful imaging and video solutions to immersive web and app design, Adobe's mission is to help people and businesses deliver exceptional digital experiences. We are committed to creating an inclusive workplace where everyone is respected and given equal opportunity. Innovation can come from anywhere, and the next big idea could be yours. Job Description We are looking for a visionary AI Platform Architect with deep expertise in building and scaling cloud-native, AI-powered platforms. The ideal candidate will have experience deploying large-scale, customer-facing AI solutions and a deep understanding of modern cloud architecture, data systems, MLOps, and LLMOps. Responsibilities Design and develop scalable AI/ML platforms and pipelines across AWS, Azure, and GCP. Architect end-to-end LLM pipelines including model training, fine-tuning, serving, inference APIs, and monitoring. Lead cross-functional teams in delivering AI solutions from experimentation to production. Implement MLOps and LLMOps best practices using tools like MLFlow, SageMaker, Langchain, and LangGraph. Design GPU-optimized architectures for training and inference of LLMs using DeepSpeed, vLLM, and other modern frameworks. Support infrastructure automation and container orchestration with Kubernetes, Docker, and CI/CD pipelines. Collaborate with internal stakeholders and clients to understand requirements, evangelize platform solutions, and ensure successful delivery. Key Skills and Expertise Cloud and DevOps: Expertise in AWS, Azure, GCP, especially VPC design, cloud databases, and serverless architecture. Certified in AWS Professional Solution Architect, AWS ML Specialty, or Azure Solutions Architect Expert (preferred).
Proficient with Kubernetes, Docker, FluentD, Kibana, Grafana, Prometheus. Data and Streaming: Experience with OLTP/OLAP databases and cloud-native data warehouses like BigQuery, Aurora, Spanner. Hands-on with Kafka, Apache Flink, Spark, Airflow, Databricks, Apache Iceberg, Presto. AI/ML & LLM Expertise: In-depth understanding of LLMs (GPT, Gemini, Claude, Mixtral, Llama, Hugging Face OSS models). LLMOps frameworks: Langchain, Langgraph, Langflow, Flowise, LLamaIndex. ML lifecycle tools: MLFlow, SageMaker, Vertex AI, Azure AI, AWS Bedrock. Proven experience in model optimization, fine-tuning, and high-throughput inference systems. Programming Languages: Proficient in Python, SQL, and JavaScript. Preferred Qualifications 10+ years in cloud and AI/ML platform architecture roles. Experience delivering AI solutions for enterprise-scale clients. Hands-on experience with GPU architecture and parallel/distributed training. Strong communication skills with ability to influence technical and business stakeholders. Work on cutting-edge AI technologies and shape future product experiences used by millions. Collaborate with world-class engineers and scientists in a diverse, inclusive culture. Be part of a company that values creativity, innovation, and employee well-being. Adobe is proud to be an Equal Opportunity Employer. We welcome and encourage candidates from all backgrounds to apply.
Senior Software Engineer - DevOps
FinBox
DevOps Engineer | FinBox Location: India (Specific location not mentioned) Experience: 5+ Years About FinBox: Where Fintech Meets Fun! Welcome to FinBox, the buzzing hive of tech innovation and creativity! Since our inception in 2017, FinBox has built some of the most advanced technologies in the financial services space that help lenders like Banks, NBFCs and large enterprises build and launch credit products within a matter of days, not months or years. FinBox is a Series A funded company which is expanding globally with offices in India, Vietnam, Indonesia and Philippines. Our vision is to build the best-in-class infrastructure for lending products and help Banks & Financial Services companies across the world scale and launch credit programs that set a new standard in the era of digital finance. So far, we've helped our customers disburse Billions of Dollars in credit across unsecured and secured credit including personal loans, working capital loans, business loans, mortgage and education loans. FinBox solutions are already being used by 100+ companies to deliver credit to over 5 million customers every month. Why Should You be a FinBoxer: Innovative Environment: At FinBox, we foster a culture of creativity and experimentation, encouraging our team to push the boundaries of what's possible in fintech. Impactful Work: Your contributions will directly impact the lives of millions, helping to provide fair and accessible credit to individuals and businesses alike. Growth Opportunities: We are a Series A funded startup and have ample opportunities for growth, professional development and career advancement. Collaborative Culture: Join a diverse and inclusive team of experts who are passionate about making a difference and supporting one another. Who's a Great FinBoxer: At FinBox, we're on the lookout for exceptional folks who are all about innovation and impact. If you're excited to shake things up in the banking & financial services world, keep reading!
Creative Thinkers: If your brain is always bubbling with out-of-the-box ideas and wild solutions, you're our kind of person. We love disruptors who challenge the norm and bring fresh perspectives to the table. Customer Heroes: Our customers are our champions, and we need heroes who can understand their needs, deliver magical experiences, and go above and beyond to keep them happy. Team Players: We believe in the power of we. If you thrive in a collaborative environment, value different viewpoints, and enjoy being part of a spirited, supportive team, you'll fit right in. About the Job As a DevOps Engineer at FinBox, you will also impart your strong understanding of cloud computing and infrastructure-as-a-service tools. You will be given ownership for taking new initiatives to implement industry best practices in process improvements, implementing automation and orchestration, and ensuring that software applications are delivered quickly, reliably, and at scale. How You'll Contribute Build and set up new development tools and infrastructure which can be helpful in scaling the product Strong understanding of CI/CD to implement and maintain automated build, test, and deployment pipelines to ensure continuous delivery of high-quality software Design, build and maintain an automated secure & resilient cloud infrastructure Collaborate with other developers to ensure that development follows established processes & practices Implement and maintain a practice of monitoring and logging to ensure the availability, performance and security of the application Enhance the development and delivery process by implementing best practices and tools in the industry Troubleshoot and resolve issues related to software development, deployment, and infrastructure by identifying the root cause Provide Day 2 technical support with products and the deployments and document the phases involved throughout the process Train and guide other members on the best DevOps practices and tools Who You Are
An exceptional 5+ years of experience in handling CI/CD and implementing the best development practices In-depth knowledge of scripting languages, preferred are Go/Python/Bash In-depth working knowledge in Kubernetes, Terraform, Docker, Jenkins, etc Strong knowledge of system design & algorithms Hands-on experience on the cloud provider, AWS Strong knowledge of automating existing systems, repetitive tasks and deployments Carrying a knack for promoting high-quality coding practices with clean and reusable code Fair understanding of monitoring frameworks such as Grafana, Prometheus, Datadog, etc
Senior Software Development Engineer Idc Vn Edge
Oracle
Job Description: Senior Software Development Engineer - Oracle Cloud Infrastructure Core Services Development Team Role: Senior Software Development Engineer Team: OCI Virtual Networking Core Services Development Team Location: India Career Level: IC3 Experience: 4+ years Overview: Oracle's Cloud Infrastructure (OCI) is building state-of-the-art infrastructure-as-a-service (IaaS) technologies that operate at high scale across a globally distributed, multi-tenant cloud. The OCI Virtual Networking team is at the heart of this effort, developing distributed, highly available virtual networking services. This team is responsible for foundational cloud services, such as the Virtual Cloud Network (VCN), VPN, Customer Cloud Connectivity, Network Firewalls, and other edge services. As a Senior Software Development Engineer, you will be responsible for designing, developing, and optimizing complex distributed systems that interact with end users and network infrastructure. Your role will involve working on distributed services, developing algorithms for efficient data transfer across networks, and ensuring scalability and reliability within Oracle's cloud environment. You will work closely with a collaborative, agile team of engineers while contributing to building the future of cloud networking services. Key Responsibilities: Software Development & Design: Design, develop, and implement distributed networking services within OCI's Virtual Cloud Network (VCN). Focus on writing clean, maintainable, and optimized code to enhance performance and scalability. Develop and optimize algorithms to ensure efficient data transfer and network operations across the distributed cloud infrastructure. Ensure the performance and scalability of the code, especially when deployed in a cloud environment. Collaboration & Agile Work Environment: Collaborate closely with cross-functional teams in a fast-paced, agile development environment. 
Participate in the full software development lifecycle, from planning and design to testing and deployment. Work with other team members to ensure the integration of various OCI services, with a focus on automation and scalability. Operational Support & Troubleshooting: Contribute to the operational support of production services, including on-call duties. Troubleshoot and resolve complex issues, ensuring high availability and reliability of networking services. Provide technical leadership and contribute to the continuous improvement of the services. Leadership & Mentorship: Take ownership of parts of the service and its components, leading from design to implementation. Mentor junior engineers and provide technical guidance and support. Share knowledge and contribute to the team's growth through code reviews, knowledge-sharing sessions, and coaching. Technical and Professional Requirements: Programming Expertise: Expert-level experience with Java in developing large-scale, high-performance applications. Experience in concurrent programming and the design of distributed systems. Proficiency in solving complex problems related to scalability, performance, and reliability in cloud environments. Cloud & Distributed Systems: Experience in building and maintaining distributed, scalable services, especially within cloud infrastructures. Strong knowledge of cloud technologies and networking protocols. System Design & Optimization: Solid understanding of system architecture, including how components interact in a distributed, cloud-based system. Ability to optimize code for performance and scalability in production environments. Operational Understanding: Experience in operating production services and providing support during on-call rotations. Understanding of troubleshooting complex system issues, particularly in a distributed cloud environment. Team Collaboration & Communication: Ability to work in a collaborative and agile team environment.
Strong verbal and written communication skills for effective coordination across teams. Preferred Qualifications: Experience in Large-Scale Distributed Services: Prior experience in building and scaling distributed services, particularly in cloud or network-related domains. Python Skills: Knowledge of Python for scripting, automation, and solving network-related problems is a plus. Additional Skills: Experience with cloud services, such as VPN, firewalls, network connectivity, and network security. Exposure to containerization technologies such as Docker and orchestration tools like Kubernetes is advantageous. Educational Requirements: Bachelor's or Master's degree in Computer Science, Electrical/Hardware Engineering, or a related field. At Oracle, you will have the opportunity to work on cutting-edge technologies that power cloud networking at a global scale. You will be part of a dynamic and innovative team, contributing to the development of highly scalable and distributed networking services within Oracle's cloud infrastructure. Your expertise will be crucial to driving the evolution of cloud technologies, and you will have a chance to mentor junior engineers while working in a collaborative, fast-paced environment. Qualification: Bachelor's or Master's degree in Computer Science, Electrical/Hardware Engineering, or a related field.
Site Reliability Developer 2/3
Oracle
Job Description: Site Reliability Engineer - OCI Cloud Engineering Team Role: Site Reliability Engineer (SRE) Team: OCI OLTP (Online Transaction Processing) Location: Kiev Career Level: IC2 Experience: 5+ years Overview: Oracle Cloud Infrastructure's (OCI) OLTP organization is seeking a Site Reliability Engineer (SRE) to join our dynamic and fast-paced Cloud engineering team. The team is responsible for mission-critical distributed systems and cloud services, and we are looking for an engineer who is deeply interested in databases, distributed systems, and cloud services. If you thrive in an environment where innovation, problem-solving, and operational excellence intersect, this is an exciting opportunity for you! As a member of the SRE services, you will focus on Cloud Services, building deployments, operations, security vulnerability mitigation, and automation. You will be instrumental in fostering a culture of Site Reliability Engineering (SRE) within the team, and your work will directly contribute to ensuring the stability, performance, and reliability of Oracle's global cloud service infrastructure. This role requires someone who is adaptable, highly motivated, and capable of managing large-scale cloud environments with a focus on continuous improvement. Key Responsibilities: Cloud Service Operations & Reliability: Deploy, operate, and maintain large-scale cloud service products in a highly available, fault-tolerant, and scalable environment. Collaborate with internal teams to identify and mitigate cross-team issues that pose operational risks to cloud services. Focus on systems reliability and ensure the continuous availability of cloud services by automating tasks and eliminating manual interventions. Automation & Improvements: Automate operational tasks and improve service deployments, focusing on scaling, performance, and uptime. Contribute to CI/CD systems, ensuring seamless integration and continuous delivery for cloud-based services.
Leverage automation tools such as Terraform, Grafana, and Bitbucket to streamline operations. Security & Incident Response: Mitigate security vulnerabilities within cloud services and ensure compliance with Oracle's security standards. Participate in on-call rotations to provide immediate troubleshooting support and ensure rapid issue resolution. Perform deep analysis of service performance and collaborate with team members to diagnose and resolve issues that affect service availability or performance. Collaborative Problem-Solving: Work closely with cross-functional teams, including development, database, networking, and storage experts, to ensure the reliability and performance of services. Identify systemic issues and potential risks, develop solutions, and ensure proper documentation and communication with stakeholders. Documentation & Knowledge Sharing: Contribute to documentation such as runbooks, operational guides, and troubleshooting manuals. Mentor junior engineers and share knowledge on best practices for site reliability engineering and cloud service operations. Continuous Learning: Stay up to date with new cloud technologies, trends, and best practices, and actively implement them in your day-to-day work. Technical and Professional Requirements: Cloud Services & Infrastructure: 5+ years of experience in SRE, DevOps, or Automation roles with a focus on large-scale infrastructure and cloud services. Hands-on experience with cloud platforms (e.g., OCI, AWS, Azure) and expertise in compute, database, networking, and storage services within cloud environments. Automation & Tooling: Proficiency with automation tools such as Terraform, Grafana, LumberJack, and Shepherd. Solid experience in using CI/CD tools and processes for cloud service deployments and operations. Scripting & Systems: Strong knowledge of scripting languages, particularly Python and Java. 
Familiarity with Linux systems, Docker containers, virtualized infrastructure, and orchestration (e.g., Kubernetes). Performance & Troubleshooting: Excellent troubleshooting skills with a focus on performance, availability, reliability, and scalability of distributed systems. Experience in operating fault-tolerant, highly available, high-throughput distributed systems. Security & Incident Management: Familiarity with security practices and mitigating security vulnerabilities in cloud services. Proven ability to handle incident response and provide efficient troubleshooting during on-call rotations. Collaboration & Communication: Strong verbal and written communication skills, capable of working effectively with diverse teams across multiple geographies. Ability to work in a highly collaborative environment, driving operational excellence and customer satisfaction. Preferred Qualifications: Experience in operating and maintaining multi-tenant, cloud-based infrastructure with a focus on scalability and high availability. Familiarity with tools and platforms like Grafana, Prometheus, and other observability and monitoring tools. Experience in networking and storage technologies in a cloud environment. Joining OCI's OLTP team as an SRE gives you the opportunity to work with cutting-edge technologies and contribute to the operational excellence of Oracle's global cloud infrastructure. This is a chance to grow your skills in a highly dynamic environment and to solve complex problems that directly impact mission-critical cloud services. With a focus on automation, scalability, and high performance, you will be an essential part of a team that powers Oracle's leading cloud services. If you are an experienced engineer passionate about cloud technologies, automation, and ensuring the reliability of large-scale systems, we encourage you to apply and join us in this exciting journey!
Software Engineer III, Scaled Infrastructure
Google Careers
Software Engineer at Google Minimum Qualifications: Bachelor's degree or equivalent practical experience. 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree in an industry setting. 2 years of experience with data structures or algorithms in either an academic or industry setting, and building software for data privacy or security (e.g., identity and access management). Experience with C++, infrastructure design, and Android app development. Preferred Qualifications: Master's degree or PhD in Computer Science or related technical fields. 2 years of experience with performance, large scale systems data analysis, visualization tools, or debugging. Experience developing accessible technologies. Experience with Security Analysis, Program Analysis, Decompiler. Knowledge of code and system health, diagnosis and resolution, and software test engineering. About the Job: Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
In this role, you will participate in the development of large-scale Google infrastructure. You will design, develop, and maintain scalable Google-wide infrastructure to scan 1 million Android apps per day, manage petabytes of analysis data, and protect over 3 billion devices. You will build systems to manage and extract intelligence from various sources and make the intelligence available to various internal systems and stakeholders. The Platforms and Ecosystems product area encompasses Google's various computing software platforms across environments (desktop, mobile, applications). The products provide enterprises, and ultimately end users, the ability to utilize and manage their services at scale. We build innovative and compelling software products, from apps to TVs and from laptops to phones, that have an impact on people's lives across the world. Responsibilities: Write product or system development code. Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies. Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency). Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
Senior Software Development Engineer In Test (SDET)
Couchbase
Job Title: Senior Software Development Engineer in Test (SDET) Location: Bangalore, India (Office-based role) About Couchbase: As industries race to embrace AI, traditional database solutions fall short of the growing demands for versatility, performance, and affordability. Couchbase is leading the way with Capella, the developer data platform for critical applications in the AI-driven world. By uniting transactional, analytical, mobile, and AI workloads into a seamless, fully managed solution, Couchbase empowers developers and enterprises to build and scale applications with unmatched flexibility, performance, and cost-efficiency from cloud to edge. Trusted by over 30% of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission! Job Overview: Couchbase is expanding rapidly, and our Engineering and Cloud teams are at the heart of this growth. As a Senior Software Development Engineer in Test (SDET), you will be a key contributor to ensuring high quality across our data infrastructure systems. You will collaborate closely with engineering teams to optimize test frameworks in Python/Golang, scaling them to handle larger workloads under stress and heavy load. This role provides an exciting opportunity to contribute to the growth and development of our products and to shape the future of our testing and automation strategies. Responsibilities: Automation & Test Framework: Lead the design, implementation, and maintenance of automation and test frameworks to support large-scale data infrastructure systems. Scaling Systems: Optimize test frameworks (Python/Golang) to handle systems under large scale, high load, and stressful conditions. Component Testing: Drive testing across multiple components, including API interfaces, databases, storage, file systems, and OS-level functionality. 
Environment Configuration: Set up and configure test environments, including Windows and Linux OS, networking, proxies, and client-server tests. Collaboration: Work with cross-functional teams to ensure quality testing and continuous improvement of products through integrated automated testing. Analysis & Reporting: Provide in-depth analysis and generate clear, actionable reports on test results, issues, and areas for improvement. Problem-Solving: Identify and resolve complex testing issues in a timely manner, demonstrating self-motivation and keen analytical skills. Communication: Maintain strong communication with both technical and non-technical teams to provide clarity on testing outcomes, issues, and resolutions. Requirements: Experience: 4 to 6 years of hands-on experience in automation and test framework implementation. Programming Skills: Proficiency in Python, C/C++, Java, or Golang. Testing Expertise: Demonstrated experience testing APIs, databases, storage systems, file systems, and operating systems. Technical Understanding: Good understanding of large-scale distributed systems, relational/NoSQL databases, OS concepts, and networking. Test Environments: Experience configuring test environments and working with infrastructure as a service (IaaS) across Windows and Linux OS. Problem-Solving & Analytical Skills: Strong attention to detail, excellent problem-solving skills, and curiosity for identifying and addressing complex issues. Collaboration: Ability to thrive in a fast-paced environment and work effectively within a team. At Couchbase, we reimagine database technology to enable modern, flexible, and cost-effective applications that drive premium customer experiences. Our Capella platform delivers cutting-edge solutions, empowering businesses to rapidly build applications that scale with performance and flexibility. Benefits at Couchbase: Generous Time Off Program: Flexibility to care for yourself and your family. 
Wellness Benefits: A variety of world-class medical plans, dental, vision, life insurance, and employee assistance programs. Financial Planning: RSU equity program, ESPP, retirement planning, and business travel insurance. Career Growth: We value your contributions and provide opportunities to grow and make an impact. Fun Perks: Ergonomic office setup, food & snacks for in-office employees, and more!
Autoit Solutioning Engineer, Lead
Qualcomm
Job Title: Site Reliability Engineer (SRE) General Summary: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. This role is critical in ensuring the stability, scalability, and security of our infrastructure and services. As an SRE, you will work collaboratively with software engineers, data scientists, and product managers to optimize system reliability while driving automation and continuous improvement. You will be responsible for modernizing traditional services, implementing cutting-edge technology, and proactively managing infrastructure to maintain operational excellence. If you are passionate about automation, DevSecOps, system performance, and infrastructure resilience, this role offers an exciting opportunity to make a meaningful impact. Key Responsibilities: System Monitoring & Incident Response: Continuously monitor system health, detect anomalies, and respond to incidents promptly. Investigate and troubleshoot service-related issues, ensuring minimal disruption. Implement proactive measures to prevent downtime and optimize system stability. Infrastructure Automation & DevOps Implementation: Develop and maintain Infrastructure-as-Code (IaC) scripts to automate deployments and scaling. Automate routine operational tasks to improve efficiency and reduce manual intervention. Leverage DevSecOps practices to ensure secure and resilient deployments. Performance Optimization & Capacity Planning: Collaborate with development teams to enhance software performance and system responsiveness. Identify and resolve system bottlenecks to improve speed, efficiency, and reliability. Forecast resource requirements based on traffic patterns and business growth. Security, Compliance & Risk Management: Implement security best practices and compliance measures across all infrastructure layers. Conduct security audits and ensure systems meet industry-standard security guidelines. 
Proactively assess and mitigate risks associated with infrastructure and deployments. Required Qualifications & Skills: Technical Expertise: Extensive experience with Linux-based environments (Ubuntu, RedHat), including system administration and troubleshooting. Strong proficiency in scripting and automation using Python, Bash, or Go. Experience with containerization and orchestration technologies such as Docker and Kubernetes. Familiarity with CI/CD pipelines and tools like Jenkins, Puppet, Vault, and Splunk. Hands-on experience with cloud platforms (AWS, Azure, or GCP). Problem-Solving & Leadership: Strong analytical skills with the ability to diagnose and resolve complex system issues. Self-driven, highly motivated, and able to work independently in a fast-paced environment. Ability to collaborate cross-functionally and communicate technical solutions effectively. Security & Reliability Focus: Solid understanding of DevSecOps principles and secure system design. Ability to implement monitoring, logging, and alerting solutions to maintain system resilience. Passion for continuous learning and leveraging data-driven approaches for system improvement. Work in a high-impact role that directly contributes to the reliability and scalability of mission-critical systems. Be part of an innovative, forward-thinking team that values automation, collaboration, and continuous improvement. Competitive salary, professional development opportunities, and an environment that fosters growth and innovation. If you are a passionate, results-driven SRE, we invite you to join us and play a pivotal role in shaping the future of our infrastructure.
Engineering Manager - Backend
Databricks
About Databricks At Databricks, we are driven by our passion for enabling data teams to solve the world's most challenging problems, from building the next mode of transportation to accelerating medical breakthroughs. We achieve this by creating the world's leading data and AI infrastructure platform, empowering our customers to leverage deep data insights to transform their businesses. Founded by engineers and customer-obsessed, we thrive on solving technical challenges, whether it's designing next-gen UI/UX for data interaction or scaling services and infrastructure across millions of virtual machines. And we're just getting started. The Opportunity As one of the first Engineering Managers in the Software Engineering team at Databricks India, you will be at the forefront of building infrastructure and products at scale for the Databricks platform. You will lead a talented team working on complex and highly scalable systems, contributing to multiple domains, including: Resource management infrastructure for big data and machine learning workloads that is scalable, secure, and cloud-agnostic. Reliable, scalable services and client libraries that handle massive amounts of data across multiple regions and cloud providers. Developer tools for operating services across different clouds and environments. Services and infrastructure at the intersection of machine learning and distributed systems. The Impact You Will Have Build and lead an exceptional engineering team by hiring and retaining top talent. Ensure high technical standards through robust processes (e.g., architecture reviews, testing) and foster a culture of engineering excellence. Define and execute long-term roadmaps, collaborating with engineering and product leadership. Coordinate execution and unblock cross-team projects, ensuring alignment and timely delivery. What We Look For 10+ years of experience with large-scale distributed systems, including strong expertise in testing, monitoring, and defining SLAs. 
Proven experience as a Software Engineering Leader, building and scaling engineering teams from the ground up. Strong technical background, with extensive experience managing high-performing software engineering teams. Proven ability to partner with Product Management, Sales, and Customers to develop innovative features and products. BS (or higher) in Computer Science or a related field. About Databricks Databricks is the data and AI company, trusted by more than 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500. We help unify and democratize data, analytics, and AI through the Databricks Data Intelligence Platform. Founded by the original creators of Apache Spark, Delta Lake, MLflow, and the Lakehouse architecture, Databricks is headquartered in San Francisco with offices across the globe. Qualification: BS (or higher) in Computer Science, or a related field.
Senior Engineering Manager - Backend
Databricks
About Databricks At Databricks, we are passionate about empowering data teams to solve the world's toughest challenges, from creating the next mode of transportation to accelerating medical breakthroughs. We build and run the world's best data and AI infrastructure platform, enabling our customers to leverage deep data insights to transform their businesses. Founded by engineers and customer-obsessed, we thrive on solving technical challenges, whether it's designing next-gen UI/UX for interacting with data or scaling our services and infrastructure across millions of virtual machines. And we're only getting started. The Opportunity As one of the first Engineering Managers in the Software Engineering team at Databricks India, you will lead a team of talented engineers to build infrastructure and products at scale for the Databricks platform. Our teams work across diverse domains, including: Resource Management Infrastructure: Power big data and machine learning workloads on a scalable, secure, and cloud-agnostic platform. Reliable, Scalable Services and Client Libraries: Handle massive amounts of data across multiple regions and cloud providers. Developer Tools: Help Databricks engineers operate services across different clouds and environments. Services and Infrastructure at the Intersection of Machine Learning and Distributed Systems. The Impact You Will Have Hire and grow a world-class engineering team, fostering a supportive and collaborative environment. Support engineers' career development, providing regular feedback and cultivating future engineering leaders. Ensure high technical standards by implementing effective processes (e.g., architecture reviews, testing) and promoting a culture of engineering excellence. Collaborate with engineering and product leadership to define and execute long-term roadmaps. Coordinate execution and unblock cross-team projects, ensuring timely delivery and alignment with business objectives. 
What We Look For 12+ years of experience with large-scale distributed systems, including expertise in testing, monitoring, and defining SLAs. Proven track record as a Software Engineering Leader, with experience building and scaling engineering teams from the ground up. Extensive experience managing high-performing software engineering teams. Strong collaboration skills to partner with Product Management, Sales, and Customers in developing innovative features and products. BS (or higher) in Computer Science or a related field. About Databricks Databricks is the data and AI company, trusted by over 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and more than 50% of the Fortune 500. We help unify and democratize data, analytics, and AI through the Databricks Data Intelligence Platform. Headquartered in San Francisco, Databricks was founded by the original creators of Apache Spark, Delta Lake, MLflow, and the Lakehouse architecture, with offices across the globe. Qualification: BS (or higher) in Computer Science, or a related field.
Customer Engineer, Ai Infrastructure, Google Cloud
Google Careers
Minimum qualifications: Bachelor's degree in Computer Science, Mathematics, a related technical field, or equivalent practical experience. 10 years of experience with cloud native architecture in a customer-facing or support role. 5 years of experience with cloud infrastructure. 5 years of experience in a technical role focused on AI infrastructure or related areas. Experience building and operationalizing machine learning models. Experience with GPU programming (e.g., CUDA, OpenCL) and optimization techniques. Preferred qualifications: Experience with high-performance computing (HPC) environments and contributions to open-source projects related to AI or infrastructure. Experience training and fine-tuning large models (e.g., image, language, segmentation, recommendation, genomics) with accelerators. Experience with performance profiling tools (e.g., TensorFlow profiler, PyTorch profiler, Tensorboard). Experience designing/architecting large-scale infrastructure farms for specialist AI use cases. Experience with running MLPerf benchmarks, distributed training and optimizing performance versus costs. Excellent communication, presentation, and teamwork skills. About the job The Google Cloud Platform team helps customers transform and build what's next for their business, all with technology built in the cloud. Our products are developed for security, reliability and scalability, running the full stack from infrastructure to applications to devices and hardware. Our teams are dedicated to helping our customers (developers, small and large businesses, educational institutions, and government agencies) see the benefits of our technology come to life. As part of an entrepreneurial team in this rapidly growing business, you will play a key role in understanding the needs of our customers and help shape the future of how businesses of all sizes use technology to connect with customers, employees and partners. 
As a Customer Engineer for AI Infrastructure, you will be the technical expert and trusted advisor for our customers, helping them design, deploy, and optimize AI solutions using cutting-edge hardware and software. Your focus will be on GPUs, accelerators (including FPGAs and ASICs), and Google TPUs. You will work closely with Sales, Product Management, and Engineering to ensure our customers achieve maximum value from their AI investments. You will be responsible for scaling and helping accelerate GCP AI Infrastructure business growth. Google Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems. Responsibilities Be a trusted advisor to customers, helping them understand and incorporate AI accelerators into their overall cloud strategy by recommending migration paths, integration strategies, and application architecture that incorporate Google Cloud AI optimized infrastructure. Demonstrate how Google Cloud is differentiated, highlighting the power of accelerators by working with customers on proof-of-concepts, demonstrating features, optimizing model performance, profiling, and benchmarking. Influence Google Cloud strategy at the intersection of infrastructure and AI/ML by advocating for enterprise customer requirements. Travel to customer sites and events as needed. Be responsible for business growth and workload acceleration on AI infrastructure products and solutions for GCP. Qualification: Bachelor's degree in Computer Science, Mathematics, a related technical field, or equivalent practical experience.
Wifi Automation Development Engineer
Intel Corporation
We are seeking a highly skilled and motivated Python and AWS Automation Engineer to join our dynamic team. In this role, you will be responsible for automating infrastructure, deployment, and operational tasks using Python/C# and AWS services. You will play a critical part in developing and optimizing our cloud-based analytics, streamlining workflows, and enhancing validation efficiency. As a key member of the automation team, you are expected to work closely with DevOps, engineering, and validation teams to implement scalable and reliable automation solutions for various WiFi features and validation processes. You will also be instrumental in integrating AWS services, creating efficient scripts, and ensuring seamless automation of cloud-based resources and services.
Qualifications
Key Responsibilities:
* Design, develop, and maintain Python/C#-based automation scripts that process data into meaningful analytics for validation using AWS resources.
* Leverage AWS services (such as Kubernetes) to automate processing for data analytics.
* Develop automation scripts and libraries for WiFi features.
* Collaborate with DevOps engineers to integrate automation processes into CI/CD pipelines (e.g., Jenkins).
* Monitor and troubleshoot automation workflows to ensure they are running smoothly and efficiently.
* Follow guidelines on best practices for cloud automation, scalability, security, and cost optimization in AWS.
* Participate in the design and implementation of logging, monitoring, and alerting systems using AWS CloudWatch and other monitoring tools.
* Develop and maintain detailed documentation for automation scripts, processes, and AWS configurations.
* Ensure adherence to security best practices and compliance standards for cloud-based applications and infrastructure.
Required Skills and Qualifications:
* Proven hands-on experience in Python and C# programming, with a focus on automation and cloud services.
* Hands-on experience with AWS services, including Kubernetes.
* Strong understanding of cloud infrastructure and best practices for automating, scaling, and monitoring.
* Experience in integrating automation with CI/CD pipelines and using tools like Jenkins.
* Knowledge of containerization and orchestration technologies such as Docker, Kubernetes, and ECS.
* Hands-on experience with automation using REST APIs and UI automation.
* Familiarity with version control systems (e.g., Git) and Gerrit.
* Strong problem-solving and troubleshooting skills.
* Ability to work independently and collaborate effectively within a cross-functional team.
* Excellent written and verbal communication skills.
Domain Knowledge:
1. Hands-on experience in configuring and handling automation setups with different topologies.
2. WiFi experience: using or configuring a WiFi sniffer, attenuator, or access point will be helpful.
3. Fair understanding of test engineering skills, validation methodologies, and debugging techniques.
Soft skills:
1. Ability to work independently and collaborate effectively within a cross-functional team.
2. Good written and verbal communication skills.
3. Quick learning of new technologies.
4. Stakeholder management.
Inside this Business Group The Client Computing Group (CCG) is responsible for driving business strategy and product development for Intel's PC products and platforms, spanning form factors such as notebooks, desktops, 2 in 1s, all in ones. Working with our partners across the industry, we intend to deliver purposeful computing experiences that unlock people's potential, allowing each person to use our products to focus, create and connect in ways that matter most to them. As the largest business unit at Intel, CCG is investing more heavily in the PC, ramping its capabilities even more aggressively, and designing the PC experience even more deliberately, including delivering a predictable cadence of leadership products. 
As a result, we are able to fuel innovation across Intel, providing an important source of IP and scale, as well as help the company deliver on its purpose of enriching the lives of every person on earth. Posting Statement All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance. Benefits We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, and bonuses, as well as benefit programs which include health, retirement, and vacation. Find more information about all of our Amazing Benefits here. It has come to our notice that some people have received fake job interview letters ostensibly issued by Intel, inviting them to attend interviews in Intel's offices for various positions and further requiring them to deposit money to be eligible for the interviews. We wish to bring to your notice that these letters are not issued by Intel or any of its authorized representatives. Hiring at Intel is based purely on merit and Intel does not ask or require candidates to deposit any money. We would urge people interested in working for Intel