ML Infrastructure Engineer Jobs in Bengaluru
1067 Jobs Found
Staff Engineer - Software Development
Aviatrix Systems
Staff Engineer - Software Development (Cloud AI & Network Security) Location: Bengaluru Company: Aviatrix Experience Required: 7+ Years About Aviatrix: Aviatrix is a global leader in cloud network security, trusted by over 500 enterprises. We provide a specialized platform for securing multi-cloud environments, giving organizations the control and visibility needed to modernize their cloud strategies. Architectural Focus & Impact As a Staff Engineer, you will architect and deliver advanced AI-driven network security solutions. This role bridges the gap between Distributed Systems (Python/Go), Real-time Telemetry, and LLM-integrated automation to build self-learning, adaptive security infrastructures. Technical Expertise Core Software Engineering: Languages: Deep proficiency in Python and Go (Golang). Distributed Systems: Mastery of Kubernetes, Microservices, and high-scale observability (Prometheus, ELK). Data Pipelines: Experience with real-time stream processing using Kafka, Flink, Kinesis, or Pub/Sub. Networking & Security Domain: Cloud Infrastructure: Expert knowledge of VPC/VNet design, Routing, Load Balancers, and Overlays. Firewall Technologies: Hands-on with Deep Packet Inspection (DPI), NGFW/IDS/IPS, and Cloud-native firewalls (AWS, Azure, GCP). Security Frameworks: Alignment with Zero Trust, NIST CSF, and CIS Benchmarks. AI & Machine Learning Integration: Model Serving: Experience serving ML models via REST or gRPC. Generative AI: Familiarity with LLM integration, RAG (Retrieval-Augmented Generation), LangChain, and vector databases. Key Responsibilities System Architecture: Lead the design of cloud-native microservices for security control planes. AI-Driven Features: Integrate LLMs for Natural Language-to-Firewall Rule translation and automated incident summarization. Technical Leadership: Mentor junior engineers and set high standards through rigorous Design and Code Reviews. 
Cross-Functional Collaboration: Partner with Data Scientists and Cloud Networking teams to deliver production-grade AI features. Benefits & Why Join Us Regional Package: Comprehensive pension, private medical coverage, and life assurance. Wellbeing: Annual wellbeing stipend and generous holiday allowance. Growth Culture: We value unique career paths and prioritize candidates who are passionate about the intersection of AI and Security.
Data Engineering Lead
Fampay
Data Engineering Lead Bengaluru | Engineering | Full-Time About Fam (formerly FamPay) Fam is India's first payments app designed for everyone aged 11 and above. FamApp enables seamless online and offline payments through UPI and FamCard. Our mission is to empower over **250 million young Indians** to start their financial journey early, becoming financially aware and confident. Founded in 2019 by IIT Roorkee alumni, Fam is backed by top-tier investors including Elevation Capital, Y Combinator, Peak XV (Sequoia Capital India), Venture Highway, and angels like Kunal Shah and Amrish Rao. About the Role We're looking for a visionary **Data Engineering Lead** to take **end-to-end ownership** of Fam's data ecosystem, from data ingestion and storage to processing and delivering actionable insights. You'll **define the data strategy and architecture** that supports both batch and **real-time** use cases, ensuring scalability, reliability, and governance across the organization. You will be instrumental in enabling accurate, complete, and trusted data flow that powers business intelligence, analytics, and product decision-making. This role involves **leadership, strategic thinking**, and hands-on problem solving. What You'll Do Own the full data lifecycle: ingestion, organization, storage, processing, and presentation. Define and execute **data architecture and strategy** aligned with operational and analytical goals. Build **scalable, reliable, and observable data systems** supporting batch and near real-time processing. Ensure **data quality, governance, and compliance**, proactively resolving discrepancies. Collaborate with product, engineering, and business teams to define, track, and optimize key metrics. Anticipate data-related challenges and implement preventive solutions. Lead, mentor, and grow the data engineering team, fostering innovation and accountability. Must-Haves 10+ years of experience in data engineering, including proven leadership of teams or projects.
Expertise designing, building, and scaling end-to-end data pipelines and systems. Deep understanding of the data lifecycle, from ingestion through business reporting. Strong communication skills and ability to collaborate across technical and business teams. Solid knowledge of **data governance, quality assurance, and compliance standards**. Experience with observability and proactive monitoring for data systems. Proficiency in Python and SQL; familiarity with Scala or Java. Hands-on experience with streaming and batch data frameworks. Experience designing large-scale data lakes and warehouses with best practices for schema evolution and partitioning. Strong background with **cloud platforms (AWS, GCP, or Azure)**. Fintech or regulated industry experience is a plus. Good to Have Fintech-specific data experience, including regulatory compliance and reporting. Deployment experience with **real-time analytics** and event-driven architectures. Familiarity with containerization and infrastructure tools like Docker, Kubernetes, Terraform. Knowledge of data observability tools (Monte Carlo, Databand, etc.). Exposure to **ML pipelines** and model deployment. Solve challenging problems at the intersection of big data, real-time processing, and fintech. Lead impactful data initiatives at a rapidly growing startup. Collaborate with a world-class team of engineers, data scientists, and product leaders. Competitive compensation, equity, and benefits. Clear career growth opportunities in leadership and innovation. Perks That Go Beyond the Paycheck Relocation assistance for a smooth move. Free office meals (lunch & dinner). Generous leave policies (birthday, period, parental support, and more). Salary advances and loan policies for financial support. Quarterly rewards, recognition, and referral incentives. Access to the latest gadgets and tools. Comprehensive health insurance with mental health support. Tax benefits like food coupons, phone allowances, and leasing options. 
Retirement benefits including PF contribution, leave encashment, and gratuity. About FamApp FamApp focuses on financial inclusion for the next generation by offering UPI and card payments to users aged 11+. Our flagship product, FamX, integrates UPI and card payments seamlessly, helping users manage, save, and learn about their finances effortlessly. With over **10 million users**, FamApp is revolutionizing how young Indians transact, eliminating the need to carry cash and offering customizable FamX cards with personal doodles for a fun, unique payment experience. Join Our Dynamic Team At Fam, we foster a people-first culture with flexible work schedules, generous leave, comprehensive health benefits, and mental health support. You'll be part of a passionate, talented, and fun team shaping the future of fintech for India's youth.
Lead Platform Engineer
Team Vunet Systems
Lead Platform Engineer - Observability Solutions Location: Bengaluru Experience: 6-10 Years Function: Observability Engineering | Platform Architecture | SRE Enablement Join VuNet: Redefining Digital Observability at Scale VuNet is transforming the future of digital experiences through Business Journey Observability, combining Big Data and AI/ML to empower real-time visibility across payments, banking, and financial services. Monitoring 28+ billion transactions/month, our platform is trusted by top financial institutions and powers over 300 million users. Backed by Series B funding and recognized by Gartner, NASSCOM, and Forbes, we are leading the charge in building a new category of observability, proudly Made in India for global impact. Your Role: Lead Platform Engineer As the Lead Platform Engineer, you will architect and drive the development of packaged observability solutions across 100+ infrastructure and application technologies. You will define **golden signals**, build **data collection strategies**, and lead the standardization of alerts, dashboards, and RCA workflows for platforms like **Kubernetes, Oracle DB, and Tomcat**. This is a cross-functional leadership role that sits at the intersection of product, platform, DevOps, and SRE. You will **lead a team** and influence how observability is delivered, scaled, and adopted across complex environments. Key Responsibilities Observability Solution Development Design and lead the delivery of observability packages for databases, middleware, cloud-native, and legacy platforms. Define and implement data collection pipelines, including agents, APIs, logs, metrics, traces, and service discovery. Establish **golden signals, SLIs/SLOs**, and health KPIs for performance, availability, and anomaly detection. Dashboards, Alerts & RCA Develop standardized, reusable dashboards, alerts, reports, and troubleshooting playbooks. Automate **RCA workflows** to improve MTTR and reduce alert fatigue.
Platform Enablement & Integration Work with engineering to enhance agent capabilities and support new data sources/formats. Guide implementation of platform features for better observability at scale. Team Leadership & Governance Lead and mentor a team of observability engineers and specialists. Define design patterns, reusable modules, and version-controlled libraries. Stakeholder Collaboration Partner with product managers, DevOps, SREs, and customer teams to gather requirements, align priorities, and validate use cases. Ensure deliverables are scalable, well-documented, and production-ready. What You Bring Must-Have Skills 6-10 years of experience in observability, platform engineering, or SRE roles. Hands-on with tools like Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, Splunk. Strong understanding of logs, metrics, traces, profiling, and collection strategies. Experience developing solutions for platforms like Kubernetes, Oracle, PostgreSQL, Tomcat, etc. Proficient in Python, Shell scripting, APIs, and automation tools (**Terraform**, etc.). Familiar with alert fatigue mitigation, anomaly detection, and RCA frameworks. Excellent communication, technical leadership, and documentation skills. Nice to Have Experience managing an observability marketplace or solution catalog. Contributions to open-source observability projects. Certifications in Kubernetes, Observability platforms, or cloud providers (AWS/GCP/Azure). Background in ITSM tools, CMDBs, or incident workflow automation. At VuNet, you'll help build a category-defining observability platform that's already transforming critical infrastructure for leading financial institutions. You'll work with passionate engineers, push technical boundaries, and grow in a high-trust, high-impact environment. What You'll Experience: Ownership of key observability initiatives impacting 300M+ users. Collaboration with SRE, DevOps, and product teams across real-time financial systems.
Opportunity to experiment with and shape Gen AI, ML, and emerging telemetry trends. Perks & Benefits Health insurance for you, your parents, and dependents. 1:1 mental wellness support. Training programs, certifications, and career growth opportunities. Transparent, inclusive, and high-trust work culture. Access to cutting-edge technology and Gen AI-powered workspaces.
Distinguished Engineer - Machine Learning Engineering
Capital One
Distinguished Engineer - Machine Learning Engineering Location: Bangalore Company: Capital One India About Us At Capital One India, we're redefining how technology powers financial services. Our teams work in a fast-paced, intellectually rigorous environment to tackle complex business challenges at scale. By harnessing the power of advanced analytics, data science, and machine learning, we create innovative, patentable solutions that transform customer experiences and drive the business forward. Team Overview: Machine Learning Experience (MLX) The MLX team leads Capital One's mission to build scalable, well-managed ML systems and platforms. We empower teams across the enterprise to develop, govern, and deploy machine learning models efficiently, securely, and at scale. From automated model governance to observability platforms, MLX enables end-to-end ML lifecycle management, laying the foundation for AI-driven innovation across the organization. Role Overview We're looking for a Distinguished Engineer - Machine Learning Engineering to join our MLX team. In this high-impact role, you'll architect and implement the platforms and tools that support model observability, automated governance, and ML model deployment at scale. This is an opportunity to drive enterprise-wide innovation and shape how ML is integrated into Capital One's core business systems. What You'll Do Design and build systems that capture and analyze large-scale model and feature metadata, including training metrics and runtime performance, to power model observability and governance automation. Partner with cross-functional teams including product managers, designers, and platform engineers to create scalable solutions that accelerate ML model lifecycle management. Lead efforts to enable automated governance decisions for ML models, ensuring compliance, auditability, and operational integrity. Architect and implement high-performance data pipelines that feed ML models with real-time and batch data.
Contribute to the design and implementation of cloud-native ML systems using tools such as AWS, Kubernetes, and Terraform. Write clean, scalable, production-grade code in languages like Python, Go, or Java. Implement CI/CD pipelines, testing frameworks, and monitoring systems for ML applications. Drive the adoption of best practices in ML Ops, observability, and platform resilience. Basic Qualifications Master's Degree in Computer Science or related field. 15+ years of experience in software engineering or solution architecture. 10+ years building data-intensive, distributed computing systems. 10+ years programming in Python, Go, or Java. 8+ years of hands-on experience with industry-leading ML frameworks (e.g., Scikit-learn, TensorFlow, PyTorch, Dask, Spark). Preferred Qualifications PhD or Master's in Computer Science, Electrical Engineering, Mathematics, or related field. 5+ years of experience building, scaling, and optimizing production ML systems. Deep expertise in data preparation, feature engineering, and ML pipeline optimization. 10+ years writing performant, maintainable, and resilient production code. Strong experience deploying ML solutions on public cloud platforms (AWS, Azure, GCP). Expertise in distributed systems, file systems, or multi-node databases. Open-source contributor to ML tools or libraries. Published work in ML (papers, patents, blogs, etc.). 5+ years of experience in ML Ops (using MLflow, TFX, Kubeflow, etc.). Experience with LLMs and Generative AI applications (open-source or commercial models). Proven experience designing production-ready observability platforms for ML applications. Be at the forefront of building scalable, secure, and enterprise-grade ML platforms. Shape the future of AI and ML adoption in a top-tier financial institution. Collaborate with world-class engineers and data scientists. Solve real-world problems with high business impact. Thrive in a diverse, inclusive, and innovation-focused culture.
Qualification : PhD or Master's in Computer Science, Electrical Engineering, Mathematics, or related field
Data Engineer
Capital One
Data Engineer Location: Bangalore Company: Capital One India About Capital One At Capital One, we're redefining how technology solves real-world financial challenges. As a technology-driven company, we bring together talented engineers, data scientists, and designers to innovate at scale and deliver meaningful impact to millions of customers. If you're passionate about building powerful data solutions, exploring cutting-edge technologies, and working in a collaborative, fast-paced environment, this is the place for you. About the Role As a Data Engineer at Capital One, you'll join a team of innovators who design and build next-generation data platforms and pipelines that power real-time decision-making. You'll collaborate across disciplines (engineering, product, machine learning, and cloud infrastructure) to transform how we leverage data at scale. What You'll Do Collaborate across Agile teams to design, develop, test, and deploy data-driven solutions. Build and support scalable data pipelines using modern data engineering tools and cloud services. Work on real-time and batch data processing systems that integrate with distributed microservices and ML platforms. Use programming languages such as Python, Java, or Scala with SQL, NoSQL, and cloud data warehouses like Redshift or Snowflake. Contribute to code reviews, unit testing, and performance optimization to ensure high-quality data systems. Partner with product managers and platform teams to deliver robust, cloud-native data solutions that power business decisions. Stay ahead of tech trends, share knowledge, and mentor junior engineers. Basic Qualifications Bachelor's degree in Computer Science, Engineering, or a related field. 1.5+ years of hands-on experience in application or data engineering (excluding internships). At least 1 year of experience working with big data technologies. Preferred Qualifications 3+ years of application/data engineering experience using Python, Scala, Java, or SQL.
1+ year of experience with cloud platforms (AWS, Azure, or GCP). 2+ years of experience with distributed computing tools (Spark, Hadoop, Hive, EMR, Kafka, etc.). 1+ year working on real-time streaming applications. 1+ year of experience with NoSQL databases (MongoDB, Cassandra). 1+ year of experience with data warehousing (Redshift, Snowflake). 2+ years working with Linux/Unix systems and shell scripting. Familiarity with Agile methodologies and modern DevOps practices. Why Join Capital One Work on high-impact data solutions at one of the world's most innovative financial institutions. Be part of a collaborative tech culture that values experimentation and learning. Access to top-tier tools, mentorship, and career development opportunities. Competitive compensation and benefits in a mission-driven environment. Qualification : Bachelor's degree in Computer Science, Engineering, or a related field
Principal Associate - Full Stack Engineering
Capital One
Principal Associate - Full Stack Engineering (GenAI Observability) Location: Bangalore Company: Capital One India About Us At Capital One India, we're tackling some of the most complex problems in financial services using machine learning, advanced analytics, and cloud-first engineering. Our mission is to build cutting-edge, patentable solutions that transform customer experiences, enhance operational efficiency, and ensure robust risk and compliance standards. We're a team of makers, breakers, doers, and disruptors obsessed with turning data into real-world impact at scale. About the Team: Machine Learning Experiences (MLX) The MLX team is pioneering the future of model governance, ML observability, and Generative AI infrastructure at Capital One. We're enabling teams to seamlessly deploy ML and GenAI models at scale, with full visibility into performance, health, compliance, and ethical usage. This is the platform powering the next generation of AI-driven financial products across the company. About the Role We're looking for a Principal Associate - Full Stack Engineer to lead the development of observability platforms for Generative AI systems. You'll be part of a cross-functional team focused on governance automation, LLM monitoring, and intelligent diagnostics using telemetry data, metadata, and advanced analytics. You'll design systems to collect, analyze, and visualize performance data from our large-scale GenAI infrastructure, helping data scientists and engineers make faster, safer decisions. What You'll Do Lead architecture and development of observability tools and dashboards for monitoring GenAI models and platform health. Design and build core APIs and SDKs to instrument large language models (LLMs) and foundational models (training, fine-tuning, prompting stages). Integrate Generative AI to enable observability features like anomaly detection, predictive analytics, and copilot-assisted troubleshooting.
Partner with platform, MLOps, and governance teams to ingest and analyze telemetry, metadata, and runtime metrics at scale. Drive development of tools to ensure compliance with AI ethics, data governance, and industry regulations. Collaborate with product, design, and research to turn complex requirements into scalable, cloud-native software solutions. Lead proof-of-concept initiatives to test and showcase how GenAI can improve platform observability and decision-making. Contribute to the open-source community and stay at the forefront of GenAI and ML infrastructure evolution. Basic Qualifications Bachelor's or Master's degree in Computer Science, Engineering, or related field. 4+ years of experience building distributed, data-intensive systems using microservices architecture. 4+ years of experience in backend development with Python, Go, or Java. 4+ years of expertise with observability stacks (Prometheus, Grafana, ELK) and adapting them for AI systems. Strong knowledge of OpenTelemetry, and experience building custom SDKs and APIs. 5+ years of hands-on experience with Generative AI models, especially applied to observability, governance, or compliance. 2+ years of experience with cloud platforms such as AWS, Azure, or GCP. Preferred Qualifications 4+ years building and optimizing ML systems in production environments. 3+ years of experience with MLOps tools like MLflow, Kubeflow, or commercial platforms. Experience with GenAI frameworks and libraries like LangChain, Haystack, and vector databases (FAISS, Chroma, OpenSearch). Familiarity with emerging observability tools for LLMs such as Langfuse, Phoenix, Helicone, or OpenInference. Contributor to open-source GenAI or ML infrastructure projects. Author or co-author of published work in AI/ML observability, governance, or performance monitoring. Experience with PyTorch, TensorFlow, Spark, or Dask. Knowledge of NVIDIA GPU telemetry, CUDA programming, and performance optimization for AI workloads. Understanding of AI ethics, data governance, and regulatory frameworks for machine learning systems.
Why Join Capital One India Work at the intersection of technology, AI, and compliance, helping shape the future of responsible AI. Join a team driving enterprise-wide adoption of Generative AI. Collaborate with world-class engineers, data scientists, and product leaders. Enjoy a high-performance culture that encourages innovation, learning, and mentorship. Access to cutting-edge tools, open-source contributions, and cloud-native infrastructure. Qualification : Bachelor's or Master's degree in Computer Science, Engineering, or related field
ML Ops Engineer
Mpokket Financial Services Private Limited
Job Title: ML Ops Engineer Location: Bangalore Department: Data Science Employee Type: Full-time Experience Required: 3-5 years Position Overview We are seeking an experienced and motivated ML Ops Engineer to join our Data Science team. In this role, you will be responsible for deploying, monitoring, and maintaining machine learning models in production environments. You will work closely with data scientists, engineers, and product teams to ensure models are scalable, reliable, and aligned with business objectives. This role is ideal for professionals who are passionate about building robust ML pipelines and bringing machine learning solutions into real-world applications at scale. Key Responsibilities Deploy and manage machine learning models in production environments, ensuring scalability, reliability, and performance. Build and maintain MLOps pipelines using platforms like Databricks and MLflow. Monitor model performance, accuracy, and health; implement alerting and diagnostics as needed. Develop and maintain RESTful APIs using Python frameworks such as Flask or Django to serve ML models. Optimize data workflows and collaborate with engineering teams to improve model integration and performance. Design strategies for automated model retraining, deployment, and version control. Write clean, maintainable, and efficient code using Python, adhering to OOP principles and best practices. Write complex queries using SQL and work with NoSQL databases to support data pipelines and feature stores. Leverage Python libraries such as PySpark, Pandas, scikit-learn, SQLAlchemy, and Requests. Minimum Qualifications Bachelor's or Master's degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field. 3-5 years of experience in building, deploying, and monitoring machine learning solutions in production. Must-Have Skills Experience with Databricks and MLflow for model training and deployment.
Proven expertise in machine learning model deployment and monitoring in live environments. Strong programming skills in Python, with solid understanding of data structures, algorithms, and OOP concepts. Experience developing RESTful APIs using Flask or Django. Proficient in SQL and NoSQL database operations. Hands-on knowledge of libraries such as Pandas, PySpark, scikit-learn, SQLAlchemy, and Requests. Strong analytical, problem-solving, and debugging skills. Good-to-Have Skills Experience with Kafka streaming and batch processing. Familiarity with CI/CD pipelines and version control systems like Git. Understanding of Python multiprocessing, worker/queue systems, and asynchronous/event-driven programming. This is a unique opportunity to work at the intersection of machine learning and DevOps. You'll play a critical role in operationalizing AI models and making them a core part of our product offerings. If you enjoy building scalable systems and solving real-world ML engineering challenges, we'd love to meet you. Qualification : Bachelor's or Master's degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field
Senior Analyst
Latentview Analytics
Role: Senior Analyst - Machine Learning Performance & Testing Location: Bengaluru, Karnataka, India Experience: 3-5 Years Employment Type: Permanent, Full-Time About the Role We are seeking a skilled and detail-oriented Senior Analyst with strong experience in ML model performance testing, load testing, and end-to-end (E2E) automation. This role is focused on ensuring scalable, low-latency deployment of production-grade machine learning models. The ideal candidate will be proficient in evaluating model performance under varied workloads, building robust test frameworks, and enhancing system monitoring. Key Responsibilities Conduct load testing and performance benchmarking for machine learning models under varying requests per second (RPS) scenarios. Develop and automate end-to-end test cases to validate model readiness and support smooth rollouts. Monitor and improve model scalability, response time, and error rates across production environments. Collaborate with ML engineers, backend developers, and QA test teams to ensure seamless integration and testing workflows. Identify and address bottlenecks in model inference, helping improve performance for high-volume, low-latency applications. Set up alerting and observability pipelines for model health using industry-standard tools.
Required Skills & Tools Performance Testing & Monitoring: ML Load Testing, Job Monitoring, Model Scalability Evaluation. Platforms & Tools: Databricks, MLflow, Seldon, Kubeflow, Tecton, Jenkins. Cloud Services: Experience with AWS and deploying/testing models in cloud environments. Programming Languages: Proficiency in at least one of the following: Python, Java, Scala. Experience: Working with production-level ML models, especially involving high data volumes and real-time inference. Strong communication skills and ability to work in cross-functional teams. Preferred Qualifications Hands-on experience with CI/CD pipelines for ML systems. Knowledge of A/B testing and canary deployments for ML models. Experience building testing frameworks for ML infrastructure at scale. Understanding of monitoring and alerting best practices in production ML systems. Be at the forefront of ML operations and model performance optimization. Collaborate with industry-leading engineers and contribute to cutting-edge AI deployments. Gain deep exposure to real-time data systems, cloud platforms, and enterprise-scale ML testing. Competitive compensation and an innovative, fast-paced work environment.
Data Engineer
Colan Infotech
Data Engineer Experience: 5+ Years Location: Bangalore, Karnataka, India Job Type: Full Time Job Summary We are looking for a talented Data Engineer with over 5 years of experience and a strong foundation in machine learning development to join our team in Bangalore. The ideal candidate will have hands-on expertise in Python programming, machine learning basics, and computer vision techniques like custom object detection and OCR. Key Responsibilities Develop and maintain data pipelines supporting machine learning models and data analytics. Implement custom object detection algorithms and OCR solutions using computer vision techniques. Utilize Python ML libraries such as OpenCV, SciPy, NumPy, Matplotlib, Pandas, Scikit-learn, Keras, PyTorch, and TensorFlow. Collaborate with data scientists and software engineers to optimize data workflows and ML model deployment. Ensure data quality, integrity, and scalability within data infrastructure. Troubleshoot and improve existing machine learning systems and pipelines. Required Skills Minimum of 5 years of experience in data engineering or related roles. At least 2 years of hands-on experience as a Machine Learning developer. Strong programming skills in Python. Solid understanding of machine learning fundamentals. Practical experience with custom object detection and OCR applications. Proficiency in key Python libraries for machine learning and data processing. Ability to work collaboratively in cross-functional teams. Qualifications Any graduate degree from a recognized university. Work with cutting-edge machine learning technologies in a supportive, innovative environment in Bangalore. Grow your career by solving complex problems and building impactful data solutions with a passionate team. Qualification : Any graduate degree from a recognized university.
Software Engineer, Backend (AI Team)
Limechat
Job Title: Software Engineer, Backend (AI Team) Location: Bengaluru, India Company: LimeChat About LimeChat LimeChat is building the future of conversational commerce, enabling brands to deliver human-level interactions at scale via WhatsApp and other messaging platforms. As a proud graduate of Y Combinator's Winter 2021 batch, we serve 300+ top-tier brands like HUL, ITC, Wow Skin Science, Piramal Health, and Snitch. Our mission is simple: use Generative AI to automate and personalize customer interactions in e-commerce, now expanding into BFSI, Health, and Retail sectors. If you're a backend engineer who thrives on impact, collaboration, and building innovative systems at scale, this is your opportunity to do work that truly matters. What You'll Do Architect and Develop Backend Systems: Design robust, scalable backend architectures for AI products that handle millions of conversations. Integrations and APIs: Build and maintain seamless, secure integrations with third-party platforms and internal services. Work with AI Products: Collaborate with ML engineers and product teams to connect AI models and agents to real-time customer journeys. Database Management: Design and optimize relational and NoSQL databases (e.g., PostgreSQL, MongoDB). Performance & Reliability: Identify bottlenecks and implement backend improvements to ensure high performance and reliability. Collaborate Cross-Functionally: Work closely with product, design, and frontend teams to ship features that delight users. Write Clean, Maintainable Code: Follow best practices in code quality, documentation, and testing. Participate in peer code reviews.
You Should Have Must-Haves: 2-4 years of backend experience in a high-growth tech/startup environment. Proficiency in Python, Node.js, and frameworks like Django. Strong command of SQL and NoSQL databases (e.g., PostgreSQL, MongoDB). Solid understanding of RESTful API design and best practices. Experience with Git, code reviews, and agile development workflows. Excellent debugging and code analysis skills, including performance optimization. Nice-to-Haves: Hands-on experience with Docker, Kubernetes, or other container orchestration tools. Familiarity with CI/CD pipelines (GitLab CI, Jenkins, etc.). Experience with API load testing, monitoring, and observability tools. Exposure to AI/ML pipelines and/or conversational AI systems. Why You'll Love Working Here Massive Impact: Join a lean, fast-moving team where your work directly influences product and user experience. Innovation-First Culture: Work at the intersection of AI, automation, and customer experience. Smart Team: Collaborate with ex-founders, IITians, and top engineers. Fast-Growth Startup: Backed by leading VCs and part of Y Combinator, we're scaling globally. Ownership and Autonomy: You'll be trusted to take full ownership and drive initiatives end to end. Quotes We Live By "It's okay to fail. It's not okay to not try." "Do the right thing when others are not looking." Apply now and be part of the LimeChat revolution.
Engineering Leader - Machine Learning
Eightfold
Engineering Leader - Machine Learning Location: Bangalore, Karnataka, India Employment Type: Full-Time | Hybrid Work Model About Eightfold.ai At Eightfold.ai, we're revolutionizing the future of employment by leveraging artificial intelligence to connect individuals to the right career opportunities based on their skills, not just their network. Our AI-powered Talent Intelligence Platform is transforming how organizations plan, hire, develop, and retain a diverse workforce. With $410M+ in funding and a $2B+ valuation, we're growing rapidly, and if you're passionate about solving one of society's most critical challenges, employment, then Eightfold is the place for you. Led by visionary leaders like Ashutosh Garg (former Google Search and Personalization leader) and Varun Kacholia (former Facebook and YouTube leader), we are shaping the future of AI-powered talent management. About the AI Platform Team Our AI/ML team is the heart of Eightfold, pushing the boundaries of applied machine learning. We're working with massive datasets, solving complex problems, and building cutting-edge AI models that are reshaping how companies approach talent management. Join us if you're eager to work in a team where every day presents a new challenge and opportunity. What You'll Do As the Engineering Leader - Machine Learning, you'll be responsible for leading and mentoring a high-performing team of engineers, driving the success of AI-driven products at Eightfold. Your primary responsibilities will include: Team Leadership: Coach, mentor, and manage a talented team of engineers to foster a culture of innovation, collaboration, and high performance. ML Model Ownership: Lead the development and deployment of cutting-edge deep learning models across all Eightfold products, ensuring reliability, scalability, and high-quality performance. Platform Development: Help build high-performance, flexible infrastructure that supports a variety of deep learning techniques and modeling approaches.
End-to-End ML Pipeline: Oversee the end-to-end process of building and deploying machine learning models, including creating robust data pipelines that can process unstructured data. ML Framework Implementation: Design and implement an intuitive ML development framework that ensures efficiency and ease of use for data scientists and engineers. Model Fairness: Work with our internal model fairness platform to ensure that we are providing equal opportunity for everyone through responsible ML practices. Cross-Team Collaboration: Work closely with product teams to apply deep learning techniques to solve complex business problems across various domains. What You Should Already Know To be successful in this role, you should have: Strong Foundation in ML & Deep Learning: Expertise in applying Natural Language Processing (NLP) and deep learning solutions to solve real-world problems. Experience with Language Models: Familiarity with advanced language models such as BERT, GPT-3, T5, and others. Academic Background: A BS, MS, or PhD in Computer Science, Data Science, Mathematics, or related fields. Proven ML Experience: Hands-on experience building and deploying machine learning models at scale, particularly in production environments. Programming Expertise: Strong knowledge of ML languages such as Python, C++, Java, R, Scala, and experience with scientific libraries like numpy, pandas, and frameworks like TensorFlow, PyTorch, scikit-learn, etc. Strong ML Theory Knowledge: In-depth understanding of ML theory and experience working with large-scale datasets, data ingestion, and processing systems. Experience with Distributed Systems: Familiarity with distributed systems, including REST APIs, microservices, and data processing frameworks. Metrics-Focused: A passion for building high-quality models that deliver results and metrics-driven outcomes. Nice to Have Real-Time Tech Problems: Experience with real-time processing or low-latency systems. 
Cloud Environments: Familiarity with cloud platforms like AWS, and experience using cloud-based ML tools. MLOps Tools & Pipelines: Experience with MLflow, Metaflow, or similar tools to streamline ML workflows and operations. Advanced Tech Stack: Familiarity with tools like Spark, MLlib, Databricks, Apache Airflow, etc. Impactful Work: Join a company dedicated to solving one of society's most pressing issues: employment. Your work will have a direct impact on individuals' careers around the world. Innovation at Scale: Work with cutting-edge AI and ML technologies to shape the future of talent management. Competitive Compensation: Receive an attractive salary, equity, and comprehensive benefits package (including family medical, vision, and dental coverage). Collaborative Environment: Work in a culture that values transparency, ownership, and collaboration across teams. Hybrid Work Model: Enjoy a hybrid work environment, with flexibility for remote work and in-office collaboration at our Bangalore office. Growth Opportunities: Be part of a rapidly scaling company with vast opportunities for career development and leadership roles. Equal Opportunity Employer Eightfold.ai is an Equal Opportunity Employer. We do not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, or disability. If you are an experienced and visionary leader in Machine Learning, eager to make a lasting impact while solving one of society's most important challenges, we would love to hear from you. Qualification : BS or MS or PhD degree in Computer Science, Data Science or Mathematics
Backend Engineer
Sarvam
Backend Engineer Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai At Sarvam.ai, we're on a mission to bring generative AI to Bharat. Headquartered in Bengaluru and founded by industry-leading AI experts, Sarvam.ai is pioneering India-first, cost-effective, and high-performance AI agents that enable enterprises to unlock new opportunities and deliver deeper customer value. Join us in shaping the future of AI for India and beyond. Role Overview We are seeking a Backend Engineer with strong Python skills to help build the core infrastructure powering our AI-driven voice and generative applications. You'll work at the intersection of high-performance backend systems and machine learning model orchestration, playing a pivotal role in bringing cutting-edge AI into production environments. Key Responsibilities Develop, maintain, and scale backend services and RESTful APIs using Python and FastAPI (or similar frameworks like Flask, Django). Design and implement Retrieval-Augmented Generation (RAG) systems for AI-powered applications. Build data pipelines and orchestrate workflows for ML/AI model deployment. Write clean, modular code following industry best practices including unit testing and code reviews. Collaborate cross-functionally with ML engineers, data scientists, and product teams to integrate AI models into production environments. Optimize database operations for structured and unstructured data using both SQL and NoSQL solutions. Implement CI/CD pipelines and work with version control systems like Git for smooth deployment cycles. Participate in system architecture discussions and contribute to scaling and performance enhancements. Must-Have Skills & Qualifications Education: Bachelor's degree in Computer Science, Engineering, or a related technical field. Programming: Strong proficiency in Python and sound knowledge of programming principles.
Web Frameworks: Experience with FastAPI, Flask, or Django. Database Knowledge: Comfortable working with SQL and NoSQL databases. AI & ML Exposure: Understanding of ML model lifecycle, deployment practices, and basic machine learning concepts. RAG Systems: Exposure to Retrieval-Augmented Generation or AI search systems. Version Control: Hands-on experience with Git and Git-based workflows. Analytical Mindset: Strong problem-solving and debugging skills. Teamwork: Excellent communication and collaboration skills. Nice-to-Have (Preferred Experience) Projects: Demonstrated experience building backend applications via academic, freelance, or open-source work. Cloud Experience: Familiarity with AWS, GCP, or Azure services. DevOps Tools: Exposure to Linux, Docker, Kubernetes, and CI/CD pipelines. Open Source: Active GitHub profile or contributions to open-source projects. Work on real-world AI applications impacting millions across India. Be part of a high-caliber team of AI and product engineering experts. Get in early at a fast-growing generative AI startup redefining the future of enterprise AI in India. Opportunity to grow rapidly, take ownership, and drive innovation. Qualification : Bachelor's degree in Computer Science, Engineering, or a related technical field.
Data Scientist Lead - L1
Wipro Limited
Data Scientist Lead - L1 Requisition ID: 64997 Location: Bengaluru, India Company: Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) Company Overview Wipro Limited is a leading technology services and consulting company focused on building innovative solutions that address clients' most complex digital transformation needs. Leveraging our holistic portfolio of capabilities in consulting, design, engineering, and operations, we help clients realize their boldest ambitions and build future-ready, sustainable businesses. With over 230,000 employees and business partners across 65 countries, we deliver on the promise of helping our customers, colleagues, and communities thrive in an ever-changing world. Job Description Role Purpose: The purpose of this role is to define, architect, and lead the delivery of machine learning and AI solutions. Key Responsibilities: Demand generation through support in solution development: Support Go-To-Market strategy. Collaborate with sales, pre-sales & consulting teams to assist in creating solutions and propositions for proactive demand generation. Contribute to the development of solutions and proofs of concept aligned to key offerings to enable solution-led sales. Collaborate with different colleges and institutes for recruitment and joint research initiatives, and provide data science courses. Revenue generation through building and operationalizing Machine Learning and Deep Learning solutions: Develop Machine Learning / Deep Learning models for decision augmentation or for automation solutions. Collaborate with ML Engineers, Data Engineers, and IT to evaluate ML deployment options. Integrate model performance management tools into the current business infrastructure. Team Management. Resourcing: Support recruitment process to onboard the right resources for the team. Talent Management: Support onboarding and training for team members to enhance capability & effectiveness. Manage team attrition.
Performance Management: Conduct timely performance reviews and provide constructive feedback to direct reports. Be a role model to the team for the five habits. Ensure that the Performance Nxt process is followed for the entire team. Employee Satisfaction and Engagement: Lead and drive engagement initiatives for the team. Performance Parameters: 1. Demand generation, measured by order booking. 2. Revenue generation through delivery, measured by timeliness, customer success stories, and customer use cases. 3. Capability Building & Team Management, measured by % trained on new skills and team attrition %. Mandatory Skills: AI Cognitive. Experience: 5-8 Years. About Wipro Wipro is building a modern digital transformation business with bold ambitions. Join a team that values reinvention of yourself, your career, and your skills. Wipro is a place that empowers you to design your own career reinvention, evolve, and grow. Applications from people with disabilities are explicitly welcome.
AI Platform Architect
Adobe
AI Platform Architect Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is changing the world through digital experiences. Whether you're an emerging artist or a global brand, our tools empower creativity and innovation across every screen. From powerful imaging and video solutions to immersive web and app design, Adobe's mission is to help people and businesses deliver exceptional digital experiences. We are committed to creating an inclusive workplace where everyone is respected and given equal opportunity. Innovation can come from anywhere, and the next big idea could be yours. Job Description We are looking for a visionary AI Platform Architect with deep expertise in building and scaling cloud-native, AI-powered platforms. The ideal candidate will have experience deploying large-scale, customer-facing AI solutions and a deep understanding of modern cloud architecture, data systems, MLOps, and LLMOps. Responsibilities Design and develop scalable AI/ML platforms and pipelines across AWS, Azure, and GCP. Architect end-to-end LLM pipelines including model training, fine-tuning, serving, inference APIs, and monitoring. Lead cross-functional teams in delivering AI solutions from experimentation to production. Implement MLOps and LLMOps best practices using tools like MLflow, SageMaker, LangChain, and LangGraph. Design GPU-optimized architectures for training and inference of LLMs using DeepSpeed, vLLM, and other modern frameworks. Support infrastructure automation and container orchestration with Kubernetes, Docker, and CI/CD pipelines. Collaborate with internal stakeholders and clients to understand requirements, evangelize platform solutions, and ensure successful delivery. Key Skills and Expertise Cloud and DevOps: Expertise in AWS, Azure, GCP, especially VPC design, cloud databases, and serverless architecture. Certified in AWS Professional Solution Architect, AWS ML Specialty, or Azure Solutions Architect Expert (preferred).
Proficient with Kubernetes, Docker, Fluentd, Kibana, Grafana, Prometheus. Data and Streaming: Experience with OLTP/OLAP databases and cloud-native data warehouses like BigQuery, Aurora, Spanner. Hands-on with Kafka, Apache Flink, Spark, Airflow, Databricks, Apache Iceberg, Presto. AI/ML & LLM Expertise: In-depth understanding of LLMs (GPT, Gemini, Claude, Mixtral, Llama, Hugging Face OSS models). LLMOps frameworks: LangChain, LangGraph, Langflow, Flowise, LlamaIndex. ML lifecycle tools: MLflow, SageMaker, Vertex AI, Azure AI, AWS Bedrock. Proven experience in model optimization, fine-tuning, and high-throughput inference systems. Programming Languages: Proficient in Python, SQL, and JavaScript. Preferred Qualifications 10+ years in cloud and AI/ML platform architecture roles. Experience delivering AI solutions for enterprise-scale clients. Hands-on experience with GPU architecture and parallel/distributed training. Strong communication skills with ability to influence technical and business stakeholders. Work on cutting-edge AI technologies and shape future product experiences used by millions. Collaborate with world-class engineers and scientists in a diverse, inclusive culture. Be part of a company that values creativity, innovation, and employee well-being. Adobe is proud to be an Equal Opportunity Employer. We welcome and encourage candidates from all backgrounds to apply.
Machine Learning Engineer 5
Adobe
Machine Learning Engineer 5 Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is transforming the world through digital experiences. From individual creators to global enterprises, our tools empower everyone to create stunning visuals, videos, applications, and more. We believe in fostering an inclusive and innovative workplace where ideas can come from anyone, anywhere, and the next big breakthrough could be yours. The Opportunity Adobe Express enables users, from beginners to professionals, to design eye-catching content with ease, powered by advanced AI technologies. At the heart of this mission is the AI Foundations team, which is building modular, reusable AI systems and the Horizon AI Stack to supercharge the Adobe Express experience. As a founding member of this elite team, you will tackle strategic AI challenges and deliver scalable, high-impact solutions. What You'll Do Drive the vision and development of reusable, cloud-scale AI systems for Adobe Express. Collaborate with Adobe Research, Applied AI teams, and engineering stakeholders to align AI capabilities with product strategy. Lead all aspects of ML product development: data pipelines, model development, and quality evaluations. Recruit, mentor, and manage a high-performing team of ML engineers and data scientists. Balance cutting-edge research with business-critical deliverables: prototyping new ideas, validating, and scaling them. Foster a culture of innovation, experimentation, and scientific rigor focused on solving customer problems. Ensure engineering excellence in architecture, development, and on-time delivery of AI solutions. Basic Qualifications Bachelor's, Master's, or Ph.D. in Computer Science, Machine Learning, Applied Math, Data Science, or related field. 8+ years of industry experience in machine learning, deep learning, or optimization algorithms. 3+ years of experience managing ML/AI teams in a product-focused environment.
Preferred Qualifications Experience working in creative tools, design platforms, or imaging domains. Familiarity with LLMs, Diffusion Models, and generative AI techniques. Proven experience in building large-scale, production-grade ML systems. What You Need to Succeed Demonstrated success in building and managing top-tier ML teams that deliver high-quality solutions. Experience shipping widely adopted AI-driven features to large customer bases. Customer-obsessed with a strong product sense and ability to translate user needs into AI capabilities. Outstanding communication skills, able to articulate technical ideas to both technical and non-technical audiences. Strong problem-solving, leadership, and strategic decision-making abilities. Work at the intersection of creativity and AI, building technologies that empower millions of users globally. Collaborate with world-class researchers and engineers in a mission-driven, inclusive environment. Enjoy robust career growth, leadership opportunities, and excellent employee benefits. Adobe is proud to be an Equal Opportunity Employer. We believe diversity fuels innovation, and we're committed to building an inclusive environment for all employees. Qualification : Bachelor's, Master's, or Ph.D. in Computer Science, Machine Learning, Applied Math, Data Science, or related field.
Software Engineer III - AI/ML, Platforms and Devices
Google Careers
Software Engineer III - AI/ML, Platforms and Devices Company: Google Location: Bengaluru, Karnataka, India Minimum Qualifications: Bachelor's degree or equivalent practical experience. 2 years of experience in software development with one or more programming languages, or 1 year with an advanced degree. 2 years of experience in data structures or algorithms. 1 year of experience in one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field. 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging). Preferred Qualifications: Master's degree or PhD in Computer Science or a related technical field. Experience developing accessible technologies. About the Job Google's software engineers work on cutting-edge technologies that transform how billions of users connect, explore, and interact with information. Our products must handle data at a massive scale, far beyond web search. We seek engineers who bring innovative ideas from various fields, including information retrieval, distributed computing, large-scale system design, networking, data storage, security, artificial intelligence (AI), natural language processing (NLP), UI design, and mobile development. As a Software Engineer, you will work on training and optimizing complex machine learning (ML) models for the Tensor Processing Unit (TPU). By enabling models across diverse applications like camera, speech, Translate, TTS (Text-to-Speech), and others on Edge TPU, you will gain valuable experience in efficient model architectures, optimization techniques, and on-device machine learning at Google. You will also be responsible for managing project priorities, deadlines, and deliverables.
Google's mission is to organize the world's information and make it universally accessible and useful. Our Devices & Services team combines the best of Google AI, software, and hardware to create radically helpful experiences for users. We design and develop new technologies and hardware to make user interactions faster, more seamless, and powerful. Whether advancing form factors, improving interaction methods, or innovating new ways to capture and sense the world around us, our Devices & Services team is helping make people's lives better through technology. Responsibilities Write product or system development code. Collaborate with peers and stakeholders through design and code reviews to ensure best practices (e.g., style guidelines, accuracy, testability, and efficiency). Contribute to documentation or educational content and adapt based on product updates and user feedback. Triage product or system issues, debug, track, and resolve by analyzing the source of issues and their impact on hardware, network, or service operations. Implement solutions in one or more specialized Machine Learning (ML) areas, utilize ML infrastructure, and contribute to model optimization and data processing. Qualification : Master's degree or PhD in Computer Science or a related technical field.
Software Engineer III, Machine Learning
Google Careers
Software Engineer III - Machine Learning, Google One Company: Google Location: Bengaluru, Karnataka, India Minimum Qualifications: Bachelor's degree or equivalent practical experience. 2 years of experience in software development with one or more programming languages, or 1 year with an advanced degree. 2 years of experience with data structures or algorithms in either an academic or industry setting. Preferred Qualifications: Experience with Data Analytics, Privacy Data Handling, Privacy Design Document, and Mathematical Optimization. Understanding of Machine Learning (ML). Excellent software engineering and problem-solving skills through programming. Excellent communication skills. About the Job Google software engineers create next-generation technologies that transform how billions of users connect, explore, and interact with information. Our products must scale to handle vast amounts of data, and our work extends far beyond web search. We are looking for engineers with fresh ideas from a variety of areas including information retrieval, distributed computing, large-scale system design, networking, data storage, security, artificial intelligence (AI), natural language processing (NLP), UI design, and mobile technologies. As a Software Engineer at Google, you will work on projects that are critical to Google's needs, with opportunities to switch teams and projects as our fast-paced business grows. We need engineers who are versatile, show leadership qualities, and are enthusiastic about taking on new problems across the full stack to push technology forward. Google One (G1) is Google's membership service rooted in storage, offering access to premium features. Our team works across multiple product areas to personalize user experiences through Machine Learning. The Platforms and Ecosystems product area encompasses Google's various computing software platforms across environments such as desktop, mobile, and applications.
These products provide enterprises, and ultimately end users, the ability to utilize and manage services at scale. We build innovative software products, from apps to TVs and from laptops to phones, that impact people's lives globally. Responsibilities Write product or system development code. Collaborate with peers and stakeholders through design and code reviews to ensure best practices (e.g., style guidelines, accuracy, testability, and efficiency). Contribute to documentation or educational content and adapt based on product updates and user feedback. Triage product or system issues, debug, track, and resolve them by analyzing the source of issues and their impact on hardware, network, or service operations. Implement solutions in one or more specialized Machine Learning (ML) areas, utilize ML infrastructure, and contribute to model optimization and data processing.
Senior Machine Learning Engineer
Chevron Corporation
Senior Machine Learning Engineer Location: Bengaluru, India Company: Chevron Experience Required: 5-10 Years Department: AI/ML Engineering Work Mode: Hybrid / Global Operations Support About the Role Chevron is actively seeking a Senior Machine Learning Engineer to join our cutting-edge AI team. You will be responsible for designing, building, and optimizing advanced machine learning systems that power transformative applications in artificial intelligence. In this role, you will develop self-learning applications and refine AI systems using robust engineering, statistics, and software design practices. Key Responsibilities: Study and transform data science prototypes into production-ready systems. Design scalable and robust machine learning systems. Research and implement modern ML algorithms and tools. Develop ML applications based on business and technical requirements. Select appropriate datasets, data pipelines, and data representation techniques. Run experiments and evaluate results to fine-tune models. Perform statistical analysis and ML performance optimization. Train and retrain models as new data becomes available. Extend and customize existing ML libraries and frameworks. Stay up to date with the latest ML research and trends. Required Qualifications: 5-10 years of proven experience as a Machine Learning Engineer or in a similar role. Hands-on experience with Azure Machine Learning and MLOps. Strong skills in data structures, data modeling, and software architecture. Deep knowledge of math, probability, statistics, and algorithm design. Proficient in programming languages: Python, R. Familiarity with ML frameworks and libraries such as Keras, PyTorch, scikit-learn. Excellent analytical and problem-solving skills. Strong communication and teamwork abilities. Working Hours Chevron supports international teams.
Work hours are scheduled to align with global collaboration: Work Days: Monday to Friday. Shift Options: 8:00 AM to 5:00 PM or 1:30 PM to 10:30 PM IST. Opportunity to work on impactful ML/AI solutions at enterprise scale. Flexible work culture with global exposure. Advanced tools, infrastructure, and data at your fingertips. Professional growth in a forward-thinking, innovation-driven environment. Equal Opportunity Statement Chevron is an equal opportunity employer and adheres to inclusive hiring practices. All qualified candidates will receive consideration without regard to race, gender, age, religion, nationality, sexual orientation, disability, or any other protected status. Chevron also participates in E-Verify as required by law in applicable jurisdictions. Apply Today
Research Engineer
International Business Machines
Research Engineer Location: Bangalore, Karnataka, India Job Type: Full-Time Experience Level: 0-8 years Company: IBM Research India (IRL) Introduction: IBM Research is the innovation engine of IBM and is the largest industrial research organization in the world. With 12 labs across 6 continents and over 3200 researchers globally, we produce more patents daily than any other organization. At IBM Research India (IRL), we are shaping the future of computing in areas like AI, Hybrid Cloud, and Quantum Computing. Our work is at the forefront of breakthrough innovations in Foundation Models, AI systems, large-scale data engineering, and more. We are looking for top talent to join us in our exciting and dynamic projects, pushing the boundaries of innovation. As a Research Engineer, you will work on pioneering research and development in the most cutting-edge fields of AI and computing. Role Overview: The Research Engineer role at IBM India Research Lab (IRL) involves working on challenging, dynamic, and highly innovative projects in the fields of AI, machine learning, and data systems. Your responsibilities will span multiple areas including optimizing AI models for large-scale distributed systems, pre-training foundation models, and developing real-world use cases that leverage IBM's infrastructure and models. Key Responsibilities: Optimized Runtime Stacks for Foundation Models: Work on fine-tuning, inference serving, and large-scale data engineering for AI models. Focus on multi-stage tuning, reinforcement learning, inference-time compute, and preparing data for complex AI systems. Model Optimization Across Accelerators: Develop solutions to optimize models for multi-accelerator environments, particularly focusing on IBM's AIU accelerator. Work on compiler optimizations, specialized kernels, libraries, and tools to enhance model performance. Pre-training and Deployment of Foundation Models: Participate in pre-training language models and multi-modal foundation models.
Work on distributed training procedures, model alignment, and creating pipelines for various tasks, including LLM-generated data pipelines. Research and Use Case Development: Develop and implement use cases that effectively leverage infrastructure and models to drive real-world value. Contribute to creating frameworks for human-data collection and deploying models on user-centric platforms. Required Education and Experience: Education: A Master's degree in Computer Science, AI, or related fields from a top institution. Experience: 0-8 years of experience working with modern ML techniques, including but not limited to model architectures, data processing, fine-tuning techniques, reinforcement learning, distributed training, and inference optimizations. Technical Skills: Experience with big data platforms such as Ray and Spark. Experience with PyTorch FSDP and Hugging Face libraries. Proficiency in programming with Python or web development technologies. Mindset and Attitude: A growth mindset and pragmatic approach to problem-solving. Preferred Experience: Research Experience: Peer-reviewed research at top machine learning or systems conferences. Advanced Technical Skills: Experience working with torch.compile, CUDA, Triton kernels, GPU scheduling, and memory management. Open Source Contributions: Experience working within open-source communities, contributing to or developing open-source projects. Innovative Environment: Be at the forefront of technological innovation, working on cutting-edge projects in AI, quantum computing, and more. Global Impact: Work on projects that influence both academic research and commercial product development, making a global impact. Career Development: IBM offers abundant opportunities for learning and growth, with access to the latest technologies and research. Collaborative Culture: Work with a diverse team of world-class researchers and engineers in a collaborative, open-source-driven environment.
Apply today and become a part of the team that's redefining innovation. Qualification: A Master's degree in Computer Science, AI, or related fields from a top institution.
Manager, Software Engineering, AIS COE
Bain & Company
What Makes Us a Great Place to Work We are proud to be consistently recognized as one of the world's best places to work, a champion of diversity, and a model of social responsibility. We are currently ranked #1 on Glassdoor's Best Places to Work list, and we have maintained a spot in the top four for the last 13 years. We believe that diversity, inclusion, and collaboration are key to building extraordinary teams. We hire people with exceptional talents, abilities, and potential, then create an environment where you can become the best version of yourself and thrive both professionally and personally. We are publicly recognized by external parties such as Fortune, Vault, Mogul, Working Mother, Glassdoor, and the Human Rights Campaign for being a great place to work for diversity and inclusion, women, LGBTQ, and parents. Who You'll Work With Vector is Bain's integrated digital and analytics capability, bringing together Enterprise Technology and AI, Insights & Solutions (AIS) to deliver cutting-edge innovation. AIS, formed through the merger of Bain's Advanced Analytics and Innovation & Design teams, is a diverse group of experts in analytics, engineering, product management, and design. Together, we create human-centric solutions that leverage the power of data and artificial intelligence to drive competitive advantage for our clients. As part of AIS, the India-based Center of Excellence (COE) specializes in delivering Generative AI (Gen AI) and advanced engineering solutions for our clients globally. The COE will also lead the charge on building solution accelerators for our clients. What You'll Do As a Manager, AIS Software Engineering, you will lead the development and application of technical solutions to address complex problems in various industries. You will guide a diverse engineering team through the entire engineering lifecycle while collaborating with remote and distributed teams, including both technical and consulting teams across the globe. 
Your responsibilities will include designing, developing, optimizing, and deploying cutting-edge software engineering solutions and infrastructure at the production scale required by the world's largest companies. Collaboration: Work closely with colleagues in the AIS COE (e.g., data scientists, ML engineers, software engineers, platform engineers) to build software solutions that solve clients' business problems. Technical Leadership: Serve as the overall technical leader, providing deep expertise in software engineering, distributed systems, AI, and application architecture design. Lead the development and delivery of end-to-end solutions for client cases. Development Lifecycle: Lead key parts of the software development life cycle, including architecture design, writing clean code, conducting code reviews, writing documentation, and identifying issues and resolutions. Framework Development: Collaborate on (or lead) the development of re-usable common frameworks, models, and components to address common software engineering problems across industries and business functions. Best Practices & Innovation: Drive best practices in software engineering and share learnings with team members in AIS. Promote industry-leading innovations to create significant client impact. Team Management: Contribute to creating a great working environment that attracts talented engineers. Act as a PD Advisor as needed and support recruiting and onboarding new team members. Travel Requirements: Minimum travel, based on project needs and training opportunities. About You Education & Experience: Bachelor's/Master's degree in Computer Science, Engineering, or a related technical field. 8+ years of hands-on experience in web development, programming languages, version control, software design patterns, infrastructure, deployment, integration, and unit testing. 3+ years of experience managing software engineers. 
A proven track record of leading a team and collaborating on strategic initiatives, with experience in shipping production applications and software analytics products. Technical Expertise: Expert knowledge (5+ years) of Python. Deep experience with server-side frameworks and technologies such as FastAPI, Node.js, Flask. Familiarity with cloud platforms like AWS, Azure, or GCP. Strong understanding of software architecture, DB design, scalability, and SQL, with experience in RDBMS (e.g., MySQL, PostgreSQL) and NoSQL (e.g., MongoDB, Cassandra, Elasticsearch). Experience with client-side technologies such as React, Vue.js, HTML, and CSS. Familiarity with DevSecOps principles, CI/CD tools, MLOps, LLMOps, and infrastructure as code (Jenkins, Docker, Kubernetes, Terraform). Knowledge of software security, privacy regulations, and cybersecurity. Soft Skills: Strong interpersonal and communication skills, including the ability to explain technical concepts to colleagues and clients from other disciplines. Curiosity, proactivity, and critical thinking. Ability to collaborate effectively with people at all levels and with multi-office/region teams. Ability to work independently, manage multiple priorities, and thrive in a fast-paced and ambiguous environment. About Us Bain & Company is a global consultancy that helps the world's most ambitious change makers define the future. Across 65 cities in 40 countries, we work alongside our clients as one team with a shared ambition to achieve extraordinary results, outperform the competition, and redefine industries. We complement our tailored, integrated expertise with a vibrant ecosystem of digital innovators to deliver better, faster, and more enduring outcomes. Our 10-year commitment to invest more than $1 billion in pro bono services brings our talent, expertise, and insight to organizations tackling today's urgent challenges in education, racial equity, social justice, economic development, and the environment. 
We earned a platinum rating from EcoVadis, the leading platform for environmental, social, and ethical performa...