Azure AI Engineer Jobs in Bengaluru
1130 Jobs Found
Staff Engineer - Software Development
Aviatrix Systems
Staff Engineer - Software Development (Cloud AI & Network Security) Location: Bengaluru Company: Aviatrix Experience Required: 7+ Years About Aviatrix: Aviatrix is a global leader in cloud network security, trusted by over 500 enterprises. We provide a specialized platform for securing multi-cloud environments, giving organizations the control and visibility needed to modernize their cloud strategies. Architectural Focus & Impact As a Staff Engineer, you will architect and deliver advanced AI-driven network security solutions. This role bridges the gap between Distributed Systems (Python/Go), Real-time Telemetry, and LLM-integrated automation to build self-learning, adaptive security infrastructures. Technical Expertise Core Software Engineering: Languages: Deep proficiency in Python and Go (Golang). Distributed Systems: Mastery of Kubernetes, Microservices, and high-scale observability (Prometheus, ELK). Data Pipelines: Experience with real-time stream processing using Kafka, Flink, Kinesis, or Pub/Sub. Networking & Security Domain: Cloud Infrastructure: Expert knowledge of VPC/VNet design, Routing, Load Balancers, and Overlays. Firewall Technologies: Hands-on with Deep Packet Inspection (DPI), NGFW/IDS/IPS, and Cloud-native firewalls (AWS, Azure, GCP). Security Frameworks: Alignment with Zero Trust, NIST CSF, and CIS Benchmarks. AI & Machine Learning Integration: Model Serving: Experience serving ML models via REST or gRPC. Generative AI: Familiarity with LLM integration, RAG (Retrieval-Augmented Generation), LangChain, and vector databases. Key Responsibilities System Architecture: Lead the design of cloud-native microservices for security control planes. AI-Driven Features: Integrate LLMs for Natural Language-to-Firewall Rule translation and automated incident summarization. Technical Leadership: Mentor junior engineers and set high standards through rigorous Design and Code Reviews. Cross-Functional Collaboration: Partner with Data Scientists and Cloud Networking teams to deliver production-grade AI features. Benefits & Why Join Us Regional Package: Comprehensive pension, private medical coverage, and life assurance. Wellbeing: Annual wellbeing stipend and generous holiday allowance. Growth Culture: We value unique career paths and prioritize candidates who are passionate about the intersection of AI and Security.
Manager, Enterprise Applications
Aviatrix Systems
Manager, Enterprise Applications Location: Bengaluru Team: Tech Ops Experience Required: 8+ Years (including 2+ years in Leadership) About Aviatrix: Aviatrix is a leading cloud network security company trusted by over 500 enterprises. Our Cloud Native Security Fabric (CNSF) delivers runtime security and visibility, enabling teams to adopt AI and serverless computing with confidence. The Role: Technical Manager & Architect We are seeking a Manager, Enterprise Applications for a hybrid role that balances hands-on engineering with strategic leadership. You will drive the design of secure, scalable applications while fostering a culture of accountability and excellence within our Bengaluru development team. Technical Expertise Core Technology Stack: Serverless Mastery: Expert-level proficiency with AWS Lambda and AWS Step Functions. Full-Stack Development: Professional experience in Node.js, Python, and React. Data Management: Strong knowledge of Postgres and cloud platforms (AWS, Azure, GCP). Advanced Engineering Skills: AI & LLM Integration: Experience building LLM-powered applications and leveraging AI tools for code generation and architecture exploration. Security & Authentication: Deep understanding of SSO integrations, API design, and distributed systems security. Infrastructure: Familiarity with DevOps, CI/CD, and automated testing frameworks. Key Responsibilities Technical Leadership: System Design: Lead the delivery of complex enterprise applications, making critical strategic trade-offs and proposals. Operational Efficiency: Continuously improve application reliability, performance, and security. Cross-Functional Representation: Act as the primary technical liaison for stakeholders and business partners. People & Team Management: Team Mentorship: Conduct 1:1s, performance reviews, and career development discussions for junior engineers. Execution & Planning: Drive team performance through strategic planning, tracking, and alignment with organizational priorities. Growth: Lead technical hiring to expand the Enterprise Applications team. Benefits & Why Aviatrix Total Rewards: Comprehensive pension, private medical, life assurance, and long-term disability. Wellbeing: Annual wellbeing stipend and generous holiday allowance. Inclusive Environment: We value unique paths. If you are passionate about leading engineers and solving cloud-scale challenges, we want to hear from you.
Erp Engineer
Dozee
ERP Engineer Location: Bengaluru Department: Hardware Employment Type: Full-Time About Dozee Dozee Health AI is a pioneer in AI-powered, contactless Remote Patient Monitoring (RPM) and Early Warning Systems (EWS), driving transformation in healthcare at scale. Headquartered in Bengaluru, Dozee has quickly emerged as India s #1 RPM company. Trusted by leading healthcare providers across India, the USA, and Africa, Dozee s innovative solutions enhance patient safety, improve outcomes, and reduce costs. We aim to Save a Million Lives with Health AI. Role Overview We are looking for an ERP Engineer to manage and optimize Microsoft Dynamics ERP modules, ensuring seamless performance and data integrity across our systems. You will work alongside technical and business teams to implement customizations and integrations that enhance our operational efficiency and scalability. Key Responsibilities ERP Management & Customization Manage and maintain Microsoft Dynamics ERP modules to ensure system stability and optimal performance. Develop and deploy customizations, workflows, and process automations that align with business needs. Integrate ERP with internal systems (CRM, HRMS, Finance) and third-party applications using APIs and data connectors. Collaboration & Implementation Collaborate with cross-functional teams to analyze business requirements and design ERP solutions. Conduct testing, document new features, and support system enhancements. Provide troubleshooting and technical support to ensure minimal system downtime. Training & Optimization Lead end-user training and create user manuals for smooth ERP adoption. Monitor system performance, suggest improvements, and optimize ERP configurations. Work with stakeholders to enhance ERP processes in finance, procurement, inventory, and production. Requirements Experience & Qualifications 3 5 years of hands-on experience with ERP management, customization, and integration. Specific experience with Microsoft Dynamics 365 Business Central / Dynamics NAV is preferred. Bachelor s or Master s degree in a relevant field. Skills Strong understanding of ERP architecture, data structures, and business workflows. Experience with integration tools and middleware such as Azure Integration Services or Power Automate. Familiarity with project management tools like JIRA, Confluence, or Asana. Personal Attributes Strong analytical and problem-solving abilities with high attention to detail. Excellent communication skills and the ability to collaborate with multiple stakeholders. Why Join Dozee Be part of a mission-driven company transforming healthcare with AI. Work with top healthcare providers and cutting-edge technology. Opportunity to drive impactful change in a high-growth, fast-paced environment. Qualification : Bachelors or Masters degree in a relevant field
Staff Software Engineer, Ai & Automation
Okta
Staff Software Engineer AI & Automation Location: Bengaluru Company: Okta, The World s Identity Company Experience: 8+ Years Type: Full-Time About Okta Okta is the world s leading identity platform. We empower people to securely access any technology, anywhere, on any device. With products like the Okta Platform and Auth0, we place identity at the core of business security, enabling growth through safe digital transformation. At Okta, we value diverse perspectives and experiences. We believe in learning, collaboration, and building an inclusive environment where everyone belongs. About the Team The Business Technology - Shared Services team is at the forefront of Okta s internal digital transformation. We focus on building intelligent, automated platforms that simplify operations and deliver smarter, faster experiences to both employees and customers. We collaborate across engineering, data science, security, and business units to deliver cutting-edge solutions powered by Generative AI (GenAI), virtual agents, workflow orchestration, and intelligent recommendations. The Opportunity As a Staff Software Engineer, you ll play a critical role in designing and developing AI-powered platforms that drive automation, scale, and intelligence across Okta s business. You ll help make LLM-powered solutions and intelligent automation a reality for the enterprise ensuring performance, security, and reliability at scale. This is a hands-on, individual contributor (IC) role, ideal for engineers who are passionate about solving complex problems, architecting scalable systems, and pushing the boundaries of AI integration. What You ll Do Design & Build: Develop scalable backend services that embed GenAI and automation into core business workflows (e.g., virtual agents, document intelligence, smart routing). Collaborate Across Teams: Work closely with product managers, data scientists, and other engineers from ideation to production. Architect for Scale: Make key architectural decisions around LLM integration, API design, data flow, and observability. Code with Excellence: Write clean, secure, and maintainable code in Python, Java, or similar languages. Build for Production: Use Docker, Kubernetes, and CI/CD pipelines to build and deploy high-availability services. Champion Best Practices: Promote high standards for testing, security, code reviews, and operational readiness. Mentor & Guide: Support a collaborative team culture through peer mentorship and design reviews. What You ll Bring 8+ years of experience in software engineering with a strong track record of building and maintaining production-grade, cloud-native services. Expertise in distributed systems, API development, and cloud infrastructure (AWS, GCP, or Azure). Proficiency in Python, Java, or Go. Experience with Docker, Kubernetes, and observability tools (e.g., Prometheus, Grafana, ELK). Exposure to AI/ML concepts and eagerness to work with LLMs, NLP, or automation platforms. A strong sense of ownership, collaborative mindset, and a bias toward action. Passion for learning and working with emerging technologies especially in the AI and automation space. Why Join Okta Make AI Real: Help move GenAI from experimentation to enterprise-wide impact. Build with Purpose: Work on challenges that simplify and secure Okta s internal operations. Grow in a Human-Centered Culture: Join a humble, technically driven team that values learning, excellence, and personal growth. Join Okta and shape how identity, AI, and automation come together to power the modern enterprise.
Principal Associate - Full Stack Engineering
Capital One
Principal Associate Full Stack Engineering (GenAI Observability) Location: Bangalore Company: Capital One India About Us At Capital One India, we re tackling some of the most complex problems in financial services using machine learning, advanced analytics, and cloud-first engineering. Our mission is to build cutting-edge, patentable solutions that transform customer experiences, enhance operational efficiency, and ensure robust risk and compliance standards. We re a team of makers, breakers, doers, and disruptors obsessed with turning data into real-world impact at scale. About the Team Machine Learning Experiences (MLX) The MLX team is pioneering the future of model governance, ML observability, and Generative AI infrastructure at Capital One. We re enabling teams to seamlessly deploy ML and GenAI models at scale, with full visibility into performance, health, compliance, and ethical usage. This is the platform powering the next generation of AI-driven financial products across the company. About the Role We re looking for a Principal Associate Full Stack Engineer to lead the development of observability platforms for Generative AI systems. You ll be part of a cross-functional team focused on governance automation, LLM monitoring, and intelligent diagnostics using telemetry data, metadata, and advanced analytics. You ll design systems to collect, analyze, and visualize performance data from our large-scale GenAI infrastructure, helping data scientists and engineers make faster, safer decisions. What You ll Do Lead architecture and development of observability tools and dashboards for monitoring GenAI models and platform health. Design and build core APIs and SDKs to instrument large language models (LLMs) and foundational models (training, fine-tuning, prompting stages). Integrate Generative AI to enable observability features like anomaly detection, predictive analytics, and copilot-assisted troubleshooting. Partner with platform, MLOps, and governance teams to ingest and analyze telemetry, metadata, and runtime metrics at scale. Drive development of tools to ensure compliance with AI ethics, data governance, and industry regulations. Collaborate with product, design, and research to turn complex requirements into scalable, cloud-native software solutions. Lead proof-of-concept initiatives to test and showcase how GenAI can improve platform observability and decision-making. Contribute to the open-source community and stay at the forefront of GenAI and ML infrastructure evolution. Basic Qualifications Bachelor s or Master s degree in Computer Science, Engineering, or related field 4+ years of experience building distributed, data-intensive systems using microservices architecture 4+ years of experience in backend development with Python, Go, or Java 4+ years of expertise with observability stacks (Prometheus, Grafana, ELK) and adapting them for AI systems Strong knowledge of OpenTelemetry, and experience building custom SDKs and APIs 5+ years of hands-on experience with Generative AI models, especially applied to observability, governance, or compliance 2+ years of experience with cloud platforms such as AWS, Azure, or GCP Preferred Qualifications 4+ years building and optimizing ML systems in production environments 3+ years of experience with MLOps tools like MLflow, Kubeflow, or commercial platforms Experience with GenAI frameworks and libraries like LangChain, Haystack, and vector databases (FAISS, Chroma, OpenSearch) Familiarity with emerging observability tools for LLMs such as Langfuse, Phoenix, Helicone, or OpenInference Contributor to open-source GenAI or ML infrastructure projects Author or co-author of published work in AI/ML observability, governance, or performance monitoring Experience with PyTorch, TensorFlow, Spark, or Dask Knowledge of NVIDIA GPU telemetry, CUDA programming, and performance optimization for AI workloads Understanding of AI ethics, data governance, and regulatory frameworks for machine learning systems Why Join Capital One India Work at the intersection of technology, AI, and compliance helping shape the future of responsible AI Join a team driving enterprise-wide adoption of Generative AI Collaborate with world-class engineers, data scientists, and product leaders Enjoy a high-performance culture that encourages innovation, learning, and mentorship Access to cutting-edge tools, open-source contributions, and cloud-native infrastructure Qualification : Bachelors or Masters degree in Computer Science, Engineering, or related field
Senior Ai Engineer
Themathcompany
Job Title: Senior AI Engineer Location: Bengaluru, Karnataka, India Department: GenAI Experience: 4.5 to 7 years Open Positions: 5 About the Role As a Senior AI Engineer, you will design, build, and maintain scalable AI solutions with a strong focus on Generative AI technologies such as large language models (LLMs), embeddings, and retrieval techniques. You will lead a team of AI engineers and collaborate with stakeholders to deliver impactful AI-driven products aligned with business goals. Your role includes mentoring, project planning, ensuring data quality, and driving continuous process improvements. Key Responsibilities Design, develop, and deploy scalable AI/ML solutions, specializing in advanced Generative AI (LLMs, embeddings, retrieval-augmented generation, prompt engineering). Lead, mentor, and develop a team of AI engineers in a collaborative, inclusive environment. Coordinate with stakeholders to gather requirements, prioritize tasks, and define project timelines. Ensure projects align with overall business objectives and data strategies. Oversee data quality, integrity, and security in AI engineering projects. Build reusable frameworks to enhance the efficiency and scalability of AI systems. Manage client communications to translate requirements into technical outcomes. Identify skill gaps and create opportunities for professional development. Drive initiatives for improving data operations and AI delivery efficiency. Required Technical Skills 4.5 to 7 years of experience developing and deploying scalable AI/ML solutions. Strong expertise in data modeling, relational and NoSQL databases, software development lifecycle, unit testing, and functional programming. Proficient in designing and implementing advanced Generative AI solutions including LLMs, embeddings, retrieval techniques, and prompt engineering. Experience designing and optimizing Retrieval-Augmented Generation (RAG) systems. Proficiency with Databricks workflows, including job and cluster management, and API usage. Solid understanding of data structures, algorithms, multiprocessing, and optimization techniques. Skilled in Python libraries such as Pandas, NumPy, FastAPI for data processing and API development. Expertise in SQL optimization and database schema design. Experience deploying AI models using Docker and Kubernetes. Familiarity with version control using GitHub. Hands-on experience with cloud platforms like Azure, AWS, or GCP for AI deployments. Optional experience with PySpark for data processing. Basic understanding of CI/CD pipelines and deployment best practices. Required Non-Technical Skills Strong problem-solving ability with financial impact awareness in both team management and solution delivery. Excellent verbal and written communication skills, comfortable interacting with mid-level client management. Ability to balance pragmatic solutions versus perfect outcomes and rally teams accordingly. Strong interpersonal skills including conflict resolution, empathy, negotiation, and active listening. Demonstrated leadership and mentorship capabilities. Self-motivated with a strong sense of ownership. Good to Have Familiarity with data visualization tools and techniques. Understanding of data security, privacy, governance, and compliance frameworks. Experience with graph databases and graph processing frameworks. Knowledge of data virtualization and federation methods. Skills in data profiling and data quality management. Education Bachelor s degree in Engineering, Computer Science, or a related field. Qualification : Bachelors degree in Engineering, Computer Science, or a related field.
Lead AI/ML Engineer
Synechron
Position Title: Lead AI/ML Engineer Location: Bengaluru Bellandur (GTP) Employment Type: Full-time Job Summary Synechron is seeking a seasoned Lead AI/ML Engineer to lead cutting-edge initiatives in artificial intelligence and machine learning. This role requires deep technical expertise in AI/ML, including deep learning, NLP, computer vision, and generative AI, combined with strong leadership skills to manage teams and drive innovation. You will collaborate closely with stakeholders to deliver high-impact solutions and play a key role in shaping the company s AI-driven digital transformation. Key Responsibilities Lead end-to-end development of AI/ML solutions across use cases involving machine learning, deep learning, computer vision, NLP, and generative AI. Design and implement scalable models and algorithms, ensuring performance, accuracy, and interpretability. Collaborate with product owners, engineers, and business stakeholders to align AI/ML initiatives with strategic objectives. Mentor and guide data scientists and ML engineers to elevate technical quality and delivery standards. Continuously evaluate emerging AI technologies, frameworks, and methodologies to enhance team capabilities. Contribute to innovation strategy by identifying new business opportunities driven by AI/ML. Ensure data and model governance, ethical AI practices, and responsible deployment of AI solutions. Required Skills & Tools Core Competencies: Strong hands-on experience in machine learning, deep learning, NLP, computer vision, and generative AI. Expertise in Python (primary), with additional knowledge of R, SQL, Java, or C++. Deep understanding of statistics, linear algebra, probability, and algorithm design. Frameworks & Tools: TensorFlow, PyTorch, Keras, OpenCV, Hugging Face, spaCy, Scikit-learn, etc. Data processing tools like Pandas, NumPy, and Spark. Version control and CI/CD tools (e.g., Git, MLflow, Docker, Airflow). Methodologies: Proficient in Agile development and model lifecycle management. Experience with productionizing AI models and deploying them on cloud platforms (AWS, Azure, GCP). Experience 8 12 years of experience in AI/ML, Data Science, or related domains. Minimum 8+ years in a leadership or technical lead role driving successful AI/ML initiatives. Proven record of delivering high-impact AI/ML projects at scale. Experience mentoring and managing AI/ML teams in collaborative, cross-functional environments. Day-to-Day Activities Direct and oversee AI/ML model development, validation, deployment, and monitoring. Review project designs, provide technical oversight, and conduct code/model reviews. Drive research and experimentation to apply novel AI methods to solve real-world problems. Support proposal development and client engagements with AI/ML subject matter expertise. Foster a culture of continuous learning, innovation, and data-driven thinking. Qualifications Bachelor s or Master s degree in Computer Science, Data Science, Artificial Intelligence, or a related field. Relevant certifications (e.g., TensorFlow, AWS Machine Learning, Azure AI Engineer) are a plus. Soft Skills Strong leadership, team management, and mentoring capabilities. Excellent communication and stakeholder engagement skills. Strategic thinking with a passion for innovation and emerging technologies. Ability to thrive in a dynamic, fast-paced environment and lead through ambiguity. Diversity & Inclusion at Synechron Synechron is committed to a diverse and inclusive workplace. Through our Same Difference DEI initiative, we celebrate unique backgrounds and perspectives while fostering a respectful, empowering environment. We welcome applications from candidates of all identities and provide support through mentoring, internal mobility, and flexible work arrangements. Qualification : Bachelors or Masters degree in Computer Science, Data Science, Artificial Intelligence, or a related field.
Devops Engineer
Team Vunet Systems
DevOps Engineer Location: Bengaluru, India Experience: 3 - 5 Years Job Type: Full-time About VuNet VuNet is a deep-tech leader in Business Journey Observability, leveraging Big Data and Machine Learning to deliver end-to-end digital experience monitoring for major financial institutions. The platform monitors over 28 billion transactions monthly, powering top banks and enterprises in India and MEA. Work on cutting-edge observability technology Join a Series B funded, award-winning startup recognized by Gartner, Forbes, and NASSCOM Collaborate in a fast-paced, innovative environment focused on learning and growth Access to mental wellness support, health insurance (covering family), and career development programs Role Overview: DevOps Engineer Design, develop, and maintain VuSmartMaps deployments across on-premises, cloud, and hybrid environments Automate deployments using Infrastructure-as-Code (IaC) and CI/CD pipelines Manage cybersecurity assessments and remediations for deployments Collaborate with development teams to improve deployment processes and infrastructure support Publish VuSmartMaps in cloud marketplaces (AWS, Azure, GCP) Stay current on DevOps, CI/CD, infrastructure orchestration, cybersecurity, AI workflows, and big data technologies Key Responsibilities Develop and maintain IaC frameworks enabling flexible VuSmartMaps deployment Build and manage CI/CD pipelines using GitHub Actions, Jenkins Monitor infrastructure, conduct cybersecurity testing, and manage patching Improve deployment efficiency and customer experience Collaborate cross-functionally for seamless integration and rollout Must-Have Skills 3+ years building/managing CI/CD pipelines (GitHub Actions, Jenkins) Certified/experienced in Kubernetes, Docker, Terraform, Helm, YAML Hands-on experience with GitOps workflows Knowledge of web servers (Nginx, Django), identity providers (Active Directory, LDAP), load balancers (Traefik) Experience with databases (PostgreSQL, Elasticsearch, Hadoop stack) and secrets management (Key Vault) Familiarity with cloud services (AWS, Azure, GCP) across IaaS, PaaS, SaaS layers Strong Linux and scripting skills (Bash, Python) Excellent communication skills for cross-team collaboration Good-to-Have Skills Exposure to Red Hat OpenShift, VMware, Ansible, Chef, Puppet Familiarity with container orchestration tools (Podman, Docker Swarm, Nomad) Experience optimizing dockerized microservices and container images Benefits Comprehensive health insurance covering you and your family Mental health and 1:1 counseling support Learning culture focused on innovation and career growth Inclusive, transparent workplace culture Access to new Gen AI tools and integrated tech workspace Career development and skill enhancement programs
Artificial Intelligence Engineer
In4velocity
Artificial Intelligence Engineer Experience: 3 - 8 Years Location: Bangalore (Work from Office) Job Overview We are seeking a talented and experienced Artificial Intelligence Engineer to join our dynamic team. You will be responsible for designing, training, and deploying advanced AI and machine learning models to solve complex business problems. The ideal candidate will have strong expertise in Python, deep learning frameworks, and handling large datasets, coupled with the ability to integrate AI solutions seamlessly into existing systems. Technical Skills Proficiency in programming languages such as Python, R, or Java. Strong experience with machine learning algorithms (regression, classification, clustering). Hands-on expertise with deep learning frameworks like TensorFlow and PyTorch. Skilled in data manipulation, cleaning, and preprocessing techniques. Experience integrating AI models into existing applications and systems. Familiarity with cloud platforms (AWS, Azure, GCP) for AI model deployment. Preferred Skills Proficiency in Python with NLP libraries such as OpenAI s GPT, LangChain, etc. Knowledge of Large Language Models (LLMs) and fine-tuning methodologies. Experience with Retrieval-Augmented Generation (RAG) to enhance AI responses. Ability to develop and host APIs (preferably on IIS). Basic understanding of SQL and expertise in training, tuning, and deploying models leveraging SQL databases. Benefits Flexible working hours Learning and development opportunities Medical and insurance benefits Company Core Values Positive attitude and collaborative approach to achieving team goals. Clear and respectful communication skills (verbal and written). Emphasis on teamwork over individualism it s all about the we , not the me **. Growth mindset with a commitment to continuous learning and skill improvement. About In4Velocity Since 2004, In4Velocity has been a trusted partner for real estate, construction, and infrastructure companies, helping streamline their operations through innovative technology. Our flagship product, In4Suite , provides a comprehensive, unified ecosystem connecting all aspects of real estate organizations for a full 360-degree view. Supported by a powerful Business Intelligence system and unparalleled global support, our product is the go-to platform for digital transformation in real estate development and construction management worldwide. Join us to be part of a pioneering force driving innovation and progress in the real estate domain.
Senior Frontend Engineer
Cognite
Senior Frontend Engineer Atlas AI Location: Bengaluru (Rathi Legacy, Rohan Tech Park, Hoodi) Team: Product Engineering Employment: Full-Time | Hybrid About Cognite Cognite is a global SaaS leader innovating with AI and data to solve industrial challenges. Our products like Cognite Atlas AI and Cognite Data Fusion (CDF) enable transformational digitalization for Oil & Gas, Chemicals, Pharma, Manufacturing, and Energy sectors. Award-winning and recognized globally, we are shaping the future of industrial operations. About Atlas AI Atlas AI aims to revolutionize manufacturing and energy operations by building an advanced framework of industrial AI agents. This initiative leverages Cognite Data Fusion s industrial data expertise and targets collaboration with industry leaders and strategic partners to drive impactful AI innovation. Your Role Join our co-innovation team as a Senior Frontend Engineer focused on building next-gen AI-driven industrial agent applications. Architect and develop modern, responsive web applications with React and TypeScript. Build full-stack generative AI solutions, focusing on AI agents and multi-agent systems powered by large language models integrated via Cognite Data Fusion. Collaborate with architects, data engineers, and domain experts to create scalable AI agent solutions. Implement software development best practices including Git workflows, CI/CD, and robust testing. Work directly with customers and stakeholders to co-innovate and align solutions with real business needs. Contribute to product strategy and technical decision-making. Your Experience & Skills 5+ years in product software engineering, ideally with a focus on Generative AI/ML applications. Strong frontend skills: React, TypeScript, JavaScript; full-stack experience including Python. Experience with multi-agent systems development, preferably using frameworks like LangChain. Knowledge of knowledge graphs and related tech (GraphQL, Neo4j). Familiarity with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes). Experience with REST APIs, debugging tools, and software lifecycle best practices. Comfortable working with data formats (CSV, JSON, SQL) and databases like SQLite or SQL databases. Strong problem-solving, communication, and collaboration skills. Work in a vibrant, inclusive environment with 70+ nationalities and a strong DEI focus. Hybrid work mode from a modern office in Bengaluru with a culture that fosters impact and ownership. Direct access to decision-makers with minimal bureaucracy. Collaborate with top-tier talent on ambitious AI and industrial digital transformation projects. Engage actively in the Cognite community through HUB events and partnerships. Make an Impact Join Cognite to drive the future of industrial AI, creating innovative tools that empower industries worldwide. We welcome diverse candidates passionate about AI and frontend engineering.
Backend Engineer - Rag & Ml Specialisation
Sarvam
Backend Engineer - RAG & ML Specialization Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a cutting-edge generative AI startup based in Bengaluru, India, on a mission to make AI accessible and impactful for Bharat. We develop high-performance, cost-effective AI agents tailored to the Indian market, empowering enterprises to unlock new opportunities and create meaningful customer connections. Join us as we reshape AI for India and beyond. Role Overview As a Backend Engineer specializing in RAG (Retrieval-Augmented Generation) systems and Machine Learning (ML) applications, you'll be building scalable backend systems that power AI-driven services. Your work will be critical in developing high-performance platforms for voice and generative AI applications, ensuring secure, scalable, and seamless AI model deployments. Key Responsibilities Backend Development: Design, develop, and maintain scalable, efficient backend applications and RESTful APIs using Python and FastAPI. RAG System Implementation: Build and optimize Retrieval-Augmented Generation (RAG) systems for AI applications, focusing on enhancing AI-driven search and retrieval capabilities. Data Pipeline Management: Develop and manage data pipelines and workflows for integrating AI and ML models into production systems. Code Quality: Ensure adherence to coding best practices, including writing modular code, implementing unit tests, and conducting code reviews. Cross-functional Collaboration: Work closely with AI/ML engineers, data scientists, and other teams to integrate machine learning models into backend systems. Database Optimization: Optimize database queries and efficiently manage both structured and unstructured data. CI/CD Practices: Continuously integrate and deploy code, using version control systems like Git and CI/CD pipelines. System Architecture: Contribute to architectural discussions and improvements, focusing on scalability and performance optimization. Must-Have Skills & Qualifications Educational Background: Bachelor's degree in Computer Science, Engineering, or a related technical field. Programming Skills: Strong proficiency in Python, with a solid understanding of programming fundamentals. Web Frameworks: Experience building backend services using FastAPI, Flask, or Django. Database Knowledge: Familiarity with SQL operations and NoSQL databases for efficient data management. AI & ML Exposure: Hands-on experience with Machine Learning and Deep Learning techniques, and understanding of AI model deployment in production environments. RAG Systems Experience: Prior exposure to Retrieval-Augmented Generation (RAG) architectures, with experience building AI-driven search systems. Version Control: Proficiency with Git and understanding of version control workflows. Problem Solving: Strong analytical and debugging skills to address complex technical challenges. Soft Skills: Excellent communication, collaboration, and problem-solving abilities. Good to Have (Preferred Experience) Backend Projects: Demonstrated experience working on backend applications using Python frameworks (FastAPI, Flask, Django) through academic or personal projects. Cloud Knowledge: Basic understanding of cloud platforms and services such as AWS, GCP, or Azure. DevOps & Containers: Exposure to Linux/Unix environments and containerization concepts (Docker, Kubernetes). CI/CD: Experience setting up CI/CD pipelines for automated testing and deployment. Open Source Contributions: Contributions to open-source projects or a strong GitHub profile showcasing backend development expertise. Impactful Work: Work on groundbreaking generative AI applications that are transforming the future of technology in India. Collaborative Environment: Join a high-performing team of AI experts and engineers, driving innovation and delivering real-world solutions. Growth Opportunities: Be a key player in a fast-growing AI startup, with the opportunity to grow alongside the company. Cutting-edge Technologies: Leverage the latest in AI, Machine Learning, and Cloud Technologies to build state-of-the-art systems. Qualification : Bachelor's degree in Computer Science, Engineering, or a related technical field.
Machine Learning Engineer - Speech Ai (asr & Tts)
Sarvam
Machine Learning Engineer - Speech AI (ASR & TTS) Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a pioneering generative AI startup headquartered in Bengaluru, India. We specialize in leading transformative research and development in speech and language technologies. Focused on building state-of-the-art ASR (Automatic Speech Recognition) and TTS (Text-to-Speech) models, particularly for Indic languages, we aim to redefine human-computer interaction with cutting-edge, AI-driven solutions. Join us as we push the boundaries of Speech AI to create inclusive, scalable, and intelligent voice-based applications for diverse communities worldwide. Role Overview We are seeking an experienced Machine Learning Engineer specializing in Speech AI (ASR & TTS). The ideal candidate will work on deep learning-based ASR and TTS models, improving accuracy, efficiency, and multilingual capabilities while deploying them at scale. The role involves developing and optimizing speech recognition and synthesis models with a focus on low-resource languages, real-time inference, and scalability. If you have a passion for speech processing and deep learning, this is a great opportunity to make a significant impact in a rapidly growing field. Key Responsibilities ASR (Automatic Speech Recognition) Develop, train, and optimize speech-to-text models using state-of-the-art architectures like Wav2Vec, Whisper, Conformer, and DeepSpeech. Implement techniques for low-latency ASR inference, including beam search, language model integration, and real-time transcription. Improve speech recognition accuracy for low-resource languages, especially Indic languages, using transfer learning and data augmentation. Optimize ASR pipelines for noise robustness, speaker adaptation, and domain-specific transcription. TTS (Text-to-Speech) Develop and fine-tune neural TTS models such as Tacotron, FastSpeech, VITS, or WaveNet for high-quality, natural-sounding speech synthesis. Implement multilingual and expressive TTS models with prosody and emotion control. Optimize TTS inference for deployment on edge devices, mobile, and cloud platforms. Improve speech synthesis quality through voice cloning, neural vocoders (HiFi-GAN, WaveGlow), and prosody modeling. General Speech AI Responsibilities Benchmark and profile ASR/TTS models to improve latency, efficiency, and deployment performance. Deploy scalable speech AI APIs on AWS, Azure, or GCP for real-world applications. Optimize ASR & TTS models for edge and offline inference. Stay updated with the latest advancements in speech AI, neural vocoders, and real-time inference techniques. Must-Have Qualifications Experience: 2-3 years of experience in speech AI, deep learning, or machine learning, with a focus on ASR & TTS. Education: Bachelor s or Master s degree in Computer Science, AI/ML, Speech Processing, or a related field. ML Frameworks: Proficiency in PyTorch or TensorFlow for training and deploying ASR/TTS models. ASR Expertise: Experience with speech-to-text architectures like Whisper, Wav2Vec, Conformer, or DeepSpeech. TTS Expertise: Experience with speech synthesis models like Tacotron, FastSpeech, or VITS. Speech Signal Processing: Understanding of MFCCs, STFT, phonemes, prosody modeling, and feature extraction. Inference Optimization: Hands-on experience with TensorRT, ONNX, or quantization (INT8, FP16) for ASR/TTS. Cloud & Edge Deployment: Experience deploying speech models on AWS, GCP, or Azure. Preferred Qualifications Experience with speech diarization, speaker recognition, or language modeling for ASR. Familiarity with zero-shot TTS, voice cloning, and multilingual speech modeling. Understanding of CUDA optimization and low-bit quantization for ASR/TTS models. Contributions to open-source speech AI projects or a strong GitHub portfolio showcasing relevant work. Experience with real-time streaming ASR/TTS applications and low-latency inference. Innovative Impact: Work on AI-driven speech solutions that are changing how people interact with technology, especially in low-resource languages. Cutting-Edge Technology: Contribute to the development of state-of-the-art speech AI models in a rapidly advancing field. Collaborative Environment: Work with a team of experts in AI, machine learning, and speech processing, in a startup culture. Growth Opportunities: Sarvam.ai offers exciting career growth in a fast-paced environment with opportunities for personal and professional development. Interested candidates are invited to submit their resume, cover letter, and any relevant project portfolios or GitHub links showcasing their experience in ASR, TTS, or Speech AI. Strong AI-related projects, whether in industry, research, or personal work, will be highly valued. Qualification : Bachelors or Masters degree in Computer Science, AI/ML, Speech Processing, or a related field.
Digital - Technology Specialist - Azure & Ai
Microsoft
Digital - Technology Specialist - Azure & AI Location: Bangalore, Karnataka, India Employment Type: Full-Time About the Role At Microsoft, the Small Medium Enterprises and Channel (SME&C) team is at the forefront of driving AI-powered global sales. We are empowering businesses of all sizes through the transformative power of Microsoft technologies. In this dynamic role, you will engage directly with customers and partners, leveraging your expertise in Azure and AI technologies to scale solutions and drive business outcomes. SME&C is more than a sales organization it s a vibrant, innovative community. By joining us, you ll be part of a high-growth, customer-obsessed team dedicated to redefining how businesses adopt technology for growth and innovation. Key Responsibilities Scale Customer Engagements: Work with customer technical decision-makers to anticipate needs, gather data, and drive technical discussions that lead to successful outcomes for Azure & AI technologies. Engage Through Partners: Collaborate with partners and internal resources to facilitate technical engagements and overcome blockers. Build Strategy: Contribute to strategy development by sharing competitive insights and feedback from customer sessions, shaping how Microsoft solutions can drive customer success. Solution Design and Proof: Demonstrate and apply Microsoft solutions to customer challenges through architectural design sessions (ADS), proof of concept (POC), and solution demonstrations. Technical Leadership: Build your domain knowledge, conduct training, and act as a mentor within the community to grow technical expertise and enhance customer engagements. Qualifications Required/Minimum Qualifications: 3+ years of technical pre-sales or technical consulting experience, OR Bachelor's Degree in Computer Science, Information Technology, Engineering, or related field with 4+ years of technical experience. Relevant certifications in Microsoft or competitive platforms, such as Microsoft Office 365, Power BI, Azure, or Cloud Platform Technologies. Additional or Preferred Qualifications: 7+ years of experience in technical pre-sales, technical consulting, or related fields. 4+ years of hands-on experience with cloud, hybrid, or on-premises infrastructure, architecture designs, migrations, and industry standards. Expertise in Azure and AI technologies with a strong ability to craft and deliver customized solutions to customers. Join us in a collaborative, fast-paced, and digital-first environment where your contributions will have a direct impact on the success of businesses globally. At Microsoft, we foster a culture of inclusion, continuous learning, and innovation. Employee Benefits Industry-leading healthcare coverage Generous paid time off and family leave policies Access to learning and development resources Employee discounts and savings programs Maternity and paternity leave Global networking and community engagement opportunities Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability status, or any other characteristic protected by law. Qualification : Bachelor's Degree in Computer Science, Information Technology, Engineering, or related field with 4+ years of technical experience.
Ai Platform Architect
Adobe
AI Platform Architect Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is changing the world through digital experiences. Whether you're an emerging artist or a global brand, our tools empower creativity and innovation across every screen. From powerful imaging and video solutions to immersive web and app design, Adobe s mission is to help people and businesses deliver exceptional digital experiences. We are committed to creating an inclusive workplace where everyone is respected and given equal opportunity. Innovation can come from anywhere and the next big idea could be yours. Job Description We are looking for a visionary AI Platform Architect with deep expertise in building and scaling cloud-native, AI-powered platforms. The ideal candidate will have experience deploying large-scale, customer-facing AI solutions and a deep understanding of modern cloud architecture, data systems, MLOps, and LLMOps. Responsibilities Design and develop scalable AI/ML platforms and pipelines across AWS, Azure, and GCP. Architect end-to-end LLM pipelines including model training, fine-tuning, serving, inference APIs, and monitoring. Lead cross-functional teams in delivering AI solutions from experimentation to production. Implement MLOps and LLMOps best practices using tools like MLFlow, SageMaker, Langchain, and LangGraph. Design GPU-optimized architectures for training and inference of LLMs using DeepSpeed, vLLM, and other modern frameworks. Support infrastructure automation and container orchestration with Kubernetes, Docker, and CI/CD pipelines. Collaborate with internal stakeholders and clients to understand requirements, evangelize platform solutions, and ensure successful delivery. Key Skills and Expertise Cloud and DevOps: Expertise in AWS, Azure, GCP especially VPC design, cloud databases, and serverless architecture. Certified in AWS Professional Solution Architect, AWS ML Specialty, or Azure Solutions Architect Expert (preferred). Proficient with Kubernetes, Docker, FluentD, Kibana, Grafana, Prometheus. Data and Streaming: Experience with OLTP/OLAP databases and cloud-native data warehouses like BigQuery, Aurora, Spanner. Hands-on with Kafka, Apache Flink, Spark, Airflow, Databricks, Apache Iceberg, Presto. AI/ML & LLM Expertise: In-depth understanding of LLMs (GPT, Gemini, Claude, Mixtral, Llama, Hugging Face OSS models). LLMOps frameworks: Langchain, Langgraph, Langflow, Flowise, LLamaIndex. ML lifecycle tools: MLFlow, SageMaker, Vertex AI, Azure AI, AWS Bedrock. Proven experience in model optimization, fine-tuning, and high-throughput inference systems. Programming Languages: Proficient in Python, SQL, and JavaScript. Preferred Qualifications 10+ years in cloud and AI/ML platform architecture roles. Experience delivering AI solutions for enterprise-scale clients. Hands-on experience with GPU architecture and parallel/distributed training. Strong communication skills with ability to influence technical and business stakeholders. Work on cutting-edge AI technologies and shape future product experiences used by millions. Collaborate with world-class engineers and scientists in a diverse, inclusive culture. Be part of a company that values creativity, innovation, and employee well-being. Adobe is proud to be an Equal Opportunity Employer. We welcome and encourage candidates from all backgrounds to apply.
Machine Learning Engineer
Test Company
Machine Learning Engineer Full-Time - Bengaluru, India - Data Science / Artificial Intelligence / Engineering Join our dynamic Data Science / Artificial Intelligence / Engineering team in Bengaluru, India as a Full-Time Machine Learning Engineer and play a key role in driving data-driven innovation! We are seeking a skilled and results-oriented Machine Learning Engineer to design, build, and deploy scalable machine learning models that address real-world business challenges. You will collaborate closely with data scientists, engineers, and product managers to transform raw data into actionable insights and integrate intelligent features into our products. As a Machine Learning Engineer, you will be responsible for the complete lifecycle of machine learning models and pipelines, from design and development to seamless deployment for a variety of applications. This includes classification, regression, clustering, recommendation systems, and time-series forecasting. You will leverage your expertise to preprocess and analyze large and complex datasets, extracting meaningful features and valuable insights. Collaboration with cross-functional teams will be crucial as you identify strategic ML opportunities and define clear success metrics. A key aspect of this role involves optimizing machine learning models for peak performance, scalability, and accuracy within production environments. You will build robust APIs or efficient microservices to integrate these models seamlessly into our applications, utilizing tools such as Flask or FastAPI. Continuous improvement is paramount, and you will be responsible for the ongoing monitoring and retraining of models based on their performance and any signs of data drift. Staying at the forefront of the field is essential, and you will be expected to stay updated with the latest ML research and emerging technologies, applying them to continuously enhance our product capabilities. Key Responsibilities: Design, develop, and deploy machine learning models and pipelines for diverse applications including classification, regression, clustering, recommendation, and time-series forecasting. Preprocess and analyze large datasets to extract meaningful features and actionable insights. Collaborate effectively with cross-functional teams to identify strategic ML opportunities and define clear success metrics. Optimize models for maximum performance, scalability, and accuracy in production environments. Build robust APIs or efficient microservices to integrate ML models into applications using tools like Flask or FastAPI. Continuously monitor and retrain models based on performance metrics and potential data drift. Stay updated with the latest ML research and technologies and apply them to enhance product capabilities. Minimum Qualifications: Bachelor s or Master s degree in Computer Science, Data Science, Statistics, or a related field. 2+ years of proven experience as a Machine Learning Engineer or in a similar role. Strong proficiency in Python and key ML libraries such as Scikit-learn, XGBoost, TensorFlow, or PyTorch. Practical experience working with both SQL and NoSQL databases. Solid knowledge of essential data preprocessing, effective feature engineering, and robust model evaluation techniques. Familiarity with standard software engineering practices, including version control (Git), thorough code reviews, and efficient CI/CD pipelines. Preferred Qualifications: Prior experience with deep learning, natural language processing (NLP), or computer vision. Familiarity with major cloud services like AWS, GCP, or Azure (especially SageMaker, Vertex AI, etc.). Understanding of modern MLOps tools and practices (e.g., MLflow, Kubeflow, DVC). Practical experience with containerization and orchestration tools (Docker, Kubernetes). Knowledge of big data tools (e.g., Spark, Hadoop) is considered a significant plus. What We Offer: Competitive salary and performance-based incentives to reward your contributions. Comprehensive health insurance and valuable wellness benefits to support your well-being. Dedicated learning and development programs for continuous professional growth. Exciting opportunities to work on impactful, real-world AI/ML projects with significant scale. A collaborative, inclusive, and innovative work culture that fosters teamwork and creativity. Flexible working hours and a hybrid work model to promote a healthy work-life balance.
Ai Ml Engineer
Wipro Limited
AI/ML Engineer - Bengaluru, India Experience: 4 to 6 years About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global leader in technology services and consulting. Operating in 65 countries with over 230,000 employees, Wipro helps organizations achieve digital transformation through innovative, future-ready solutions. Role Overview We are hiring an experienced AI/ML Engineer to work on advanced machine learning and Generative AI solutions. You will design prompt engineering pipelines, develop ML models using frameworks like TensorFlow and PyTorch, and deliver scalable backend services using Python and Django. Key Responsibilities Design and implement GenAI-based use cases using prompt engineering techniques Build REST APIs and scalable microservices using Python and Django Evaluate GenAI model performance and implement guardrails Deploy applications to Azure or AWS cloud environments Lead prompt lifecycle management including tuning, templating, and optimization Ensure integration with databases and external APIs Use CI/CD tools for continuous deployment (Azure DevOps, Jenkins, Ansible, Terraform) Mandatory Skills Python, Django, REGEX AI/ML, Deep Learning, NLP TensorFlow, PyTorch Generative AI, LLMs, RAG Pipelines REST API development Preferred Skills API Gateways: WSO2, KONG, nginx Apache HTTP Server Azure DevOps, Ansible, Jenkins, Terraform Databricks Additional Requirements Understanding of OOP, design patterns, and scalable architecture Familiarity with Docker and version control tools like GitHub or Bitbucket Team leadership in prompt engineering and stakeholder engagement Knowledge of secure prompt design to mitigate injection or leakage risks Join a forward-thinking digital transformation company where your ideas shape the future. We value diversity, innovation, and continuous learning. Applications from individuals with disabilities are highly encouraged.
Gen Ai Engineer - L1
Wipro Limited
Gen AI Engineer - L1 | Bengaluru, India About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global IT consulting and services company with over 230,000 employees across 65 countries. We offer innovative solutions in consulting, engineering, and operations to solve complex digital transformation challenges. Role Overview We are looking for an experienced Gen AI Engineer - L1 with deep expertise in Generative AI, LLMs, RAG pipelines, and Python-based machine learning frameworks. The role will focus on developing secure, scalable AI systems using modern tools and cloud platforms. Key Responsibilities Design, implement, and optimize generative AI models using LangChain, LLaMA, Hugging Face, etc. Develop RAG pipelines and integrate with LLMs for advanced AI solutions. Create, test, and optimize prompt templates across different base models. Implement guardrails for prompt security to prevent prompt injection, jailbreaks, and leaks. Build efficient backend applications using Python, Django, and related tools. Work with vector databases to enhance generative AI workflows. Collaborate on data grooming and model training across business units. Benchmark model performance and develop auto-prompting systems. Ensure adherence to minimum design standards in prompt engineering use cases. Mandatory Skills Gen AI, LLMs, RAG Pipelines LangChain, LLaMA, Hugging Face Python, TensorFlow, PyTorch, Django NLP, Machine Learning, Deep Learning Vector Database integration Preferred Skills Azure or AWS Cloud Platforms MLOps, Kubernetes GitHub, Bitbucket Experience with GPT-4 Domain exposure in Banking or Financial Services Join Wipro to be part of a company that thrives on innovation, reinvention, and digital excellence. Work on impactful GenAI projects, grow your career, and contribute to shaping the future of AI in real-world applications. We welcome applications from individuals with disabilities.
Senior Machine Learning Engineer
Chevron Corporation
Senior Machine Learning Engineer Location: Bengaluru, India Company: Chevron Experience Required: 5 10 Years Department: AI/ML Engineering Work Mode: Hybrid / Global Operations Support About the Role Chevron is actively seeking a Senior Machine Learning Engineer to join our cutting-edge AI team. You will be responsible for designing, building, and optimizing advanced machine learning systems that power transformative applications in artificial intelligence. In this role, you will develop self-learning applications and refine AI systems using robust engineering, statistics, and software design practices. Key Responsibilities Study and transform data science prototypes into production-ready systems Design scalable and robust machine learning systems Research and implement modern ML algorithms and tools Develop ML applications based on business and technical requirements Select appropriate datasets, data pipelines, and data representation techniques Run experiments and evaluate results to fine-tune models Perform statistical analysis and ML performance optimization Train and retrain models as new data becomes available Extend and customize existing ML libraries and frameworks Stay up to date with the latest ML research and trends Required Qualifications 5 10 years of proven experience as a Machine Learning Engineer or in a similar role Hands-on experience with Azure Machine Learning and MLOps Strong skills in data structures, data modeling, and software architecture Deep knowledge of math, probability, statistics, and algorithm design Proficient in programming languages: Python, R Familiarity with ML frameworks and libraries such as Keras, PyTorch, scikit-learn Excellent analytical and problem-solving skills Strong communication and teamwork abilities Working Hours Chevron supports international teams. Work hours are scheduled to align with global collaboration: Work Days: Monday to Friday Shift Options: 8:00 AM 5:00 PM or 1:30 PM 10:30 PM IST Opportunity to work on impactful ML/AI solutions at enterprise scale Flexible work culture with global exposure Advanced tools, infrastructure, and data at your fingertips Professional growth in a forward-thinking, innovation-driven environment Equal Opportunity Statement Chevron is an equal opportunity employer and adheres to inclusive hiring practices. All qualified candidates will receive consideration without regard to race, gender, age, religion, nationality, sexual orientation, disability, or any other protected status. Chevron also participates in E-Verify as required by law in applicable jurisdictions. Apply Today
Engineer, Principal/manager - Machine Learning, Ai
Qualcomm India Private Limited
Engineer, Principal/Manager - Machine Learning, AI Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary Qualcomm is seeking an experienced and visionary Principal AI/ML Engineer to lead research, development, and optimization of AI inference systems. This role involves developing high-performance AI models, optimizing deployments across various hardware platforms, and contributing to research in model compression, quantization, and hardware-aware optimization. Education & Experience PhD with 6+ years, Master's with 7+ years, or Bachelor's with 8+ years in Engineering, CS, or related field. 20+ years of experience in AI/ML development; 5+ years in inference optimization and debugging. Key Responsibilities Model Optimization & Quantization Optimize models using quantization (INT8, INT4, mixed precision), pruning, and knowledge distillation. Implement PTQ and QAT techniques for deployment. Experience with TensorRT, ONNX Runtime, OpenVINO, TVM. AI Hardware Acceleration & Deployment Target platforms: Hexagon DSP, CUDA GPUs, TPUs, NPUs, FPGAs, Habana Gaudi, Apple Neural Engine. Use Python APIs: cuDNN, XLA, MLIR for hardware acceleration. Benchmark and debug performance across platforms. AI Research & Innovation Research on efficient AI inference: model compression, low-bit precision, sparse computing. Explore architectures like Sparse Transformers, Mixture of Experts, Flash Attention. Publish in ML conferences: NeurIPS, ICML, CVPR; contribute to open-source projects. Technical Expertise Optimization of LLMs, LMMs, LVMs for inference. Deep Learning frameworks: TensorFlow, PyTorch, JAX, ONNX. Expert in CUDA, cuPy, Numba, TensorRT, ONNX Runtime, OpenVINO. Skilled in Python for scalable AI development. Experience with ML runtime delegates: TFLite, ONNX, Qualcomm AI Stack. Debugging: Netron, TensorBoard, PyTorch Profiler, Nsight, perf, Py-Spy. Cloud inference: AWS Inferentia, Azure ML, GCP AI Platform, Habana Gaudi. Hardware-aware optimization: oneDNN, ROCm, MLIR, SparseML. Contributions to open-source and research publications are a strong plus. Leadership & Collaboration Lead a team of engineers in Python-based AI inference and optimization. Collaborate with researchers, software engineers, DevOps, and hardware vendors. Define debugging, deployment, and performance tuning best practices.
Lead Full-stack Engineer (client Facing Role)
Bain & Company
Job Title: Cloud-Based AI Developer - Advanced Analytics Group (AAG) Company: Bain & Company Job Type: Full-Time Employment Type: Permanent What Makes Us a Great Place to Work: We are proud to be consistently recognized as one of the world s best places to work, a champion of diversity, and a model of social responsibility. We are currently ranked #1 on Glassdoor's Best Places to Work list, and we have maintained a spot in the top four for the last 13 years. Diversity, inclusion, and collaboration are key to building extraordinary teams. We hire people with exceptional talents, abilities, and potential, creating an environment where you can thrive both professionally and personally. We are publicly recognized for being a great place to work for diversity and inclusion, women, LGBTQ, and parents. Who You ll Work With: As a member of Bain s Advanced Analytics Group (AAG), you will work alongside generalist consultants to help clients across industries solve their biggest problems using expertise in data science, customer insights, statistics, machine learning, data management, supply chain analytics, and data engineering. AAG team members hold advanced degrees in computer science, engineering, AI, data science, physics, statistics, mathematics, and other quantitative disciplines, with backgrounds in tech, data science, marketing analytics, and academia. We are committed to building a diverse and inclusive team and encourage candidates of all backgrounds to apply. What You ll Do: As a member of the AAG, you will be responsible for designing, developing, and maintaining cloud-based AI applications that provide high-quality, scalable, and secure solutions for our clients. Your work will encompass the full stack, from API design to deployment, delivering analytics solutions across various sectors. Cloud-Based AI Development: Design, develop, and maintain cloud-based AI applications, ensuring scalability and security, leveraging full-stack technology solutions. Cross-Functional Collaboration: Work with product managers, data scientists, and other engineers to define and implement analytics features that meet business requirements. Cloud and Containerization: Use Kubernetes and containerization technologies to deploy, manage, and scale applications in cloud environments for optimal performance. API & Microservices Development: Develop and maintain APIs and microservices to expose analytics functionality, adhering to industry best practices for design and documentation. Security and Compliance: Implement robust security measures to protect sensitive data and ensure compliance with data privacy regulations. Troubleshooting and Performance Monitoring: Continuously monitor and troubleshoot application performance, resolving issues impacting system reliability and user experience. Code Reviews and Best Practices: Participate in code reviews and contribute to the establishment of coding standards to ensure high-quality, maintainable code. Emerging Trends and Technologies: Stay current with emerging trends in cloud computing, data analytics, and software engineering to enhance the platform s capabilities. Collaboration with DevOps: Work with DevOps and infrastructure teams to automate deployment and release processes, optimizing the development workflow. Client Collaboration: Collaborate closely with business consulting teams to assess opportunities and develop analytics solutions across sectors. Education and Influence: Influence and educate clients on analytics application engineering capabilities, supporting their teams directly. Travel: Expect occasional travel (30%) for project work. About You: Required Qualifications: Education: Master s degree in Computer Science, Engineering, or a related technical field. Experience: 3+ years of experience at Senior or Staff level, or equivalent. Expertise in client-side technologies such as React, Angular, Vue.js, HTML, and CSS. Experience with server-side technologies such as Django, Flask, and Fast API. Proficiency with cloud platforms (AWS, Azure, GCP) and Terraform automation (good to have). 3+ years of expertise in Python. Experience using Git for version control and collaboration. Familiarity with DevOps, CI/CD, and tools like GitHub Actions. Demonstrated interest in LLMs, prompt engineering, and Langchain. Experience with workflow orchestration tools such as dbt, Beam, Airflow, Luigi, Metaflow, Kubeflow, or similar. Experience in the implementation of large-scale structured or unstructured databases, as well as containerization technologies like Docker and Kubernetes. Skills and Knowledge: Strong interpersonal and communication skills to explain complex engineering topics to colleagues and clients from various disciplines. Curiosity, proactivity, and critical thinking. Solid computer science fundamentals in data structures, algorithms, automated testing, object-oriented programming, performance complexity, and software architecture. Expertise in designing API interfaces and knowledge of data architecture and database schema design. Familiarity with agile development methodologies. Join Bain & Company: Become a part of a forward-thinking team committed to solving complex problems, building innovative solutions, and delivering impactful data analytics and AI solutions. Collaborate with talented professionals and gain valuable experience that shapes the future of data analytics and AI. Qualification : Masters degree in Computer Science, Engineering, or a related technical field.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted