Prompt Engineering Jobs in Bengaluru
908 Jobs Found
MTS - Software Development (Cloud AI Network Security Developer)
Aviatrix Systems
MTS - Software Developer (Cloud AI Network Security Developer) Location: Bengaluru Company: Aviatrix Experience: 1-3 years About Aviatrix: Aviatrix is a cloud network security leader trusted by over 500 enterprises. We specialize in securing multi-cloud environments, offering runtime protection and advanced control for modern cloud infrastructures. Role Strategy & Impact In this role, you will build next-generation intelligent cloud network security solutions. You will focus on developing Python/Go microservices that fuse network visibility with LLM-driven insights to redefine cloud firewall capabilities. Technical Requirements Core Competencies: Development: Professional experience in Go (Golang) or Python. Cloud Networking: Fundamentals of Routing, NAT, VPNs, and Subnets. Security: Understanding of Firewall concepts (ACLs) and Zero Trust architecture. AI Integration: Experience using AI/LLM APIs (OpenAI, Vertex AI, etc.). Data Infrastructure: Workflows involving Kafka, data ingestion, and stream processing. Cloud Ecosystem: Hands-on familiarity with AWS, Azure, or GCP. Preferred Qualifications: Network Observability: Experience with NetFlow, IPFIX, or VPC Flow Logs. Modern DevOps: Hands-on with Kubernetes, Container Networking, and Terraform. Generative AI: Knowledge of Prompt Engineering or RAG-based systems. Key Responsibilities Control Plane Development: Build services for firewall rules and policy orchestration. AI Workflows: Integrate LLM-based assistants for anomaly detection and alert summarization. Telemetry Pipelines: Maintain high-performance data pipelines for security event metrics. Security Logic: Design logic for threat pattern recognition and posture scoring. Benefits & Why Join Us Global Benefits: Private medical, pension, and life assurance. Work-Life Balance: Generous holiday allowance and annual wellbeing stipend. Growth Mindset: We value diverse paths; if you are passionate about AI and Security, we want to hear from you.
AI Native Product Manager
Leap Finance
AI Native Product Manager Location: Bengaluru Type: Full-Time Experience Required: 2+ Years in PM (with a B2C focus) Role Overview: Architecting the Future of AI We are seeking a visionary AI Native Product Manager to drive innovation within our core product suite. You will leverage deep expertise in LLMs, Generative AI, and AI Prompting to translate market trends and user needs into a cohesive, high-impact product strategy. This is a role for a "tinkerer" who understands the technical nuances of AI and the psychological drivers of B2C users. The Ideal Candidate Profile AI Specialization: Technical Fluency: Deep knowledge of Large Language Models (LLMs), Gen AI workflows, and advanced Prompt Engineering. Hands-on Tinkering: A genuine passion for AI advancements and a history of experimenting with cutting-edge AI tools. Product Leadership: Metrics Ownership: Proven track record of owning and successfully driving Head Metrics for a digital product. B2C Expertise: Essential experience in B2C product management, focusing on user behavior and retention. GTM Strategy: Experience with Product Launches and Go-to-Market (GTM) strategies is a significant advantage. Agile Mastery: Expert understanding of Agile methodologies and end-to-end product lifecycle management. Key Responsibilities Strategy & Discovery: Visionary Roadmap: Develop and adapt a clear product vision based on market analysis and evolving AI trends. Requirements Engineering: Translate stakeholder needs into precise product specs, user stories, and acceptance criteria. Execution & Growth: Cross-functional Leadership: Lead engineering, design, and marketing teams to ensure seamless execution of the roadmap. End-to-End Development: Manage scope, timelines, and resources to deliver high-quality AI-driven features. Launch & Iterate: Execute successful product launches and monitor post-launch performance to inform continuous improvements. 
Data-Driven Decisions: Use data analytics and user behavior insights to track KPIs and pivot strategies proactively. Core Competencies Communication: Ability to convey complex AI concepts to non-technical stakeholders clearly. Problem Solving: A proactive, hands-on approach to tackling the unique challenges of non-deterministic AI outputs. Quantitative Mindset: Comfortable with advanced analytics tools to validate product hypotheses.
AI Agent Engineer
Observe.ai Networks Private Limited
AI Agent Engineer Location: Bengaluru About Us: Observe.AI Observe.AI is the leading AI agent platform for customer experience. We enable enterprises to deploy AI agents that automate customer interactions, delivering natural conversations with predictable outcomes. Our platform combines advanced speech understanding, workflow automation, and enterprise-grade governance to execute end-to-end workflows with AI agents. We also empower teams to guide and augment human agents with AI copilots, while analyzing 100% of interactions for insights, coaching, and quality management. Leading companies like DoorDash, Affordable Care, Signify Health, and Verida rely on Observe.AI to accelerate service speed, increase operational efficiency, and strengthen customer loyalty across all channels. We're looking for an AI Agent Engineer to take the lead in building and deploying enterprise-grade Voice, Chat AI agents, and AI Copilot solutions. This role is hands-on, customer-facing, and crucial for bringing AI solutions to life, from design and integration to deployment and optimization. As an AI Agent Engineer, you'll own the end-to-end lifecycle of AI agents: from building and integrating them to testing, deploying, and tuning performance to meet client requirements. What You'll Be Doing: AI Agent Development & Deployment: Take full ownership of building and deploying AI agents, including designing prompts, workflows, integrations, telephony setup, and evaluation forms. Client Engagement & Demos: Lead weekly client demos, showcase progress, gather feedback, and act as the primary technical contact once the solution is defined. Systems Integration: Configure and integrate APIs, handle data mappings, manage authentication, error handling, and connect AI agents to CRMs, databases, or knowledge systems. Telephony Integration: Set up and optimize SIP/CCaaS/PSTN routing, configure fallbacks, pass metadata, and troubleshoot call quality issues. 
Optimization & Iteration: Continuously monitor agent performance, refine prompts, conduct iterative tests, and ensure agents meet automation and containment targets. Strategic Consultation: Translate customer requirements into actionable solutions, while working consultatively to unblock challenges related to security, connectivity, or knowledge ingestion. Collaboration with Engineering: Work alongside the product and engineering teams for deeper technical fixes and platform improvements, while leading client delivery independently. What You'll Bring to the Role: 3+ years of experience in conversational AI, ML engineering, or system integration, with hands-on delivery of AI/LLM-based solutions. Strong expertise in prompt engineering, workflow building, API integration, and telephony systems (SIP, Twilio, Amazon Connect, etc.). Familiarity with Large Language Models (GPT, Claude, Gemini) and orchestration frameworks like LangChain and LlamaIndex. Solid ML knowledge in areas such as embeddings, retrieval-augmented generation (RAG), evaluation frameworks, and fine-tuning models for optimal performance. Proficiency in programming languages such as Python, JavaScript, or similar. Customer-facing experience, with the ability to lead deep technical discussions and conduct weekly project demos. A strong problem-solving mindset, with the ability to find workarounds, unblock integrations, and adapt to unique customer ecosystems. Bachelor's degree in Computer Science, Engineering, or a related technical field. Experience with Integration Platform-as-a-Service (iPaaS) providers such as n8n, Zapier, or similar, and a strong understanding of API integrations and data flow management. Extensive hands-on experience with telephony integrations, including protocols like SIP, PSTN, and other telephony technologies. Perks & Benefits: Medical Insurance: Comprehensive medical coverage and free online doctor consultations. 
Generous Leave Policies: Annual privilege and sick leave (as per Karnataka S&E Act), national and festive holidays, plus parental leave. Learning & Development Fund: Support for continuous learning and professional development. Fun Team Culture: Regular fun events to foster a collaborative and engaging work environment. Flexible Benefit Plans: Tax-saving benefits (meal cards, PF, etc.) and other flexible benefit options. Qualification: Bachelor's degree in Computer Science, Engineering, or a related technical field
Technical Project Manager - AI Delivery
Exotel
Technical Project Manager - AI Delivery Location: Bengaluru Employment Type: Full-time About Us Exotel is the leading full-stack customer engagement platform and business-focused virtual telecom operator for emerging markets. Founded in 2011, Exotel powers over 50 million daily engagements across voice, video, and messaging channels. Our cloud-based solutions are trusted by over 6,000 companies in more than 60 countries, including major players like Ola, Swiggy, Flipkart, GoJek, Byju's, HDFC Bank, Zomato, and Urban Company. We are a Series D company valued at $100 million, with $60 million in ARR, and we provide communication APIs, a modern omnichannel contact center, and a conversational AI platform hosted on the cloud. About the Role Exotel is looking for a Technical Project Manager - AI Delivery to oversee the end-to-end delivery of complex AI-driven projects. You'll be responsible for managing the complete project lifecycle from initiation to closure, ensuring that all deliverables meet quality standards, customer requirements, and timelines. As the primary customer interface, you will manage expectations, resolve issues proactively, and ensure successful AI-first implementations for enterprise customers. In this role, you will also contribute technical insights, provide support to your team, and help drive continuous improvement while maintaining a customer-centric focus. Responsibilities Project Planning & Execution: Develop detailed project plans, including scope, objectives, timelines, budgets, and resource allocation. Track progress, ensure deliverables meet quality standards, and ensure timely delivery. Customer Interface: Act as the primary point of contact for customers on assigned projects. Conduct regular status meetings, manage customer expectations, and ensure that their needs are consistently met. 
Scope & Requirement Management: Work closely with customers to define and document project requirements, manage scope changes, and ensure alignment with product capabilities. Risk Management: Identify potential project risks, develop mitigation strategies, and ensure timely issue resolution. Escalate risks to the Lead Project Manager when necessary. Cross-Functional Coordination: Coordinate daily activities with delivery and engineering teams, ensuring that technical tasks align with project timelines. Facilitate smooth handovers to support engineers post-delivery. Reporting & Stakeholder Communication: Prepare and present regular project status reports to the Lead Project Manager and other internal stakeholders. Methodology Adherence: Ensure all project activities adhere to AI delivery methodologies and best practices, optimizing workflows and processes. Mentorship: Provide mentorship to junior project team members, fostering a high-performing, value-driven organizational culture. Ownership: Take ownership of business satisfaction through the tested deployment of solutions and by consistently delivering on project objectives. Experience: 5+ years of project management experience, preferably in software or SaaS delivery, with a proven track record of managing complex projects from initiation to closure. Technical Knowledge: Strong understanding of integration methods for CRMs and APIs. Familiarity with cloud systems, architecture, networking, and deployment methodologies. AI/ML Knowledge: Familiarity with AI/ML, NLP, or conversational AI concepts is a plus. Requirements Gathering: Experience in gathering and translating customer requirements into actionable business use cases. Customer Management: Ability to run customer meetings, manage expectations, and handle change requests effectively. Technical Expertise: Strong understanding of Linux, networking, databases, message queues, and caching. 
GenAI Exposure: Hands-on experience with GenAI technologies, such as prompt engineering and Large Language Model (LLM) applications. Soft Skills: Excellent time management, communication, and interpersonal skills. Strong organizational and problem-solving abilities. Customer-Centric: Proactive, customer-focused mindset, ensuring timely issue resolution and high-quality delivery. General Skills Lead the implementation and testing of GenAI projects, ensuring alignment with customer requirements and business goals. Coordinate with pre-sales, product, and support teams to set expectations and deliver according to timelines. Ensure adherence to SLAs, proactively resolve delivery bottlenecks, and maintain a smooth delivery pipeline. Mentor junior engineers and uphold high-quality standards in all project deliverables. Innovation at Scale: Work on cutting-edge AI and communication technologies impacting millions of people daily. Growth & Impact: Be part of a rapidly growing company with ample opportunities for career development and personal growth. Collaborative Culture: Join a passionate, supportive, and high-performing team where collaboration and innovation are core values. Competitive Benefits: Enjoy comprehensive health insurance, mental wellness support, and a robust benefits package. If you are an experienced Project Manager with a strong technical background and a passion for AI-driven solutions, we'd love to hear from you. Apply Now to join the Exotel team as a Technical Project Manager - AI Delivery and play a key role in transforming customer engagement across emerging markets.
Operations Executive
Intugine Technologies
Operations Executive Location: Bengaluru Work Type: Full-Time Role Summary As an Operations Executive, you will be integral in managing day-to-day business operations, ensuring seamless workflows, coordinating across departments, and supporting management in driving organizational success. Key Responsibilities Oversee daily operational activities to ensure efficient workflow and timely task completion. Coordinate with internal teams to facilitate smooth communication and project execution. Monitor operational performance and recommend process improvements to boost efficiency. Maintain accurate records, reports, and documentation related to operations. Assist in developing and implementing policies, procedures, and enhancements. Manage vendor relations, procurement, and inventory tracking as needed. Prepare and present regular operational reports to management. Troubleshoot operational challenges and provide prompt solutions. Requirements Graduate degree in Engineering, Supply Chain, or related fields. 0-1 years of experience in B2B or SaaS implementation preferred. Ability to balance attention to detail with a strategic, big-picture mindset. Strong communication and interpersonal skills to engage diplomatically across all levels. Understanding of customer/client requirements. Excellent soft skills including time management, prioritization, and delegation. Knowledge of Supply Chain Management (SCM) is a plus. Creative thinker with energy to introduce new ideas and innovations. Self-motivated, responsible, and capable of working independently. Highly organized with the ability to manage multiple tasks efficiently. Qualification: Graduate degree in Engineering, Supply Chain, or related fields
Senior Data Scientist - LLM
5c Network Pvt. Ltd.
Position: Senior Data Scientist - LLM Location: Bangalore, Karnataka, India Type: Full-Time (On-site) Experience Required: 2+ years in Deep Learning, 1+ years in LLMs Industry: Healthcare AI Company Overview: 5C Network is pioneering multi-modal AI systems for autonomous diagnosis in medical imaging. We're building next-generation models that integrate deep learning with language understanding to revolutionize clinical workflows and diagnostic accuracy. Role Summary: As a Senior Data Scientist - LLM, you will lead the development and deployment of Large Language Models (LLMs) focused on enhancing medical imaging diagnostics. You'll work on cutting-edge problems such as instruction fine-tuning, prompt engineering, and Retrieval-Augmented Generation (RAG), while ensuring scalability and robustness in production environments. Key Responsibilities: LLM Development & Fine-Tuning Design and optimize prompts for diverse clinical and imaging-related use cases. Perform instruction fine-tuning of LLMs to meet task-specific requirements. Develop reasoning pipelines including Chain of Thought (CoT) techniques for complex diagnostic workflows. LLM Deployment & Optimization Self-host and deploy LLMs in secure, scalable production environments. Apply quantization and other performance optimization methods to minimize compute and memory footprint. Ensure high performance, uptime, and security in AI deployments. Retrieval-Augmented Generation (RAG) & Vector Databases Develop and implement RAG pipelines by integrating LLMs with semantic search. Work with vector databases (e.g., Qdrant) to enable fast, efficient retrieval of contextual data. Optimize data storage, indexing, and retrieval to support clinical applications. Data Engineering & Annotation Build and manage high-quality datasets tailored for CoT and multi-step reasoning tasks. Lead data annotation efforts to enhance LLM understanding of medical contexts. 
Collaboration & Research Collaborate with researchers, ML engineers, and domain experts to bring LLM solutions from prototype to product. Stay ahead of the curve by experimenting with novel LLM architectures and emerging techniques. Qualifications: Bachelor's or Master's degree in Computer Science, Data Science, AI, or a related field. 2+ years of hands-on experience in deep learning. Minimum 1 year of experience working with LLMs (e.g., instruction tuning, prompt engineering, RAG). Prior experience in LLM deployment (self-hosting, optimization, quantization, scaling). Proficient in Python and common ML frameworks (e.g., PyTorch, Hugging Face Transformers). Familiarity with vector databases like Qdrant or similar. Strong interest or prior exposure to healthcare/medical AI. Excellent problem-solving, communication, and team collaboration skills. Technical Stack: Languages: Python Frameworks: PyTorch, Hugging Face Technologies: LLMs (e.g., GPT, LLaMA), Vector Databases (Qdrant), RAG, Quantization Tools: Docker, Kubernetes, REST APIs, Git Work on high-impact AI solutions in the healthcare domain. Collaborate with a team at the forefront of multi-modal diagnostic technology. Access cutting-edge tools and real-world data to drive innovation. Qualification: Bachelor's or Master's degree in Computer Science, Data Science, AI, or a related field.
Senior AI Engineer
TheMathCompany
Job Title: Senior AI Engineer Location: Bengaluru, Karnataka, India Department: GenAI Experience: 4.5 to 7 years Open Positions: 5 About the Role As a Senior AI Engineer, you will design, build, and maintain scalable AI solutions with a strong focus on Generative AI technologies such as large language models (LLMs), embeddings, and retrieval techniques. You will lead a team of AI engineers and collaborate with stakeholders to deliver impactful AI-driven products aligned with business goals. Your role includes mentoring, project planning, ensuring data quality, and driving continuous process improvements. Key Responsibilities Design, develop, and deploy scalable AI/ML solutions, specializing in advanced Generative AI (LLMs, embeddings, retrieval-augmented generation, prompt engineering). Lead, mentor, and develop a team of AI engineers in a collaborative, inclusive environment. Coordinate with stakeholders to gather requirements, prioritize tasks, and define project timelines. Ensure projects align with overall business objectives and data strategies. Oversee data quality, integrity, and security in AI engineering projects. Build reusable frameworks to enhance the efficiency and scalability of AI systems. Manage client communications to translate requirements into technical outcomes. Identify skill gaps and create opportunities for professional development. Drive initiatives for improving data operations and AI delivery efficiency. Required Technical Skills 4.5 to 7 years of experience developing and deploying scalable AI/ML solutions. Strong expertise in data modeling, relational and NoSQL databases, software development lifecycle, unit testing, and functional programming. Proficient in designing and implementing advanced Generative AI solutions including LLMs, embeddings, retrieval techniques, and prompt engineering. Experience designing and optimizing Retrieval-Augmented Generation (RAG) systems. 
Proficiency with Databricks workflows, including job and cluster management, and API usage. Solid understanding of data structures, algorithms, multiprocessing, and optimization techniques. Skilled in Python libraries such as Pandas, NumPy, FastAPI for data processing and API development. Expertise in SQL optimization and database schema design. Experience deploying AI models using Docker and Kubernetes. Familiarity with version control using GitHub. Hands-on experience with cloud platforms like Azure, AWS, or GCP for AI deployments. Optional experience with PySpark for data processing. Basic understanding of CI/CD pipelines and deployment best practices. Required Non-Technical Skills Strong problem-solving ability with financial impact awareness in both team management and solution delivery. Excellent verbal and written communication skills, comfortable interacting with mid-level client management. Ability to balance pragmatic solutions versus perfect outcomes and rally teams accordingly. Strong interpersonal skills including conflict resolution, empathy, negotiation, and active listening. Demonstrated leadership and mentorship capabilities. Self-motivated with a strong sense of ownership. Good to Have Familiarity with data visualization tools and techniques. Understanding of data security, privacy, governance, and compliance frameworks. Experience with graph databases and graph processing frameworks. Knowledge of data virtualization and federation methods. Skills in data profiling and data quality management. Education Bachelor's degree in Engineering, Computer Science, or a related field. Qualification: Bachelor's degree in Engineering, Computer Science, or a related field.
Technical Support Engineer I / Technical Support Engineer II
Zeta
Job Title: Technical Support Engineer I / II Location: Bengaluru, India Job Type: Full-time About Zeta Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch cutting-edge financial products. Founded by Bhavin Turakhia and Ramki Gaddipati in 2015, Zeta's flagship platform, Tachyon, is the world's first cloud-native, fully API-enabled processing stack. It brings together issuing, processing, lending, fraud and risk, core banking, and more into a unified offering. With over 20 million cards issued globally and trusted by some of the largest banks and fintechs, Zeta is redefining the banking infrastructure landscape. Backed by SoftBank, Mastercard, and other marquee investors, Zeta is valued at $1.5 billion and employs over 1700 professionals, with more than 70% in R&D. About the Role As a Technical Support Engineer I/II at Zeta, you'll be part of the Banking Technology Support team, responsible for troubleshooting, resolving, and escalating issues related to Zeta Tachyon, a platform with 100+ APIs, multiple customer-facing interfaces, and extensive enterprise-grade infrastructure. This role offers the opportunity to work with high-performing engineering and product teams, support enterprise clients, and grow your career in the B2B SaaS + Fintech space. Responsibilities Customer Support: Provide first-level technical support to clients, resolving issues efficiently and maintaining high customer satisfaction. Incident Management: Monitor incoming support tickets, emails, and calls. Prioritize and manage based on urgency and business impact. Troubleshooting: Diagnose basic technical issues with banking systems, apps, or APIs using internal tools and knowledge bases. Documentation: Contribute to knowledge base articles, FAQs, and process documents to enhance self-service and internal efficiency. Escalation Handling: Escalate complex issues to L2/L3 teams with thorough documentation and coordinate for resolution. 
Collaboration: Work cross-functionally with engineers, business analysts, and system admins to resolve customer concerns. Compliance & Security: Ensure adherence to security, privacy, and regulatory standards when handling sensitive customer data. Required Skills & Competencies Strong problem-solving and debugging abilities. Excellent written and verbal communication skills; able to explain technical concepts to non-technical users. Customer-centric mindset with a focus on delivering prompt, quality service. Comfortable working in fast-paced, multi-tasking environments. Familiarity with: JIRA, Postman, Kibana, Grafana, Splunk (must-have). Exposure to ticketing systems and knowledge base platforms is a plus. Willingness to learn new tools and technologies in banking and payments. Experience & Qualifications Bachelor's degree in Computer Science, IT, or related engineering fields. 2.6+ years of overall experience in technical support roles within enterprise or banking technologies. At least 1 year of hands-on experience supporting enterprise-grade software products or platforms. Prior experience in the banking/payments/fintech domain is an advantage. Be part of one of the most innovative fintech platforms globally. Opportunity to work closely with industry leaders and high-growth enterprise clients. A culture of growth, learning, and empowerment. Equal Opportunity Employer Zeta is an equal opportunity employer committed to diversity and inclusion. We celebrate differences and are proud of our inclusive culture. Candidates from all backgrounds are encouraged to apply. Qualification: Bachelor's degree in Computer Science, IT, or related engineering fields.
Associate Architect, AI
Aptean
Job Title: Associate Architect, AI Location: Bangalore, India Employment Type: Full-Time, Regular Overview At Aptean, we design industry-specific ERP and business software that helps our customers transform operations and gain a competitive edge. With over 90 products, 4,500+ employees, and a global presence, we deliver solutions that are anything but generic. Now, we're building on that legacy by integrating cutting-edge AI into everything we do, and we want you to be part of it. We are looking for an Associate Architect, AI to play a key role in shaping Aptean's AI-first future. This is a hands-on, high-impact role focused on the design, development, and deployment of AI-powered solutions including smart chatbots, Retrieval-Augmented Generation (RAG) pipelines, predictive analytics, and autonomous AI agents. If you're passionate about solving real-world problems using the latest in LLMs, AI orchestration, and data automation, we want to talk. Key Responsibilities Design and implement AI-led automation solutions, leveraging technologies like agentic AI and LLMs. Develop and refine structured prompts to improve AI model responses and contextual accuracy. Integrate Azure OpenAI and Microsoft Fabric AI Skills to enhance conversational AI in Fabric Lakehouse environments. Build and optimize RAG pipelines and semantic models for improved AI-driven content ingestion and understanding. Design prompts for AI agents, chatbots, and virtual assistants that automate business workflows. Research and implement best practices to minimize hallucinations, biases, and inconsistencies in AI-generated outputs. Collaborate on curating, cleaning, and preparing training datasets for machine learning models. Use Python and tools like LangChain to prototype and deploy AI-enabled solutions across multiple platforms. Monitor AI system performance, scalability, and business alignment; continuously iterate based on data and user feedback. 
Stay up to date with advancements in NLP, large language models (LLMs), and generative AI technologies. Qualifications Education: Bachelor's degree in Computer Science or related field (required); Master's degree (preferred). Experience: 8-12 years of experience in AI, machine learning, or data science roles. Strong track record of delivering AI-driven solutions from concept to deployment. Technical Skills and Competencies Hands-on experience with LLMs (e.g., GPT, Claude, Gemini, LLaMA) and AI APIs. Proficient in Python and ML frameworks such as TensorFlow, PyTorch, Keras, Scikit-learn. Experience with Microsoft Fabric, Fabric AI Skills, and Lakehouse/Medallion architecture. Familiarity with OpenAI API, LangChain, Azure, JSON, Java, JavaScript. Strong foundation in NLP concepts, prompt engineering, and model behavior analysis. Experience developing and deploying chatbots across platforms like Microsoft Teams, SharePoint, and web portals. Understanding of statistical models (e.g., regression, clustering) and ML algorithms (e.g., decision trees, neural networks). Strong written and verbal communication skills; able to interpret and explain AI behavior clearly. Innovative thinker with a startup mindset. Comfortable in a fast-paced, collaborative environment. Passionate about AI and its transformative potential. Focused on real outcomes and user-centric design. At Aptean, we believe in growth through innovation, and AI is at the core of that vision. You'll work on high-impact projects that stretch your capabilities, in a company that values curiosity, collaboration, and continuous learning. If you're excited to lead in the age of AI, this is your moment. Diversity & Inclusion at Aptean Aptean is committed to building a workplace where everyone belongs. We value diversity in all its forms and believe our differences make us stronger. We strive to create an inclusive environment where everyone has the opportunity to grow, succeed, and contribute their best work. 
Qualification: Bachelor's degree in Computer Science or related field (required)
Senior Manager Data Science, Data Modelling & Analytics
Merkle B2B
Job Title: Senior Manager Data Science, Data Modelling & Analytics Location: Bengaluru Department: Insights & Analysis About the Role: As a Senior Manager, you will lead a team of data scientists and analysts, driving the development and deployment of advanced analytics solutions that enable data-driven decision-making. This role blends strategic leadership with hands-on technical expertise, playing a critical part in delivering impactful insights and analytics across the organization. Key Responsibilities: Hands-On Technical Contribution: Design, develop, and deploy advanced machine learning models and statistical analyses to address complex business challenges. Utilize Python, R, SQL, and other tools to manipulate data and build predictive models. Manage end-to-end data pipelines including collection, cleaning, transformation, and visualization. Collaborate with IT and data engineering teams to integrate analytics solutions into production environments. Provide thought leadership on analytics solutions and metrics aligned with business needs. Team Leadership & Development: Lead, mentor, and manage a team of data scientists and analysts, fostering collaboration and innovation. Guide career development, conduct performance evaluations, and promote skill enhancement. Encourage continuous learning and adoption of best practices in data science methodologies. Strategic Planning & Execution: Collaborate with senior leadership to define and execute data science strategy aligned with business goals. Identify and prioritize high-impact analytics projects that deliver business value. Ensure timely and quality delivery of analytics solutions balancing scope and resources. Client Engagement & Stakeholder Management: Act as primary point of contact for clients, translating business challenges into data science solutions. Lead client presentations, workshops, and discussions, effectively communicating complex analytical concepts. 
Build and maintain strong client relationships, managing expectations and deliverables. Deliver regular reports and dashboards to senior management and stakeholders. Bridge communication between technical teams and business units to align analytics initiatives with organizational objectives. Cross-Functional Collaboration: Work closely with Business Intelligence, Market Analytics, and Data Engineering teams to integrate analytics into business processes. Translate complex insights into actionable recommendations for non-technical stakeholders. Facilitate data-driven workshops and presentations across the organization. Collaborate with support functions to provide timely leadership updates on operational metrics. Governance & Compliance: Ensure compliance with data governance policies and data privacy regulations (e.g., GDPR, PDPA). Implement best practices for data quality, security, and ethical analytics use. Stay abreast of industry trends and regulatory changes affecting data analytics. Qualifications: Education: Bachelor's or Master's degree in Data Science, Computer Science, Statistics, Mathematics, or related field. Experience: 12+ years in advanced analytics, data science, data modelling, machine learning, or related fields. 5+ years in leadership roles managing analytics teams and projects. Experience in BFSI, Hi-Tech, Retail, or Healthcare industries preferred. Experience with media data is a plus. Technical Skills: Proficiency in Python, R, SQL. Experience with data visualization tools like Tableau, Power BI. Familiarity with big data platforms (Hadoop, Spark) and cloud services (AWS, GCP, Azure). Strong knowledge of machine learning frameworks and libraries. Soft Skills: Excellent analytical and problem-solving skills. Strong communication and interpersonal abilities. Ability to influence and drive organizational change. Strategic thinker focused on business outcomes. 
Desirable Expertise: Advanced Analytics Techniques: Descriptive Analytics: Statistical analysis, data visualization. Predictive Analytics: Regression, time series forecasting, classification, market mix modelling. Prescriptive Analytics: Optimization, simulation modelling. Text Analytics: NLP, sentiment analysis. Machine Learning Techniques: Supervised Learning: Linear/logistic regression, decision trees, random forests, gradient boosting, SVMs. Unsupervised Learning: Clustering, PCA, anomaly detection. Reinforcement Learning: Q-learning, deep Q-networks. Generative AI & Large Language Models (Good to Have): Experience with GPT, Gemini, LLAMA, etc. for text generation, summarization, conversational agents. Hyperparameter tuning, prompt engineering, embeddings, fine-tuning. Additional Skills: Proficiency with Tableau or Power BI (advanced visualization). Strong data management, structuring, and harmonization skills.
Solution Architect Industrial Agents
Cognite
Solution Architect Industrial Agents Location: Bengaluru Team: Global Strategic Services Architecture Type: Full-Time | Hybrid About Cognite Cognite is a global SaaS leader transforming industrial operations with cutting-edge AI and data solutions. Our key offerings, including Cognite Atlas AI and Cognite Data Fusion (CDF), help industrial companies solve complex problems, improve efficiency, and make data-driven decisions at scale. Recognized as the 2024 Microsoft Energy & Resources Partner of the Year and the 2022 Technology Innovation Leader for Global Digital Industrial Platforms, we are reshaping the future of digital transformation across sectors like Oil & Gas, Manufacturing, Chemicals, Pharma, and Energy. Our Values Impact: We focus on delivering meaningful, real-world outcomes. Ownership: We take responsibility, embrace inclusivity, and step outside our comfort zones. Relentless: We pursue innovation with energy and integrity; never ruthless, always responsible. Role Overview As a Solution Architect Industrial Agents, you will lead the design and development of advanced AI agent frameworks built on Cognite Data Fusion, Atlas AI, and cutting-edge generative AI technologies. You'll enable clients to deploy intelligent agents that autonomously manage and optimize complex industrial systems. This role requires deep expertise in AI, multi-agent systems, and data integration, with a focus on building repeatable, scalable, and production-ready solutions. You'll work closely with engineering, product, customer success, and global delivery teams to ensure successful implementations that create real customer value. What You'll Do Architect robust, scalable solutions that integrate Cognite platforms with AI agent frameworks. Design multi-agent systems using technologies like LangChain to solve real-world industrial problems. Lead the development of Retrieval-Augmented Generation (RAG) systems and intelligent prompt engineering. 
Implement and optimize vector databases (e.g., Pinecone, Weaviate, Faiss) for advanced semantic search and retrieval tasks. Guide data modeling and integration using Python, SQL, REST APIs, and Cognite's SDKs. Provide technical leadership and mentor engineers across AI and platform teams. Work with cross-functional teams to define requirements, develop prototypes, and launch production-ready solutions. Stay ahead of AI trends, especially in generative AI, agent orchestration, reinforcement learning, and time series analysis. Collaborate with product and delivery teams to shape Cognite's evolving AI product suite. What You Bring 10+ years in software engineering, including 5+ years in AI and 2+ years in generative AI or intelligent systems. Proven hands-on experience with multi-agent systems (especially using LangChain or similar frameworks). Strong grasp of RAG architecture, knowledge graphs, and graph databases (e.g., Neo4j, RDF). Proficiency in Python (must-have), and optionally JavaScript or Java. Experience in vector database architecture, embedding creation, and high-performance similarity search. Deep understanding of LLMs, fine-tuning techniques, and prompt engineering. Familiarity with cloud environments (AWS, Azure), CI/CD, Docker, and Kubernetes. Strong communication and stakeholder management skills. Experience with time series forecasting models (e.g., LSTMs, ARIMA, Prophet) and real-time anomaly detection. Industrial domain experience is a plus but not required. Why Join Cognite Be part of a global team of 70+ nationalities driving the industrial AI revolution. Enjoy a flat organizational structure with direct access to leadership and decision-makers. Thrive in a modern, collaborative, hybrid work environment based in Bengaluru. Work on some of the world's most ambitious digital transformation projects. Join a culture of ownership, creativity, and continuous innovation. 
If you're passionate about shaping the future of industrial AI through intelligent agent systems, we want to hear from you. Cognite welcomes applicants from all backgrounds and identities.
Genai Engineer
Rubrik
GenAI Engineer Location: Bangalore, India About the Team Rubrik's IT team is at the forefront of AI-powered transformation, focused on building intelligent automation and scalable data solutions to support the company's mission. As part of the newly established IT AI team, you'll help develop AI agents and workflows to unlock the full potential of generative AI in data engineering. About the Role We're seeking a GenAI Engineer to join our Data Engineering team, specializing in the development and deployment of AI agents and workflows powered by large language models (LLMs). You'll be responsible for integrating data sources, developing MCP clients/servers, and collaborating with cross-functional teams to drive GenAI adoption across the organization. Key Responsibilities Design and implement data integrations using MCP protocols or traditional data extraction methods Build scalable data pipelines for GenAI model training and deployment using tools like Snowflake Cortex, Gemini Agentspace, and Databricks LLM (Mosaic AI, RAG, Model Serving) Ensure high standards of data quality, integrity, and scalability for both structured and unstructured AI workloads Collaborate with business, data engineering, and application development teams to create AI-enabled products Integrate pipelines into existing infrastructure for seamless data flow and analytics Support high-quality data products tailored for AI and machine learning workflows Required Experience 1+ years building AI Agents or working with platforms like Snowflake Cortex, Gemini Agentspace, or similar open-source frameworks 3+ years in data engineering with a focus on AI/ML workloads 5+ years in data analytics using Snowflake or Databricks Strong programming skills in Python, Java, or Scala Familiarity with data storage, APIs, and cloud configurations Experience with data governance and scalable system design Solid understanding of LLMs, transformers, and frameworks such as LangChain Strong analytical and 
problem-solving skills in a dynamic environment Preferred Qualifications Proven experience building AI Agents and agentic workflows Understanding of MCP and Agent2Agent protocols Knowledge of generative model architectures and applications in data engineering Familiarity with data security and governance best practices in GenAI Experience with Agile methodologies and tools such as Jira and GitHub Join Us in Securing the World's Data Rubrik (NYSE: RBRK) is committed to delivering Zero Trust Data Security. Our ML-powered platform protects data across enterprise, cloud, and SaaS applications, helping organizations stay resilient against cyberattacks and disruptions.
Senior AI Program Manager
Rubrik
Senior AI Program Manager Location: Bangalore, India (Rubrik Office) Team: IT AI (Artificial Intelligence) About the Team Rubrik's IT AI team is leading AI-driven transformation across the organization, leveraging data, automation, and cutting-edge tech to support the company's mission of securing the world's data. This team partners across departments to deliver impactful, scalable AI solutions. Role Overview As a Senior AI Program Manager, you will lead the strategic planning, execution, and governance of AI initiatives across Rubrik's global IT operations. You'll collaborate with cross-functional business units and technical teams to deliver innovative, high-value AI solutions that align with Rubrik's business goals. Key Responsibilities Develop & manage a comprehensive AI program roadmap aligned with business goals. Collaborate with stakeholders across functions (Sales, HR, Finance, Legal, Support, etc.) to gather and prioritize AI solution requirements. Quantify ROI of AI initiatives and drive value-based prioritization. Oversee end-to-end AI project lifecycle: ideation, feasibility, development, deployment, adoption, and success measurement. Partner with technical teams: full-stack developers, data engineers, prompt engineers, cloud architects. Ensure compliance with data privacy, cybersecurity, and ethical AI standards. Champion AI adoption, innovation, and best practices across the organization. Maintain clear communication, manage risks, and provide consistent updates to leadership and stakeholders. What You'll Bring 5-8 years of experience in program management (IT/Tech Consulting/Engineering), with 2+ years leading AI-focused initiatives. Familiarity with Large Language Models (LLMs) and related technologies. Experience working alongside technical teams (developers, data engineers, cloud experts). Strong grasp of AI solution delivery using full-stack and cloud-based technologies. Skilled in business case development, ROI analysis, and roadmap execution. 
Excellent stakeholder communication, cross-functional leadership, and Agile project management skills (Jira, Confluence). Exceptional organizational and problem-solving abilities; comfortable in fast-paced, ambiguous environments. Preferred Qualifications Bachelor's degree in CS, Engineering, IT, or related field. Experience managing AI programs in large tech companies or consulting firms. Awareness of AI regulations and frameworks (e.g., GDPR, NIST, EU AI Act, CCPA). Rubrik (NYSE: RBRK) secures data across cloud, SaaS, and enterprise environments using its Zero Trust Data Security platform. Powered by machine learning, Rubrik helps organizations ensure data integrity, availability, and resilience against modern cyber threats and disruptions. Qualification : BS in Computer Science, Engineering, Information Technology, or a related technical field.
Consultant - UX Designer
Glance
Job Title: Consultant - UX Designer (Intern) Location: Bangalore, India Company: Glance An InMobi Group Company About Glance Join Glance, where creativity meets cutting-edge AI technology! Glance is revolutionizing mobile user experiences by combining AI and user-centered UX design directly on the lock screen. As a UX Intern, you'll work with a dynamic team creating context-driven, interactive content that delights millions of users worldwide. Position Overview As a UX Intern at Glance, you'll gain hands-on experience in UX design fundamentals while exploring emerging AI technologies and prompt engineering. This internship is ideal for individuals passionate about designing seamless, intuitive user experiences and eager to build skills in AI-driven UX and prompt engineering. Key Responsibilities Assist in UX Design: Create wireframes, layouts, and visual design elements that enhance usability and engagement on Glance's lock screen platform. Collaborate on AI-Driven UX Projects: Partner with designers, engineers, and data scientists to integrate AI for personalized and intuitive user interactions. Learn Prompt Engineering: Develop expertise in crafting prompts that help AI understand and respond effectively to user behavior, improving the overall UX. What You'll Gain Hands-On UX Design Experience: Build a strong foundation in UX principles and user-centered design workflows. Exposure to Advanced AI Technologies: Learn how AI can be harnessed to create innovative, personalized user experiences. Prompt Engineering Skills: Acquire in-demand skills in prompt engineering within AI-powered platforms. Mentorship and Professional Growth: Work alongside experienced professionals in a collaborative, supportive environment focused on your development. Qualifications Currently pursuing a degree in Design, Human-Computer Interaction (HCI), Psychology, Computer Science, or a related field. Strong interest in UX design fundamentals, AI technology, and emerging digital trends. 
Familiarity with UX tools such as Figma, Sketch, or similar is a plus. Creative mindset with curiosity and willingness to experiment with new ideas. Contract Duration: 6 Months Kickstart your UX career with Glance and be part of a future-forward team shaping next-gen mobile experiences. Apply now to join us in Bangalore! Qualification : Currently pursuing a degree in Design, Human-Computer Interaction (HCI), Psychology, Computer Science, or a related field.
Product Specialist Intern
Cloudsek
Job Title: Product Specialist Intern Cybersecurity Location: Bengaluru, Karnataka, India Internship Duration: 3 Months | Full-Time About CloudSEK CloudSEK is a cutting-edge AI-powered cybersecurity company that's revolutionizing the way digital threats are detected and mitigated in real-time. Founded in 2015 and headquartered in Singapore, we are committed to developing the fastest, most reliable AI and ML technology to identify, analyze, and resolve cyber threats. Our product suite includes: XVigil: Digital Risk Protection and Threat Intelligence Platform BeVigil: Attack Surface Monitoring and Threat Detection Tool SVigil: Contextual AI for Software Supply Chain Risk Management With rapid global expansion, including operations in India, Southeast Asia, and the Americas, CloudSEK has received accolades such as: NASSCOM-DSCI Excellence Award for Security Product Company of the Year NetApp Excellerator's Best Growth Strategy Award Series A funding of $7M to fuel growth and innovation Join us as we continue to redefine digital risk management! About the Role: Product Specialist Intern We are looking for enthusiastic Product Specialist Interns who are eager to learn and contribute to the cybersecurity space. If you have a passion for technology, client communication, and problem-solving, this is an excellent opportunity for you! As a Product Specialist Intern at CloudSEK, you will play a crucial role in supporting clients and helping them make the most of our cybersecurity products. Key Responsibilities Client Support & Communication: Act as the first point of contact for clients with product-related queries and issues. Provide assistance via email, phone, and online presentations. Troubleshooting & Issue Resolution: Identify, document, and troubleshoot customer issues, providing timely solutions or escalating to the relevant teams. 
Ownership & Accountability: Take ownership of client issues, ensuring they are resolved efficiently, and follow through with the internal teams for prompt resolution. Process & Compliance Tracking: Ensure all processes are followed and compliance standards are maintained. Product Knowledge & Updates: Stay updated on the latest cybersecurity trends, technologies, and product developments to better assist clients. Skills & Qualifications B.Tech Final Year Engineering students with a focus on Computer Science, Information Technology, or similar fields. Excellent verbal and written communication skills in English. Strong problem-solving capabilities and a keen interest in learning new technologies. Self-driven, with the ability to work independently in a fast-paced startup environment. Basic knowledge of CRM software and MS Office is a plus. Interest in Cybersecurity is a plus, but not mandatory. At CloudSEK, we believe in providing an environment where you can learn, grow, and develop your skills. As an intern, you will enjoy: Flexible working hours to promote work-life balance Access to free food, unlimited snacks, and beverages in the office Team bonding activities, games, and music sessions (we love to unwind together!) A chance to work in an innovative, fast-paced startup culture that encourages creativity and learning If you're passionate about technology and client success, and want to contribute to the world of digital risk protection, CloudSEK is the place for you. Apply now for the Product Specialist Internship and gain hands-on experience with cutting-edge cybersecurity technologies! Qualification : B.Tech Final Year Engineering students with a focus on Computer Science, Information Technology, or similar fields.
Ai Ml Engineer
Wipro Limited
AI/ML Engineer - Bengaluru, India Experience: 4 to 6 years About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global leader in technology services and consulting. Operating in 65 countries with over 230,000 employees, Wipro helps organizations achieve digital transformation through innovative, future-ready solutions. Role Overview We are hiring an experienced AI/ML Engineer to work on advanced machine learning and Generative AI solutions. You will design prompt engineering pipelines, develop ML models using frameworks like TensorFlow and PyTorch, and deliver scalable backend services using Python and Django. Key Responsibilities Design and implement GenAI-based use cases using prompt engineering techniques Build REST APIs and scalable microservices using Python and Django Evaluate GenAI model performance and implement guardrails Deploy applications to Azure or AWS cloud environments Lead prompt lifecycle management including tuning, templating, and optimization Ensure integration with databases and external APIs Use CI/CD tools for continuous deployment (Azure DevOps, Jenkins, Ansible, Terraform) Mandatory Skills Python, Django, REGEX AI/ML, Deep Learning, NLP TensorFlow, PyTorch Generative AI, LLMs, RAG Pipelines REST API development Preferred Skills API Gateways: WSO2, KONG, nginx Apache HTTP Server Azure DevOps, Ansible, Jenkins, Terraform Databricks Additional Requirements Understanding of OOP, design patterns, and scalable architecture Familiarity with Docker and version control tools like GitHub or Bitbucket Team leadership in prompt engineering and stakeholder engagement Knowledge of secure prompt design to mitigate injection or leakage risks Join a forward-thinking digital transformation company where your ideas shape the future. We value diversity, innovation, and continuous learning. Applications from individuals with disabilities are highly encouraged.
Gen Ai Engineer - L1
Wipro Limited
Gen AI Engineer - L1 | Bengaluru, India About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global IT consulting and services company with over 230,000 employees across 65 countries. We offer innovative solutions in consulting, engineering, and operations to solve complex digital transformation challenges. Role Overview We are looking for an experienced Gen AI Engineer - L1 with deep expertise in Generative AI, LLMs, RAG pipelines, and Python-based machine learning frameworks. The role will focus on developing secure, scalable AI systems using modern tools and cloud platforms. Key Responsibilities Design, implement, and optimize generative AI models using LangChain, LLaMA, Hugging Face, etc. Develop RAG pipelines and integrate with LLMs for advanced AI solutions. Create, test, and optimize prompt templates across different base models. Implement guardrails for prompt security to prevent prompt injection, jailbreaks, and leaks. Build efficient backend applications using Python, Django, and related tools. Work with vector databases to enhance generative AI workflows. Collaborate on data grooming and model training across business units. Benchmark model performance and develop auto-prompting systems. Ensure adherence to minimum design standards in prompt engineering use cases. Mandatory Skills Gen AI, LLMs, RAG Pipelines LangChain, LLaMA, Hugging Face Python, TensorFlow, PyTorch, Django NLP, Machine Learning, Deep Learning Vector Database integration Preferred Skills Azure or AWS Cloud Platforms MLOps, Kubernetes GitHub, Bitbucket Experience with GPT-4 Domain exposure in Banking or Financial Services Join Wipro to be part of a company that thrives on innovation, reinvention, and digital excellence. Work on impactful GenAI projects, grow your career, and contribute to shaping the future of AI in real-world applications. We welcome applications from individuals with disabilities.
Datascientist With Gen Ai
Wipro Limited
Data Scientist with GenAI | Bengaluru, India About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global IT services and consulting company with over 230,000 employees in 65+ countries. We provide transformative digital solutions across consulting, engineering, and cloud. Job Overview We are seeking a skilled Data Scientist with Generative AI expertise to lead the development of AI/ML solutions using advanced technologies like LLMs, LangChain, and AWS. You will architect and implement end-to-end GenAI pipelines for enterprise-grade applications. Key Responsibilities Design and implement AI/ML solutions using Python and AWS. Develop GenAI systems using LLMs, LangChain, and RAG pipelines. Fine-tune AI models to optimize performance and accuracy. Create custom solutions when off-the-shelf tools don't meet requirements. Build SQL-based data pipelines extracting from Snowflake for model training. Integrate NLP capabilities into AI architecture to process textual data. Ensure responsible AI practices including security, fairness, and transparency. Lead and mentor team members, upholding coding and design standards. Communicate progress, risks, and strategies to stakeholders clearly. Mandatory Skills Python GenAI, LLMs AWS Cloud Services Preferred Skills NLP, AI/ML LangChain, RAG SQL, Snowflake Model fine-tuning and custom architecture development Join Wipro and be part of a global digital transformation journey. We empower talent to lead innovation and embrace reinvention. Applications from people with disabilities are explicitly encouraged.
Gen Ai Aws Engineer
Wipro Limited
Gen AI AWS Engineer | Bengaluru, India Experience: 8-10 Years Mandatory Skills: AWS, Generative AI About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global leader in consulting and technology services with over 230,000 employees across 65 countries. We provide digital transformation solutions that are innovative, sustainable, and scalable. Role Overview We are hiring a highly skilled Gen AI AWS Engineer to lead the architecture, development, and delivery of AI-driven cloud solutions. This role focuses on leveraging AWS infrastructure and Generative AI technologies to solve complex business problems at scale. Key Responsibilities Architecture and Solution Design Design and implement scalable AI/ML solutions using AWS and GenAI frameworks. Lead solution design for enterprise initiatives and RFPs. Develop architectural documentation and reusable design patterns. Evaluate current architectures and recommend improvements and modernization plans. Delivery Enablement Provide frameworks and technical direction to development teams. Monitor project risks and propose mitigation strategies. Ensure alignment with architecture principles and best practices. Client Engagement Participate in pre-sales activities and client presentations. Build trusted relationships with clients by demonstrating technical thought leadership. Coordinate with stakeholders to ensure successful delivery of AI solutions. Innovation & Competency Building Create PoCs, whitepapers, and solution demos in GenAI and AWS domains. Represent Wipro's capabilities at industry events and internal forums. Mentor junior architects and contribute to internal upskilling initiatives. Team Management Recruit, train, and retain high-performing AI/ML engineers. Conduct performance reviews and set goals for team members. Drive employee engagement and diversity across the architecture team. Be part of a company that thrives on innovation, reinvention, and purpose. 
Work with cutting-edge Generative AI solutions and lead impactful transformations for global clients. Applications from individuals with disabilities are explicitly welcome.
Engineer, Principal/manager - Machine Learning, Ai
Qualcomm India Private Limited
Engineer, Principal/Manager - Machine Learning, AI Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary Qualcomm is seeking an experienced and visionary Principal AI/ML Engineer to lead research, development, and optimization of AI inference systems. This role involves developing high-performance AI models, optimizing deployments across various hardware platforms, and contributing to research in model compression, quantization, and hardware-aware optimization. Education & Experience PhD with 6+ years, Master's with 7+ years, or Bachelor's with 8+ years in Engineering, CS, or related field. 20+ years of experience in AI/ML development; 5+ years in inference optimization and debugging. Key Responsibilities Model Optimization & Quantization Optimize models using quantization (INT8, INT4, mixed precision), pruning, and knowledge distillation. Implement PTQ and QAT techniques for deployment. Experience with TensorRT, ONNX Runtime, OpenVINO, TVM. AI Hardware Acceleration & Deployment Target platforms: Hexagon DSP, CUDA GPUs, TPUs, NPUs, FPGAs, Habana Gaudi, Apple Neural Engine. Use Python APIs: cuDNN, XLA, MLIR for hardware acceleration. Benchmark and debug performance across platforms. AI Research & Innovation Research on efficient AI inference: model compression, low-bit precision, sparse computing. Explore architectures like Sparse Transformers, Mixture of Experts, Flash Attention. Publish in ML conferences: NeurIPS, ICML, CVPR; contribute to open-source projects. Technical Expertise Optimization of LLMs, LMMs, LVMs for inference. Deep Learning frameworks: TensorFlow, PyTorch, JAX, ONNX. Expert in CUDA, cuPy, Numba, TensorRT, ONNX Runtime, OpenVINO. Skilled in Python for scalable AI development. Experience with ML runtime delegates: TFLite, ONNX, Qualcomm AI Stack. Debugging: Netron, TensorBoard, PyTorch Profiler, Nsight, perf, Py-Spy. Cloud inference: AWS Inferentia, Azure ML, GCP AI Platform, Habana Gaudi. 
Hardware-aware optimization: oneDNN, ROCm, MLIR, SparseML. Contributions to open-source and research publications are a strong plus. Leadership & Collaboration Lead a team of engineers in Python-based AI inference and optimization. Collaborate with researchers, software engineers, DevOps, and hardware vendors. Define debugging, deployment, and performance tuning best practices.