Model Inference Jobs in Bengaluru
405 Jobs Found
Assistant Risk Modelling Manager
Osb India
Assistant Risk Modelling Manager Location: Bengaluru Department: Risk & Modelling About OneSavings Bank (OSB) Group OneSavings Bank (OSB) Group is a specialist lending and retail savings group listed on the London Stock Exchange and a member of the FTSE 250. Headquartered in Chatham, Kent, OSB is regulated by the Prudential Regulation Authority and the Financial Conduct Authority. OSB focuses on niche lending markets offering high growth and strong risk-adjusted returns, including: Buy-to-Let and commercial mortgages Residential development finance Specialist residential lending and secured funding lines We operate under trusted brands such as Kent Reliance, CCFS, InterBay Commercial, Prestige Finance, and Heritable Development Finance. Retail savings are primarily sourced through Kent Reliance via branches, online, and postal channels. Our offshore delivery and support operations are handled by OSB India, with offices in Bengaluru and Hyderabad. About OSB India Pvt Ltd OSB India, a wholly owned subsidiary of OSB Group, plays a critical role in delivering operational and customer support services. Since 2004, OSB India has focused on service excellence, process efficiency, and continuous improvement for the group s UK operations. Role Overview As the Assistant Risk Modelling Manager, you will support capital and impairment reporting, provide deep data insights, and contribute to strategic projects. This role involves analysis, stakeholder collaboration, and ensuring regulatory and internal compliance. Key Responsibilities Lead and support monthly IFRS9 impairment and IRB RWA reporting with trend analysis and insights Provide analytics to support collections and help define operational priorities Drive and deliver strategic projects, managing timelines and stakeholders Assist with IFRS9 engine code changes, conduct impact assessments, and challenge trends Identify process and model weaknesses and develop mitigating solutions Produce clear, insightful commentary for credit and audit committees, including regulatory teams Ensure compliance with model execution and operational risk requirements Maintain adherence to Finance, Risk Management, and Data Governance Policies Build strong working relationships with UK stakeholders and capture clear requirements Complete all mandatory compliance training and attestations Experience Required Minimum 7+ years in a related role in retail or mortgage finance Extensive hands-on experience in SAS, SQL, and advanced Excel Proven ability to generate and present detailed analytical and reporting outputs Experience with impairment/capital modelling processes (preferred) Comfortable managing priorities, leading tasks, and collaborating with international teams Technical & Functional Skills Expert in SAS and SQL for data analysis and reporting Working knowledge of IFRS9 (impairment) or IRB (capital) frameworks Understanding of probability/statistics in a financial risk context (preferred) Core Competencies Strong analytical thinking and problem-solving skills Effective communication skills, both written and verbal Ability to deliver clear, actionable reports to senior stakeholders Self-motivated with a proven ability to learn new technical skills and tools This role is an exciting opportunity to work at the intersection of data, risk, and strategy within a dynamic and growing financial group. If you have a strong analytical mindset and are looking to influence real business decisions, we'd love to hear from you.
Associate ML Ops
Mpokket Financial Services Private Limited
Job Title: Associate ML Ops Location: Bangalore Department: Data Science Employment Type: Full-time Experience: 1 2 years Job Overview We are seeking a motivated and detail-oriented Associate ML Ops to join our Data Science team. In this role, you will be responsible for supporting the deployment, monitoring, and scaling of machine learning models in production environments. You ll collaborate closely with data scientists and engineers to build robust MLOps pipelines and ensure model reliability, scalability, and performance. If you are passionate about bringing machine learning models to life and have hands-on experience in productionizing ML systems, we d love to hear from you. Key Responsibilities Deploy and maintain machine learning models in production environments using best-in-class tools like Databricks and MLflow. Collaborate with data scientists to translate experimental models into scalable, production-ready systems. Monitor model performance, accuracy, and overall health through automated tools and custom strategies. Build and maintain RESTful APIs using Python frameworks such as Flask or Django to serve ML models. Write efficient and optimized SQL and NoSQL queries for data extraction and transformation. Apply software engineering best practices, including version control, testing, and documentation, to MLOps workflows. Work with Python libraries like Pandas, PySpark, scikit-learn, SQLAlchemy, and Requests. Troubleshoot issues related to model deployment, API performance, or data integration pipelines. Minimum Qualifications Bachelor s or Master s degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field. 1 2 years of hands-on experience in solving analytical or machine learning problems in production settings. Must-Have Technical Skills Hands-on experience with Databricks and MLflow Proven expertise in deploying ML models in real-world applications Strong understanding of data structures, algorithms, OOP, and software engineering principles Experience building and maintaining REST APIs using Python Proficiency in SQL and NoSQL Excellent Python programming and debugging skills Familiarity with core Python libraries used in ML and data processing: Pandas, scikit-learn, PySpark, SQLAlchemy, etc. Nice-to-Have Skills Exposure to Kafka for streaming and batch data processing Familiarity with Git and CI/CD pipelines Experience with Python multiprocessing or worker/queue systems Understanding of event-driven or asynchronous programming models This is an exciting opportunity to work at the intersection of data science and engineering. You ll play a key role in productionizing cutting-edge models and ensuring they deliver real business impact. Qualification : Bachelors or Masters degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field
ML Ops Engineer
Mpokket Financial Services Private Limited
Job Title: ML Ops Engineer Location: Bangalore Department: Data Science Employee Type: Full-time Experience Required: 3 5 years Position Overview We are seeking an experienced and motivated ML Ops Engineer to join our Data Science team. In this role, you will be responsible for deploying, monitoring, and maintaining machine learning models in production environments. You will work closely with data scientists, engineers, and product teams to ensure models are scalable, reliable, and aligned with business objectives. This role is ideal for professionals who are passionate about building robust ML pipelines and bringing machine learning solutions into real-world applications at scale. Key Responsibilities Deploy and manage machine learning models in production environments, ensuring scalability, reliability, and performance. Build and maintain MLOps pipelines using platforms like Databricks and MLflow. Monitor model performance, accuracy, and health; implement alerting and diagnostics as needed. Develop and maintain RESTful APIs using Python frameworks such as Flask or Django to serve ML models. Optimize data workflows and collaborate with engineering teams to improve model integration and performance. Design strategies for automated model retraining, deployment, and version control. Write clean, maintainable, and efficient code using Python, adhering to OOP principles and best practices. Write complex queries using SQL and work with NoSQL databases to support data pipelines and feature stores. Leverage Python libraries such as PySpark, Pandas, scikit-learn, SQLAlchemy, and Requests. Minimum Qualifications Bachelor s or Master s degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field. 3 5 years of experience in building, deploying, and monitoring machine learning solutions in production. Must-Have Skills Experience with Databricks and MLflow for model training and deployment. Proven expertise in machine learning model deployment and monitoring in live environments. Strong programming skills in Python, with solid understanding of data structures, algorithms, and OOP concepts. Experience developing RESTful APIs using Flask or Django. Proficient in SQL and NoSQL database operations. Hands-on knowledge of libraries such as Pandas, PySpark, scikit-learn, SQLAlchemy, and Requests. Strong analytical, problem-solving, and debugging skills. Good-to-Have Skills Experience with Kafka streaming and batch processing. Familiarity with CI/CD pipelines and version control systems like Git. Understanding of Python multiprocessing, worker/queue systems, and asynchronous/event-driven programming. This is a unique opportunity to work at the intersection of machine learning and DevOps. You'll play a critical role in operationalizing AI models and making them a core part of our product offerings. If you enjoy building scalable systems and solving real-world ML engineering challenges, we d love to meet you. Qualification : Bachelors or Masters degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field
Data Scientist
Subex Limited
Position: Data Scientist (AI/ML Expert) Location: Pritech Park SEZ, Block 09, 4th Floor B Wing, Survey No. 51 to 64/4, Outer Ring Road, Bellandur V, Bangalore, Karnataka, India Department: Advanced Analytics Employment Type: Subexian Experience Required: 1 to 3 years Job Overview: We are looking for a talented Data Scientist with expertise in AI/ML to join our Advanced Analytics team. As a key contributor, you ll design, develop, and validate predictive models, recommendation systems, and forecasting solutions, while also collaborating with cross-functional teams to deliver cutting-edge solutions using the latest technologies. Key Responsibilities: Model Development: Design, develop, and validate predictive models, recommendation systems, and forecasting solutions using a mix of statistical, machine learning, and deep learning techniques. You will work both independently and as part of a collaborative team. Data Visualization & Reporting: Communicate actionable insights effectively through compelling dashboards, reports, and visualizations using tools such as Superset, Power BI, and Python libraries (Matplotlib, Seaborn, Plotly). AI & Tech Solutions: Collaborate with teams to design and deliver flexible, scalable solutions using advanced technologies such as AI and large language models (LLMs). API Development: Develop and integrate REST APIs and frameworks such as Flask or FastAPI for seamless deployment of machine learning models. Documentation: Maintain clear, comprehensive documentation for data workflows, model development, and analytical methodologies to ensure knowledge sharing and transparency across teams. Continuous Learning: Stay up-to-date with the latest trends and advancements in data science, algorithms, and technologies, ensuring your skills and knowledge remain cutting-edge. Required Technical Skills: Python Proficiency: Strong experience with Python and libraries like Scikit-learn, TensorFlow/PyTorch, and data visualization libraries (Matplotlib, Seaborn, Plotly). SQL: Solid hands-on experience in SQL for efficient data querying. ML Ops & Pipelines: Understanding of machine learning operations (ML Ops) and ML pipelines for streamlined model deployment. Cloud & Distributed Computing: Exposure to cloud platforms such as AWS, Azure, or GCP and distributed computing tools like Hadoop, Spark, or Pyspark is a plus. Soft Skills: Effective Communication: Strong ability to communicate complex analytical findings in a clear and engaging manner, tailoring insights for both technical and non-technical audiences. Problem-Solving: A proactive problem-solver with the ability to adapt and thrive in a fast-paced, dynamic environment. Continuous Growth: Self-motivated, curious, and always seeking opportunities for professional growth and learning. At Subex, we encourage a collaborative, innovative, and growth-driven work environment. If you're passionate about applying data science techniques to real-world challenges and want to work with cutting-edge AI/ML technologies, we d love to hear from you!
Senior Data Scientist - LLM
5c Network Pvt. Ltd.
Position: Senior Data Scientist LLM Location: Bangalore, Karnataka, India Type: Full-Time (On-site) Experience Required: 2+ years in Deep Learning, 1+ years in LLMs Industry: Healthcare AI Company Overview: 5C Network is pioneering multi-modal AI systems for autonomous diagnosis in medical imaging. We're building next-generation models that integrate deep learning with language understanding to revolutionize clinical workflows and diagnostic accuracy. Role Summary: As a Senior Data Scientist LLM, you will lead the development and deployment of Large Language Models (LLMs) focused on enhancing medical imaging diagnostics. You ll work on cutting-edge problems such as instruction fine-tuning, prompt engineering, and Retrieval-Augmented Generation (RAG), while ensuring scalability and robustness in production environments. Key Responsibilities: LLM Development & Fine-Tuning Design and optimize prompts for diverse clinical and imaging-related use cases. Perform instruction fine-tuning of LLMs to meet task-specific requirements. Develop reasoning pipelines including Chain of Thought (CoT) techniques for complex diagnostic workflows. LLM Deployment & Optimization Self-host and deploy LLMs in secure, scalable production environments. Apply quantization and other performance optimization methods to minimize compute and memory footprint. Ensure high performance, uptime, and security in AI deployments. Retrieval-Augmented Generation (RAG) & Vector Databases Develop and implement RAG pipelines by integrating LLMs with semantic search. Work with vector databases (e.g., Qdrant) to enable fast, efficient retrieval of contextual data. Optimize data storage, indexing, and retrieval to support clinical applications. Data Engineering & Annotation Build and manage high-quality datasets tailored for CoT and multi-step reasoning tasks. Lead data annotation efforts to enhance LLM understanding of medical contexts. Collaboration & Research Collaborate with researchers, ML engineers, and domain experts to bring LLM solutions from prototype to product. Stay ahead of the curve by experimenting with novel LLM architectures and emerging techniques. Qualifications: Bachelor's or Master s degree in Computer Science, Data Science, AI, or a related field. 2+ years of hands-on experience in deep learning. Minimum 1 year of experience working with LLMs (e.g., instruction tuning, prompt engineering, RAG). Prior experience in LLM deployment (self-hosting, optimization, quantization, scaling). Proficient in Python and common ML frameworks (e.g., PyTorch, Hugging Face Transformers). Familiarity with vector databases like Qdrant or similar. Strong interest or prior exposure to healthcare/medical AI. Excellent problem-solving, communication, and team collaboration skills. Technical Stack: Languages: Python Frameworks: PyTorch, Hugging Face Technologies: LLMs (e.g., GPT, LLaMA), Vector Databases (Qdrant), RAG, Quantization Tools: Docker, Kubernetes, REST APIs, Git Work on high-impact AI solutions in the healthcare domain. Collaborate with a team at the forefront of multi-modal diagnostic technology. Access cutting-edge tools and real-world data to drive innovation. Qualification : Bachelor's or Masters degree in Computer Science, Data Science, AI, or a related field.
Senior Data Scientist - AI
5c Network Pvt. Ltd.
Position: Senior Data Scientist - AI Employment Type: Full-Time Location: Bangalore, Karnataka, India Experience Required: 2+ years Job Summary: 5C Network is developing multi-modal AI solutions for autonomous diagnosis of medical imaging. This role focuses on building and deploying advanced AI models using Computer Vision and Vision Language Models to analyze CT and MRI scans, helping detect, segment, and classify abnormalities. Key Responsibilities: Develop, train, and implement computer vision models, VLMs, and 3D models on CT and MRI imaging data. Collaborate closely with leading radiologists and AI researchers from India and the US. Work with product and engineering teams to deploy and operationalize AI models effectively. Contribute to cutting-edge research and assist in publishing findings in academic journals and conferences. Qualifications: Bachelor s or Master s degree in Computer Science, Data Science, or related fields. Proven experience in training and deploying deep learning-based computer vision models. Strong interest or experience in healthcare AI applications. Excellent analytical, problem-solving, and communication skills. Ability to collaborate effectively in a research-driven, team-oriented environment. Technical Requirements: 2+ years of hands-on experience with deep learning. Proficient in Python and deep learning frameworks, especially PyTorch. Experience with well-known computer vision architectures (ResNet, DenseNet, U-Net, Vision Transformers, V-Net, etc.). Expertise in building deep learning architectures from scratch. Experience working with Large Language Models (LLMs) is a plus. Knowledge of model pruning, quantization, and scaling model training and inference on GPUs. Qualification : Bachelors or Masters degree in Computer Science, Data Science, or related fields.
AI/ML Architect
Thoughtfocus
Job Title: AI/ML Architect Location: Bangalore, India Experience: 15 17 Years Employment Type: Full-Time Overview: We are seeking an experienced AI/ML Architect with 15 17 years of overall experience, including 2 3 years in project management. This strategic role requires deep expertise in AI/ML technologies, enterprise AI architecture, and hands-on delivery of cutting-edge solutions such as autonomous AI agents and generative AI platforms. The architect will lead the design, proposal, and execution of AI-driven solutions across diverse client engagements. Key Responsibilities: Solution Architecture & Design: Design and evolve robust AI/ML architectures incorporating autonomous agents, scalable business intelligence, and generative AI. Collaborate with cross-functional teams (data scientists, ML engineers, UX designers, and business strategists) to convert business needs into technical designs. Evaluate emerging technologies, frameworks, and trends to guide architectural roadmaps and innovation strategies. RFP Response & Proposal Development: Lead the technical response for RFPs, creating solution frameworks that align with client objectives and industry standards. Draft detailed technical proposals, use cases, and presentations to demonstrate the business value and ROI of AI solutions. Partner with sales and business teams to align proposals with market needs and positioning. Project Delivery & Technical Leadership: Oversee full AI/ML solution lifecycles from architecture design to deployment and monitoring ensuring compliance, quality, and performance. Provide hands-on guidance and mentorship to technical teams, promoting best practices in data science, ML model development, and deployment pipelines. Manage project schedules, resource planning, and risk mitigation for successful delivery outcomes. Stakeholder & Cross-Functional Collaboration: Act as the primary technical liaison across teams, ensuring stakeholder alignment on vision, progress, and technical challenges. Conduct technical workshops, reviews, and solution demos to showcase innovation and architectural decisions. Innovation & Continuous Improvement: Monitor industry trends, publications, and competitor offerings to integrate innovative features into AI/ML initiatives. Continuously refine and optimize existing platforms, ensuring scalability, performance, and user-centricity. Required Skills & Qualifications: 15 17 years of overall experience with 2 3 years in a leadership/project management role. Strong track record in AI/ML architecture, especially with autonomous agent systems, enterprise AI, and generative AI technologies. Proficient in modern AI/ML frameworks (TensorFlow, PyTorch, etc.), data pipelines, and cloud platforms (AWS, Azure, GCP). Experience designing and implementing end-to-end AI solutions in complex, data-driven environments. Demonstrated success in crafting technical proposals and responding to RFPs. In-depth knowledge of NLP, deep learning, traditional ML, and AI product integration. Effective project and stakeholder management skills; able to balance priorities across multiple projects. Excellent communication and presentation skills, with the ability to articulate AI value to technical and non-technical stakeholders.
Lead Data Scientist
Playsimple
Job Title: Lead Data Scientist Location: Bangalore North, Karnataka, India Job Type: Full-Time Industry: Entertainment / Mobile Gaming About Us We are one of India s most exciting and fast-growing mobile gaming companies. Founded in 2014 and partnered with Modern Times Group (MTG), our vision is to create simple, impactful casual game experiences at massive scale. Our portfolio includes evergreen hits such as Daily Themed Crossword, WordTrip, WordJam, WordWars, WordTrek, TileMatch, and Jigsaw. We have built a global network of chart-topping games supported by powerful tech and analytics infrastructure that fuels rapid growth. Position Summary As a Lead Data Scientist in our Central Analytics team, you will play a critical role in shaping data-driven strategies that enhance player experience and business performance. This fast-paced role offers abundant opportunities to work alongside product leaders and game teams, transforming complex data into actionable insights that drive user acquisition, engagement, and monetization. Key Responsibilities Collaborate closely with product leaders to provide data-driven advisory on strategic decisions. Partner with game development teams to analyze gameplay data and generate actionable insights that improve user acquisition, engagement, and monetization. Perform advanced exploratory data analyses and ad-hoc reporting to identify trends, issues, and opportunities across our game portfolio. Design, execute, and lead data research projects, delivering practical recommendations based on rigorous statistical analyses. Drive continuous improvement in game performance through innovative machine learning models and analytics techniques. Requirements Bachelor s/Master s/PhD degree in Computer Science, Statistics, or a related field. Proven experience with machine learning, statistical modeling, and data science projects. Hands-on proficiency in Python and/or Spark for data manipulation, visualization, and building ML models. Strong SQL skills with experience querying large, complex datasets from data lakes or warehouses. Demonstrated ability to lead research projects and translate findings into actionable business recommendations. Excellent interpersonal skills and a collaborative approach to working with cross-functional teams. Knowledge of Deep Learning frameworks and techniques is highly desirable. Work with a top-tier gaming company known for its innovative and data-driven culture. Influence millions of users worldwide through impactful analytics. Collaborate with talented teams in a high-growth, dynamic environment. Access to cutting-edge tools and technologies for data science and machine learning. Competitive compensation and career growth opportunities. Qualification : Bachelors/Masters/PhD degree in Computer Science, Statistics, or a related field.
Software Developer-c++
Siemens
Software Developer C++ Location: Bangalore, Karnataka, India Employment Type: Full-time, Permanent Experience Level: Experienced Professional (6-8 years) Role Overview We are seeking a proactive and skilled Full Stack Developer with deep expertise in C++ to contribute to the development of MR image reconstruction modules integrated with AI. The ideal candidate will actively research and innovate MR reconstruction techniques, improve module performance, and collaborate closely with cross-functional teams to deliver high-quality medical imaging solutions. Key Responsibilities Develop, improve, test, and maintain MR image reconstruction modules. Conduct research to enhance acquisition speed, data extraction, noise/artifact robustness, and overall reconstruction quality. Develop AI inferencing code, prepare data, and support model training activities. Manage code repositories and version control systems such as Git or Azure Repos. Participate actively in design discussions, code reviews, and agile development processes. Troubleshoot and optimize module performance, security, and scalability. Collaborate with product owners and stakeholders to manage backlogs and ensure continuous feature delivery. Required Skills & Qualifications Education: BE/B.Tech/MCA/ME/M.Tech from a recognized institution. Core Expertise: Strong practical experience in C++ development, object-oriented programming, and design patterns. Additional Skills: Python programming experience (advantageous). Knowledge of medical imaging modalities, particularly MRI (preferred). Strong foundation in physics, mathematics, signal processing, linear algebra, probability, and random processes. Understanding of inverse problems, AI, imaging chains, MR reconstruction, and pulse sequences is a plus. Soft Skills: Strong analytical and problem-solving skills, clear communication, and a passion for learning and creative thinking. Tools: Experience with Azure Repos or Git for version control. Experience 6 to 8 years of core development experience with C++. Collaborative work environment fostering professional growth. Challenging projects enhancing technical expertise. Competitive compensation and benefits.
Senior Analyst
Latentview Analytics
Role: Senior Analyst Machine Learning Performance & Testing Location: Bengaluru, Karnataka, India Experience: 3 5 Years Employment Type: Permanent, Full-Time About the Role We are seeking a skilled and detail-oriented Senior Analyst with strong experience in ML model performance testing, load testing, and end-to-end (E2E) automation. This role is focused on ensuring scalable, low-latency deployment of production-grade machine learning models. The ideal candidate will be proficient in evaluating model performance under varied workloads, building robust test frameworks, and enhancing system monitoring. Key Responsibilities Conduct load testing and performance benchmarking for machine learning models under varying requests per second (RPS) scenarios. Develop and automate end-to-end test cases to validate model readiness and support smooth rollouts. Monitor and improve model scalability, response time, and error rates across production environments. Collaborate with ML engineers, backend developers, and QA test teams to ensure seamless integration and testing workflows. Identify and address bottlenecks in model inference, helping improve performance for high-volume, low-latency applications. Set up alerting and observability pipelines for model health using industry-standard tools. Required Skills & Tools Performance Testing & Monitoring: ML Load Testing, Job Monitoring, Model Scalability Evaluation Platforms & Tools: Databricks, MLflow, Seldon, Kubeflow, Tecton, Jenkins Cloud Services: Experience with AWS and deploying/testing models in cloud environments Programming Languages: Proficiency in at least one of the following Python, Java, Scala Experience: Working with production-level ML models, especially involving high data volumes and real-time inference Strong communication skills and ability to work in cross-functional teams Preferred Qualifications Hands-on experience with CI/CD pipelines for ML systems Knowledge of A/B testing and canary deployments for ML models Experience building testing frameworks for ML infrastructure at scale Understanding of monitoring and alerting best practices in production ML systems Be at the forefront of ML operations and model performance optimization Collaborate with industry-leading engineers and contribute to cutting-edge AI deployments Gain deep exposure to real-time data systems, cloud platforms, and enterprise-scale ML testing Competitive compensation and an innovative, fast-paced work environment
Machine Learning Engineer - Speech Ai (asr & Tts)
Sarvam
Machine Learning Engineer - Speech AI (ASR & TTS) Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a pioneering generative AI startup headquartered in Bengaluru, India. We specialize in leading transformative research and development in speech and language technologies. Focused on building state-of-the-art ASR (Automatic Speech Recognition) and TTS (Text-to-Speech) models, particularly for Indic languages, we aim to redefine human-computer interaction with cutting-edge, AI-driven solutions. Join us as we push the boundaries of Speech AI to create inclusive, scalable, and intelligent voice-based applications for diverse communities worldwide. Role Overview We are seeking an experienced Machine Learning Engineer specializing in Speech AI (ASR & TTS). The ideal candidate will work on deep learning-based ASR and TTS models, improving accuracy, efficiency, and multilingual capabilities while deploying them at scale. The role involves developing and optimizing speech recognition and synthesis models with a focus on low-resource languages, real-time inference, and scalability. If you have a passion for speech processing and deep learning, this is a great opportunity to make a significant impact in a rapidly growing field. Key Responsibilities ASR (Automatic Speech Recognition) Develop, train, and optimize speech-to-text models using state-of-the-art architectures like Wav2Vec, Whisper, Conformer, and DeepSpeech. Implement techniques for low-latency ASR inference, including beam search, language model integration, and real-time transcription. Improve speech recognition accuracy for low-resource languages, especially Indic languages, using transfer learning and data augmentation. Optimize ASR pipelines for noise robustness, speaker adaptation, and domain-specific transcription. TTS (Text-to-Speech) Develop and fine-tune neural TTS models such as Tacotron, FastSpeech, VITS, or WaveNet for high-quality, natural-sounding speech synthesis. Implement multilingual and expressive TTS models with prosody and emotion control. Optimize TTS inference for deployment on edge devices, mobile, and cloud platforms. Improve speech synthesis quality through voice cloning, neural vocoders (HiFi-GAN, WaveGlow), and prosody modeling. General Speech AI Responsibilities Benchmark and profile ASR/TTS models to improve latency, efficiency, and deployment performance. Deploy scalable speech AI APIs on AWS, Azure, or GCP for real-world applications. Optimize ASR & TTS models for edge and offline inference. Stay updated with the latest advancements in speech AI, neural vocoders, and real-time inference techniques. Must-Have Qualifications Experience: 2-3 years of experience in speech AI, deep learning, or machine learning, with a focus on ASR & TTS. Education: Bachelor s or Master s degree in Computer Science, AI/ML, Speech Processing, or a related field. ML Frameworks: Proficiency in PyTorch or TensorFlow for training and deploying ASR/TTS models. ASR Expertise: Experience with speech-to-text architectures like Whisper, Wav2Vec, Conformer, or DeepSpeech. TTS Expertise: Experience with speech synthesis models like Tacotron, FastSpeech, or VITS. Speech Signal Processing: Understanding of MFCCs, STFT, phonemes, prosody modeling, and feature extraction. Inference Optimization: Hands-on experience with TensorRT, ONNX, or quantization (INT8, FP16) for ASR/TTS. Cloud & Edge Deployment: Experience deploying speech models on AWS, GCP, or Azure. Preferred Qualifications Experience with speech diarization, speaker recognition, or language modeling for ASR. Familiarity with zero-shot TTS, voice cloning, and multilingual speech modeling. Understanding of CUDA optimization and low-bit quantization for ASR/TTS models. Contributions to open-source speech AI projects or a strong GitHub portfolio showcasing relevant work. Experience with real-time streaming ASR/TTS applications and low-latency inference. Innovative Impact: Work on AI-driven speech solutions that are changing how people interact with technology, especially in low-resource languages. Cutting-Edge Technology: Contribute to the development of state-of-the-art speech AI models in a rapidly advancing field. Collaborative Environment: Work with a team of experts in AI, machine learning, and speech processing, in a startup culture. Growth Opportunities: Sarvam.ai offers exciting career growth in a fast-paced environment with opportunities for personal and professional development. Interested candidates are invited to submit their resume, cover letter, and any relevant project portfolios or GitHub links showcasing their experience in ASR, TTS, or Speech AI. Strong AI-related projects, whether in industry, research, or personal work, will be highly valued. Qualification : Bachelors or Masters degree in Computer Science, AI/ML, Speech Processing, or a related field.
Ai Platform Architect
Adobe
AI Platform Architect Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is changing the world through digital experiences. Whether you're an emerging artist or a global brand, our tools empower creativity and innovation across every screen. From powerful imaging and video solutions to immersive web and app design, Adobe s mission is to help people and businesses deliver exceptional digital experiences. We are committed to creating an inclusive workplace where everyone is respected and given equal opportunity. Innovation can come from anywhere and the next big idea could be yours. Job Description We are looking for a visionary AI Platform Architect with deep expertise in building and scaling cloud-native, AI-powered platforms. The ideal candidate will have experience deploying large-scale, customer-facing AI solutions and a deep understanding of modern cloud architecture, data systems, MLOps, and LLMOps. Responsibilities Design and develop scalable AI/ML platforms and pipelines across AWS, Azure, and GCP. Architect end-to-end LLM pipelines including model training, fine-tuning, serving, inference APIs, and monitoring. Lead cross-functional teams in delivering AI solutions from experimentation to production. Implement MLOps and LLMOps best practices using tools like MLFlow, SageMaker, Langchain, and LangGraph. Design GPU-optimized architectures for training and inference of LLMs using DeepSpeed, vLLM, and other modern frameworks. Support infrastructure automation and container orchestration with Kubernetes, Docker, and CI/CD pipelines. Collaborate with internal stakeholders and clients to understand requirements, evangelize platform solutions, and ensure successful delivery. Key Skills and Expertise Cloud and DevOps: Expertise in AWS, Azure, GCP especially VPC design, cloud databases, and serverless architecture. Certified in AWS Professional Solution Architect, AWS ML Specialty, or Azure Solutions Architect Expert (preferred). Proficient with Kubernetes, Docker, FluentD, Kibana, Grafana, Prometheus. Data and Streaming: Experience with OLTP/OLAP databases and cloud-native data warehouses like BigQuery, Aurora, Spanner. Hands-on with Kafka, Apache Flink, Spark, Airflow, Databricks, Apache Iceberg, Presto. AI/ML & LLM Expertise: In-depth understanding of LLMs (GPT, Gemini, Claude, Mixtral, Llama, Hugging Face OSS models). LLMOps frameworks: Langchain, Langgraph, Langflow, Flowise, LLamaIndex. ML lifecycle tools: MLFlow, SageMaker, Vertex AI, Azure AI, AWS Bedrock. Proven experience in model optimization, fine-tuning, and high-throughput inference systems. Programming Languages: Proficient in Python, SQL, and JavaScript. Preferred Qualifications 10+ years in cloud and AI/ML platform architecture roles. Experience delivering AI solutions for enterprise-scale clients. Hands-on experience with GPU architecture and parallel/distributed training. Strong communication skills with ability to influence technical and business stakeholders. Work on cutting-edge AI technologies and shape future product experiences used by millions. Collaborate with world-class engineers and scientists in a diverse, inclusive culture. Be part of a company that values creativity, innovation, and employee well-being. Adobe is proud to be an Equal Opportunity Employer. We welcome and encourage candidates from all backgrounds to apply.
Gen Ai Engineer - L1
Wipro Limited
Gen AI Engineer - L1 | Bengaluru, India About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global IT consulting and services company with over 230,000 employees across 65 countries. We offer innovative solutions in consulting, engineering, and operations to solve complex digital transformation challenges. Role Overview We are looking for an experienced Gen AI Engineer - L1 with deep expertise in Generative AI, LLMs, RAG pipelines, and Python-based machine learning frameworks. The role will focus on developing secure, scalable AI systems using modern tools and cloud platforms. Key Responsibilities Design, implement, and optimize generative AI models using LangChain, LLaMA, Hugging Face, etc. Develop RAG pipelines and integrate with LLMs for advanced AI solutions. Create, test, and optimize prompt templates across different base models. Implement guardrails for prompt security to prevent prompt injection, jailbreaks, and leaks. Build efficient backend applications using Python, Django, and related tools. Work with vector databases to enhance generative AI workflows. Collaborate on data grooming and model training across business units. Benchmark model performance and develop auto-prompting systems. Ensure adherence to minimum design standards in prompt engineering use cases. Mandatory Skills Gen AI, LLMs, RAG Pipelines LangChain, LLaMA, Hugging Face Python, TensorFlow, PyTorch, Django NLP, Machine Learning, Deep Learning Vector Database integration Preferred Skills Azure or AWS Cloud Platforms MLOps, Kubernetes GitHub, Bitbucket Experience with GPT-4 Domain exposure in Banking or Financial Services Join Wipro to be part of a company that thrives on innovation, reinvention, and digital excellence. Work on impactful GenAI projects, grow your career, and contribute to shaping the future of AI in real-world applications. We welcome applications from individuals with disabilities.
Gen Ai Aws Engineer
Wipro Limited
Gen AI AWS Engineer | Bengaluru, India Experience: 8 10 Years Mandatory Skills: AWS, Generative AI About Wipro Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a global leader in consulting and technology services with over 230,000 employees across 65 countries. We provide digital transformation solutions that are innovative, sustainable, and scalable. Role Overview We are hiring a highly skilled Gen AI AWS Engineer to lead the architecture, development, and delivery of AI-driven cloud solutions. This role focuses on leveraging AWS infrastructure and Generative AI technologies to solve complex business problems at scale. Key Responsibilities Architecture and Solution Design Design and implement scalable AI/ML solutions using AWS and GenAI frameworks. Lead solution design for enterprise initiatives and RFPs. Develop architectural documentation and reusable design patterns. Evaluate current architectures and recommend improvements and modernization plans. Delivery Enablement Provide frameworks and technical direction to development teams. Monitor project risks and propose mitigation strategies. Ensure alignment with architecture principles and best practices. Client Engagement Participate in pre-sales activities and client presentations. Build trusted relationships with clients by demonstrating technical thought leadership. Coordinate with stakeholders to ensure successful delivery of AI solutions. Innovation & Competency Building Create PoCs, whitepapers, and solution demos in GenAI and AWS domains. Represent Wipro s capabilities at industry events and internal forums. Mentor junior architects and contribute to internal upskilling initiatives. Team Management Recruit, train, and retain high-performing AI/ML engineers. Conduct performance reviews and set goals for team members. Drive employee engagement and diversity across the architecture team. Be part of a company that thrives on innovation, reinvention, and purpose. Work with cutting-edge Generative AI solutions and lead impactful transformations for global clients. Applications from individuals with disabilities are explicitly welcome.
Senior Machine Learning Engineer
Chevron Corporation
Senior Machine Learning Engineer Location: Bengaluru, India Company: Chevron Experience Required: 5 10 Years Department: AI/ML Engineering Work Mode: Hybrid / Global Operations Support About the Role Chevron is actively seeking a Senior Machine Learning Engineer to join our cutting-edge AI team. You will be responsible for designing, building, and optimizing advanced machine learning systems that power transformative applications in artificial intelligence. In this role, you will develop self-learning applications and refine AI systems using robust engineering, statistics, and software design practices. Key Responsibilities Study and transform data science prototypes into production-ready systems Design scalable and robust machine learning systems Research and implement modern ML algorithms and tools Develop ML applications based on business and technical requirements Select appropriate datasets, data pipelines, and data representation techniques Run experiments and evaluate results to fine-tune models Perform statistical analysis and ML performance optimization Train and retrain models as new data becomes available Extend and customize existing ML libraries and frameworks Stay up to date with the latest ML research and trends Required Qualifications 5 10 years of proven experience as a Machine Learning Engineer or in a similar role Hands-on experience with Azure Machine Learning and MLOps Strong skills in data structures, data modeling, and software architecture Deep knowledge of math, probability, statistics, and algorithm design Proficient in programming languages: Python, R Familiarity with ML frameworks and libraries such as Keras, PyTorch, scikit-learn Excellent analytical and problem-solving skills Strong communication and teamwork abilities Working Hours Chevron supports international teams. Work hours are scheduled to align with global collaboration: Work Days: Monday to Friday Shift Options: 8:00 AM 5:00 PM or 1:30 PM 10:30 PM IST Opportunity to work on impactful ML/AI solutions at enterprise scale Flexible work culture with global exposure Advanced tools, infrastructure, and data at your fingertips Professional growth in a forward-thinking, innovation-driven environment Equal Opportunity Statement Chevron is an equal opportunity employer and adheres to inclusive hiring practices. All qualified candidates will receive consideration without regard to race, gender, age, religion, nationality, sexual orientation, disability, or any other protected status. Chevron also participates in E-Verify as required by law in applicable jurisdictions. Apply Today
Engineer, Principal/manager - Machine Learning, Ai
Qualcomm India Private Limited
Engineer, Principal/Manager - Machine Learning, AI Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary Qualcomm is seeking an experienced and visionary Principal AI/ML Engineer to lead research, development, and optimization of AI inference systems. This role involves developing high-performance AI models, optimizing deployments across various hardware platforms, and contributing to research in model compression, quantization, and hardware-aware optimization. Education & Experience PhD with 6+ years, Master's with 7+ years, or Bachelor's with 8+ years in Engineering, CS, or related field. 20+ years of experience in AI/ML development; 5+ years in inference optimization and debugging. Key Responsibilities Model Optimization & Quantization Optimize models using quantization (INT8, INT4, mixed precision), pruning, and knowledge distillation. Implement PTQ and QAT techniques for deployment. Experience with TensorRT, ONNX Runtime, OpenVINO, TVM. AI Hardware Acceleration & Deployment Target platforms: Hexagon DSP, CUDA GPUs, TPUs, NPUs, FPGAs, Habana Gaudi, Apple Neural Engine. Use Python APIs: cuDNN, XLA, MLIR for hardware acceleration. Benchmark and debug performance across platforms. AI Research & Innovation Research on efficient AI inference: model compression, low-bit precision, sparse computing. Explore architectures like Sparse Transformers, Mixture of Experts, Flash Attention. Publish in ML conferences: NeurIPS, ICML, CVPR; contribute to open-source projects. Technical Expertise Optimization of LLMs, LMMs, LVMs for inference. Deep Learning frameworks: TensorFlow, PyTorch, JAX, ONNX. Expert in CUDA, cuPy, Numba, TensorRT, ONNX Runtime, OpenVINO. Skilled in Python for scalable AI development. Experience with ML runtime delegates: TFLite, ONNX, Qualcomm AI Stack. Debugging: Netron, TensorBoard, PyTorch Profiler, Nsight, perf, Py-Spy. Cloud inference: AWS Inferentia, Azure ML, GCP AI Platform, Habana Gaudi. Hardware-aware optimization: oneDNN, ROCm, MLIR, SparseML. Contributions to open-source and research publications are a strong plus. Leadership & Collaboration Lead a team of engineers in Python-based AI inference and optimization. Collaborate with researchers, software engineers, DevOps, and hardware vendors. Define debugging, deployment, and performance tuning best practices.
Senior Engineer - Ai/ml, C/c++
Qualcomm India Private Limited
Senior Engineer AI/ML, C/C++ Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary As a Qualcomm Software Engineer, you will design, develop, modify, and validate embedded and edge cloud software or specialized utility programs that launch cutting-edge, world-class products. Collaboration is key in this role, working across systems, hardware, test, and architecture teams to meet software performance and interface requirements. About the Role We are seeking a talented AI/ML Engineer with a solid background in AI/ML, C/C++, and operating systems. The ideal candidate has 3 to 5 years of experience in machine learning development and implementation and will be responsible for building and deploying AI gateway solutions that promote innovation and operational efficiency. Required Qualifications Bachelor's or Master s degree in Computer Science, Engineering, or related field. 3 to 5 years of experience in AI/ML development. Technical Skills Strong proficiency in C and C++ programming. Deep understanding of operating system internals and functions. Experience with ML frameworks like TensorFlow, PyTorch, or equivalent. Strong grasp of data structures, algorithms, and software design patterns. Experience in data preprocessing techniques and related tools. Familiarity with Git and version control best practices. Soft Skills Excellent analytical and problem-solving skills. Strong communication and collaboration abilities. Self-motivated with the ability to work independently and as part of a team. Adaptable and eager to stay updated with evolving technologies. Qualification : Bachelor's or Masters degree in Computer Science, Engineering, or related field.
Staff/senior Staff Ai Developer Advocate
Qualcomm India Private Limited
Staff/Senior Staff AI Developer Advocate Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary We are looking for Developer Advocates to enable developers building with generative AI and AI-driven hardware applications. You will engage with the community and create resources to onboard developers on our platforms. Qualcomm products touch multiple industries, including mobile, laptops, mixed-reality, robotics, and industrial IoT. You will work closely with product, engineering, and regional sales teams to drive awareness and engagement for our platforms. You are a builder who loves writing code and integrating AI models into applications. Whether it's language-based use cases, computer vision, or audio, you can effortlessly integrate open-source models (large or small) and distilled models into applications. You are also a community builder, engaging, ideating, and helping others realize their development goals. Your contributions and insights from the community will directly impact product improvements, drive feature prioritization, and help create a repository of community-contributed sample applications, tutorials, and content. You will engage with community builders and influencers to build ecosystems that encourage constant collaboration. Responsibilities Engage with external developers in at least one of the following application areas: IoT, Automotive, Microsoft device ecosystem. Collaborate across software, hardware engineering, developer marketing, and product management teams. Understand trends in ML model design and workflow through academic research and developer engagements. Ensure comprehensive sample applications for AI on Linux/Windows using Snapdragon to cover a variety of models and use cases. Interface with 3rd party developers and internal teams to create easy-to-use sample applications and documentation for Windows on Snapdragon. Contribute new features and designs to the Qualcomm AI toolkit to enhance the developer workflow. Minimum Qualifications Bachelor's or advanced degree in computer science, artificial intelligence, or a related field. 6+ years of software engineering, systems engineering, or related work experience. Preferred Qualifications Excellent understanding of AI frameworks (e.g., TensorFlow, PyTorch), GPU programming, and parallel computing. Experience with large language models/foundational models is a plus. Good understanding of the complete AI software stack and AI performance tuning techniques on GPU, NPU-based systems. Experience in developing end-to-end AI applications on Windows using Windows ML, DirectML. Experience with training and deploying models on servers and porting them to client Windows compute platforms, including inference deployment and performance tuning. Proficiency in programming languages such as Python and C++. Excellent communication skills, with the ability to articulate complex technical concepts to both technical and non-technical stakeholders. Strong leadership abilities to guide development teams. Attention to detail with strong problem-solving, analytical, and debugging skills. Ability to adapt quickly and learn in a fast-changing environment. Familiarity with software development methodologies, version control systems, and agile project management practices. 12+ years of application development experience, with 5+ years in AI application development on Windows. Bachelor's degree in Computer Science or Electrical Engineering. Qualification : Bachelor's or advanced degree in computer science, artificial intelligence, or a related field.
Research Engineer
International Business Machines
Research Engineer Location: Bangalore, Karnataka, India Job Type: Full-Time Experience Level: 0-8 years Company: IBM Research India (IRL) Introduction: IBM Research is the innovation engine of IBM and is the largest industrial research organization in the world. With 12 labs across 6 continents and over 3200 researchers globally, we produce more patents daily than any other organization. At IBM Research India (IRL), we are shaping the future of computing in areas like AI, Hybrid Cloud, and Quantum Computing. Our work is at the forefront of breakthrough innovations in Foundation Models, AI systems, large-scale data engineering, and more. We are looking for top talent to join us in our exciting and dynamic projects, pushing the boundaries of innovation. As a Research Engineer, you will work on pioneering research and development in the most cutting-edge fields of AI and computing. Role Overview: The Research Engineer role at IBM India Research Lab (IRL) involves working on challenging, dynamic, and highly innovative projects in the fields of AI, machine learning, and data systems. Your responsibilities will span multiple areas including optimizing AI models for large-scale distributed systems, pre-training foundation models, and developing real-world use cases that leverage IBM s infrastructure and models. Key Responsibilities: Optimized Runtime Stacks for Foundation Models: Work on fine-tuning, inference serving, and large-scale data engineering for AI models. Focus on multi-stage tuning, reinforcement learning, inference-time compute, and preparing data for complex AI systems. Model Optimization Across Accelerators: Develop solutions to optimize models for multi-accelerator environments, particularly focusing on IBM s AIU accelerator. Work on compiler optimizations, specialized kernels, libraries, and tools to enhance model performance. Pre-training and Deployment of Foundation Models: Participate in pre-training language models and multi-modal foundation models. Work on distributed training procedures, model alignment, and creating pipelines for various tasks, including LLM-generated data pipelines. Research and Use Case Development: Develop and implement use cases that effectively leverage infrastructure and models to drive real-world value. Contribute to creating frameworks for human-data collection and deploying models on user-centric platforms. Required Education and Experience: Education: A Master s degree in Computer Science, AI, or related fields from a top institution. Experience: 0-8 years of experience working with modern ML techniques, including but not limited to model architectures, data processing, fine-tuning techniques, reinforcement learning, distributed training, and inference optimizations. Technical Skills: Experience with big data platforms such as Ray and Spark. Experience with Pytorch FSDP and HuggingFace libraries. Proficiency in programming with Python or web development technologies. Mindset and Attitude: A growth mindset and pragmatic approach to problem-solving. Preferred Experience: Research Experience: Peer-reviewed research at top machine learning or systems conferences. Advanced Technical Skills: Experience working with pytorch.compile, CUDA, Triton kernels, GPU scheduling, and memory management. Open Source Contributions: Experience working within open-source communities, contributing to or developing open-source projects. Innovative Environment: Be at the forefront of technological innovation, working on cutting-edge projects in AI, quantum computing, and more. Global Impact: Work on projects that influence both academic research and commercial product development, making a global impact. Career Development: IBM offers abundant opportunities for learning and growth, with access to the latest technologies and research. Collaborative Culture: Work with a diverse team of world-class researchers and engineers in a collaborative, open-source-driven environment. Apply today and become a part of the team that s redefining innovation. Qualification : A Masters degree in Computer Science, AI, or related fields from a top institution.
Ai Engineer
Trellissoft Engineering Services Pvt Ltd
Job Title: AI/ML Engineer (LLM-driven Products) Location: Bengaluru, Karnataka Experience: 3+ years Work Modality: Full-time (Work from office) Job Description: We are seeking an AI/ML Engineer to help develop LLM-driven products from the ground up. The ideal candidate will have a strong programming background and experience working with Transformer architecture to design cutting-edge AI systems. If you're passionate about implementing scalable AI solutions and driving innovation, we would love to have you join our team! As part of the team, you will work on transformer models like BERT, GPT, T5, and other small LLMs for Natural Language Processing (NLP) and Computer Vision tasks. You will have the opportunity to work on impactful AI solutions that are designed to scale globally. Key Responsibilities: LLM Product Development: Design and develop products powered by large language models (LLMs), ensuring they meet the technical requirements and scale for global deployment. Model Fine-tuning & Optimization: Fine-tune transformer models for tasks such as text classification, summarization, image generation, and recognition. Implement optimization techniques to accelerate model performance, including GPU optimization and model quantization. AI Solutions Implementation: Translate AI research into actionable product features, ensuring AI models are implemented effectively to solve real-world problems. Collaboration & Communication: Work closely with cross-functional teams to integrate AI models and solutions into larger products. Communicate complex technical concepts to non-technical stakeholders. Model Deployment: Deploy models using frameworks like Flask, FastAPI, or through cloud-based inference services. Data Preprocessing & Training: Engage in data preprocessing and feature engineering to improve the performance of AI models. Required Qualifications: Experience: 3+ years of hands-on experience in AI/ML, specifically working with Transformer-based models like BERT, GPT, T5, ViTs, or small LLMs. Technical Skills: Strong proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow. In-depth understanding of Transformer architecture and its applications in NLP (text classification, summarization) and Computer Vision (image generation, recognition). Experience in deploying models using frameworks like Flask, FastAPI, or cloud-based inference services. Familiarity with GPU acceleration, model optimization, and model quantization. Proficient in data preprocessing, feature engineering, and training workflows. Analytical Skills: Ability to independently analyze open-source code repositories and leverage existing models for further optimization. What We Offer: Competitive Salary: Attractive salary based on experience and expertise. Innovative Work Environment: Work on cutting-edge AI and machine learning solutions with the opportunity to shape innovative products. Career Growth: Opportunities to advance your career in AI/ML and work with a team of passionate professionals. Comprehensive Benefits: A benefits package designed to support your overall well-being and work-life balance.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted