Onnx Jobs in Bengaluru
6 Jobs Found
Senior Engineer - Ai/ml, C/c++
Qualcomm India Private Limited
Senior Engineer AI/ML, C/C++ Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary As a Qualcomm Software Engineer, you will design, develop, modify, and validate embedded and edge cloud software or specialized utility programs that launch cutting-edge, world-class products. Collaboration is key in this role, working across systems, hardware, test, and architecture teams to meet software performance and interface requirements. About the Role We are seeking a talented AI/ML Engineer with a solid background in AI/ML, C/C++, and operating systems. The ideal candidate has 3 to 5 years of experience in machine learning development and implementation and will be responsible for building and deploying AI gateway solutions that promote innovation and operational efficiency. Required Qualifications Bachelor's or Master s degree in Computer Science, Engineering, or related field. 3 to 5 years of experience in AI/ML development. Technical Skills Strong proficiency in C and C++ programming. Deep understanding of operating system internals and functions. Experience with ML frameworks like TensorFlow, PyTorch, or equivalent. Strong grasp of data structures, algorithms, and software design patterns. Experience in data preprocessing techniques and related tools. Familiarity with Git and version control best practices. Soft Skills Excellent analytical and problem-solving skills. Strong communication and collaboration abilities. Self-motivated with the ability to work independently and as part of a team. Adaptable and eager to stay updated with evolving technologies. Qualification : Bachelor's or Masters degree in Computer Science, Engineering, or related field.
System Software Architect, Programmable Vision Accelerator
Nvidia
We are looking for a System Software Architect Programmable Vision Accelerator. As the market leader in deep learning and parallel computing, NVIDIA is seeking an expert system software architect to lead the design and implementation of firmware and driver stack for NVIDIA's Programmable Vision Accelerator (PVA) engine in the Tegra SoC platform. As a Software Architect, you will join a team of software engineers to create and evolve an essential part of the software stack responsible for scheduling and execution of highly optimized computer vision and machine learning kernels for specialized DSP hardware. You will use your design abilities, coding expertise, and creativity to help deliver innovative real-time firmware and kernel mode drivers for a low power, high performance computer vision accelerator engine. You will be architecting and developing new features and improvements to realize the groundbreaking potential of NVIDIA mobile systems, ranging from self-driving cars, intelligent video analytics and autonomous mobile robotics. You will need to demonstrate excellent technical leadership, communication, interpersonal, and analytical skills as well as a real passion for performance-oriented software engineering. If this sounds like a fun challenge, we want to hear from you! What you will be doing: Evolve and define software architecture for future NVIDIA's Programmable Vision Accelerator (PVA) chips and enhance the functionality of currently shipping products. Design and write custom embedded software for PVA engine to meet product and hardware requirements at the SoC level. Help defining forward-looking strategy and improvements to the PVA algorithms and system architecture. Review hardware specifications and map algorithms to the architecture. Participate in the bring-up of the new generation of the world's most advanced SoC. Collaborate closely with other teams and software/hardware architects across NVIDIA to support the architecture, design, creation, integration, and validation of PVA software under a common SoC umbrella. Provide technical support and guidance for internal and external customers. Mentor and guide technical development of the less experienced team members What we need to see: College degree (preferably PhD or MS) in Electrical Engineering, Computer Engineering, Computer Science, or equivalent experience 10+ years of working experience in embedded industry, including 5+ years in technical leadership role Deep understanding of SoC principles, general systems architectures, operating systems, device drivers, memory management, multithreading, and real-time scheduling. Deep understanding and working experience with embedded technologies including DSP, computer vision and image/signal processing. Excellent software development skills (C, C++) and outstanding problem-solving capabilities. Proven expertise in architecting embedded software and development of highly optimized code for DSP, SIMD and/or VLIW processors Experience with embedded Linux and/or QNX. Outstanding interpersonal skills with ability to work in a global and diverse team operating in a fast-paced environment. Good understanding of safety-critical software principles with experience in automotive or other highly regulated industries Ways to stand out from the crowd: Experience with ISO 26262 and IEC 61508 or equivalent quality/safety processes. Understanding of software safety and safety development processes is a major plus. Consistent record to effectively guide and influence in a technically strong dynamic environment. NVIDIA is widely considered to be one of the technology world s most desirable employers. We have some of the most forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you. NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence. Qualification : College degree (preferably PhD or MS) in Electrical Engineering, Computer Engineering, Computer Science, or equivalent experience
Machine Learning Engineer - Speech Ai (asr & Tts)
Sarvam
Machine Learning Engineer - Speech AI (ASR & TTS) Location: Bengaluru, Karnataka, India (On-Site) Department: Engineering Employment Type: Full-Time About Sarvam.ai Sarvam.ai is a pioneering generative AI startup headquartered in Bengaluru, India. We specialize in leading transformative research and development in speech and language technologies. Focused on building state-of-the-art ASR (Automatic Speech Recognition) and TTS (Text-to-Speech) models, particularly for Indic languages, we aim to redefine human-computer interaction with cutting-edge, AI-driven solutions. Join us as we push the boundaries of Speech AI to create inclusive, scalable, and intelligent voice-based applications for diverse communities worldwide. Role Overview We are seeking an experienced Machine Learning Engineer specializing in Speech AI (ASR & TTS). The ideal candidate will work on deep learning-based ASR and TTS models, improving accuracy, efficiency, and multilingual capabilities while deploying them at scale. The role involves developing and optimizing speech recognition and synthesis models with a focus on low-resource languages, real-time inference, and scalability. If you have a passion for speech processing and deep learning, this is a great opportunity to make a significant impact in a rapidly growing field. Key Responsibilities ASR (Automatic Speech Recognition) Develop, train, and optimize speech-to-text models using state-of-the-art architectures like Wav2Vec, Whisper, Conformer, and DeepSpeech. Implement techniques for low-latency ASR inference, including beam search, language model integration, and real-time transcription. Improve speech recognition accuracy for low-resource languages, especially Indic languages, using transfer learning and data augmentation. Optimize ASR pipelines for noise robustness, speaker adaptation, and domain-specific transcription. TTS (Text-to-Speech) Develop and fine-tune neural TTS models such as Tacotron, FastSpeech, VITS, or WaveNet for high-quality, natural-sounding speech synthesis. Implement multilingual and expressive TTS models with prosody and emotion control. Optimize TTS inference for deployment on edge devices, mobile, and cloud platforms. Improve speech synthesis quality through voice cloning, neural vocoders (HiFi-GAN, WaveGlow), and prosody modeling. General Speech AI Responsibilities Benchmark and profile ASR/TTS models to improve latency, efficiency, and deployment performance. Deploy scalable speech AI APIs on AWS, Azure, or GCP for real-world applications. Optimize ASR & TTS models for edge and offline inference. Stay updated with the latest advancements in speech AI, neural vocoders, and real-time inference techniques. Must-Have Qualifications Experience: 2-3 years of experience in speech AI, deep learning, or machine learning, with a focus on ASR & TTS. Education: Bachelor s or Master s degree in Computer Science, AI/ML, Speech Processing, or a related field. ML Frameworks: Proficiency in PyTorch or TensorFlow for training and deploying ASR/TTS models. ASR Expertise: Experience with speech-to-text architectures like Whisper, Wav2Vec, Conformer, or DeepSpeech. TTS Expertise: Experience with speech synthesis models like Tacotron, FastSpeech, or VITS. Speech Signal Processing: Understanding of MFCCs, STFT, phonemes, prosody modeling, and feature extraction. Inference Optimization: Hands-on experience with TensorRT, ONNX, or quantization (INT8, FP16) for ASR/TTS. Cloud & Edge Deployment: Experience deploying speech models on AWS, GCP, or Azure. Preferred Qualifications Experience with speech diarization, speaker recognition, or language modeling for ASR. Familiarity with zero-shot TTS, voice cloning, and multilingual speech modeling. Understanding of CUDA optimization and low-bit quantization for ASR/TTS models. Contributions to open-source speech AI projects or a strong GitHub portfolio showcasing relevant work. Experience with real-time streaming ASR/TTS applications and low-latency inference. Innovative Impact: Work on AI-driven speech solutions that are changing how people interact with technology, especially in low-resource languages. Cutting-Edge Technology: Contribute to the development of state-of-the-art speech AI models in a rapidly advancing field. Collaborative Environment: Work with a team of experts in AI, machine learning, and speech processing, in a startup culture. Growth Opportunities: Sarvam.ai offers exciting career growth in a fast-paced environment with opportunities for personal and professional development. Interested candidates are invited to submit their resume, cover letter, and any relevant project portfolios or GitHub links showcasing their experience in ASR, TTS, or Speech AI. Strong AI-related projects, whether in industry, research, or personal work, will be highly valued. Qualification : Bachelors or Masters degree in Computer Science, AI/ML, Speech Processing, or a related field.
Engineer, Principal/manager - Machine Learning, Ai
Qualcomm India Private Limited
Engineer, Principal/Manager - Machine Learning, AI Location: Bangalore, Karnataka, India Company: Qualcomm India Private Limited General Summary Qualcomm is seeking an experienced and visionary Principal AI/ML Engineer to lead research, development, and optimization of AI inference systems. This role involves developing high-performance AI models, optimizing deployments across various hardware platforms, and contributing to research in model compression, quantization, and hardware-aware optimization. Education & Experience PhD with 6+ years, Master's with 7+ years, or Bachelor's with 8+ years in Engineering, CS, or related field. 20+ years of experience in AI/ML development; 5+ years in inference optimization and debugging. Key Responsibilities Model Optimization & Quantization Optimize models using quantization (INT8, INT4, mixed precision), pruning, and knowledge distillation. Implement PTQ and QAT techniques for deployment. Experience with TensorRT, ONNX Runtime, OpenVINO, TVM. AI Hardware Acceleration & Deployment Target platforms: Hexagon DSP, CUDA GPUs, TPUs, NPUs, FPGAs, Habana Gaudi, Apple Neural Engine. Use Python APIs: cuDNN, XLA, MLIR for hardware acceleration. Benchmark and debug performance across platforms. AI Research & Innovation Research on efficient AI inference: model compression, low-bit precision, sparse computing. Explore architectures like Sparse Transformers, Mixture of Experts, Flash Attention. Publish in ML conferences: NeurIPS, ICML, CVPR; contribute to open-source projects. Technical Expertise Optimization of LLMs, LMMs, LVMs for inference. Deep Learning frameworks: TensorFlow, PyTorch, JAX, ONNX. Expert in CUDA, cuPy, Numba, TensorRT, ONNX Runtime, OpenVINO. Skilled in Python for scalable AI development. Experience with ML runtime delegates: TFLite, ONNX, Qualcomm AI Stack. Debugging: Netron, TensorBoard, PyTorch Profiler, Nsight, perf, Py-Spy. Cloud inference: AWS Inferentia, Azure ML, GCP AI Platform, Habana Gaudi. Hardware-aware optimization: oneDNN, ROCm, MLIR, SparseML. Contributions to open-source and research publications are a strong plus. Leadership & Collaboration Lead a team of engineers in Python-based AI inference and optimization. Collaborate with researchers, software engineers, DevOps, and hardware vendors. Define debugging, deployment, and performance tuning best practices.
Engineer/sr Engineer/staff - System Solution Ai Center Of Excellence
Qualcomm
General Summary: Join the Qualcomm AI Systems Solution CoE (Center of Excellence) team to develop cutting-edge products and solutions leveraging Qualcomm s high-performance inference accelerators for cloud, edge, and hybrid AI applications. This role focuses on designing and delivering AI/ML system software solutions for diverse industries, including automotive, cloud, and industrial IoT. You will work on optimizing ML models (including GenAI, LLMs, LVMs, and LMMs) for deployment across a variety of devices, including phones, AI PCs, and edge appliances. This is your opportunity to contribute to Qualcomm s energy-efficient AI compute systems and collaborate with cross-functional teams to develop solutions that exceed industry standards. Key Responsibilities: Develop system solutions for AI/ML requirements, including model optimization, inference graph tuning, and scalable deployment across various devices. Work with deep learning frameworks like PyTorch, TensorFlow, and ONNX. Optimize AI/ML models using techniques such as quantization, pruning, and compression. Define and evaluate performance and accuracy metrics for neural network models, including CV, NLP, and multi-modal architectures. Collaborate across functional teams to ensure scalable, efficient deployments on inference accelerators and edge devices. Monitor and analyze industry trends and competitor toolchains to innovate and enhance system solutions. Develop and debug high-quality software in Python and C++, leveraging object-oriented design principles. Lead projects to improve toolchains and develop innovative product features. Required Skills and Experience: Hands-on experience with deep learning frameworks: PyTorch, TensorFlow, and ONNX. Strong understanding of model architectures (CV, NLP, LLMs, and multi-modal networks). Proficiency in Python and C++ (skill level: 7/10 or higher). Expertise in data structures and algorithms (skill level: 7/10 or higher). Experience in inference graph optimization, quantization-aware training, and deployment of AI/ML models. Knowledge of AI edge and server systems, infrastructure, and industry standards. Familiarity with ONNX Runtime, PyTorch runtime, and TensorFlow runtime. Experience in competitor toolchain surveys and emerging trends in AI/ML. Excellent debugging, analytical, and development skills. Desirable Skills: Experience with GPUs, machine learning accelerators, and related software. Familiarity with ML compilers like TVM, GLOW, and XLA. Hands-on use of version control tools like Git and GitHub. Knowledge of LLM fine-tuning and advanced training techniques. Qualifications: Bachelor s, Master s, or PhD degree in Engineering, AI/ML, Computer Science, or a related field. Work Experience: Bachelor s degree with 3+ years of relevant experience. Master s degree with 2+ years of relevant experience. PhD with 1+ year of relevant experience. Qualification : Bachelor's / Masters/ PHD degree in Engineering, Machine learning/ AI, Information Systems, Computer Science, or related field.
Senior Manager - Technical Solutions (spark)
Databricks
As a Senior Manager of the Spark Technical Solutions team, you will lead & manage a team of Technical Solution Engineers (Spark) and be responsible for driving deep dive technical solutions for any issues reported by Databricks customers. We expect the manager to resolve challenges with comprehensive technical and customer communication skills. You will assist our customers in their Databricks journey and provide them with the guidance, knowledge, and expertise that they need to realise value and achieve their strategic objectives using our products. The impact you will have: As a manager and member of the leadership team, you will be directly responsible for the management of Technical solution engineers, team leads and operations personnel Responsible for directly monitoring, reporting, and driving improvements to team-level metrics and KPIs, acting as an escalation point with customers and internal teams, and optimising and developing support processes and tools Responsible for working across multiple cross functional teams that include Engineering, product management, sales and customer success; manage Hiring, mentoring and onboarding new support engineers Regularly meet one-on-one with your direct reports, conducting annual reviews and career development discussions throughout the year Be a hands on manager to assist the team members in resolving issues related to Spark core internals, Spark SQL, Structured Streaming, Delta, Lakehouse and other databricks runtime features Manage and drive best practices guidance around Spark runtime performance and usage of Spark core libraries and APIs for custom-built solutions developed by Databricks customers; contribute in the development of tools/automation initiatives Own Engineering JIRA tickets and proactively work to bring quicker resolutions to customer reported issues; participate in creation of knowledge base articles Participate in weekend and weekday on-call rotation and run escalations during databricks runtime outages, incident situations, ability to multitask and plan day 2 day activities and provide escalated level of support for critical customer operational issues, etc What we look for: Min 10-12 years of experience in designing, building, testing, and maintaining Python/Java/Scala/Spark based applications in a typical project delivery and consulting environments with 4+ years working as a Manager 5+ years of hands-on experience in developing and leading any two or more of the Big Data, Hadoop, Spark,Machine Learning, Artificial Intelligence, Streaming, Kafka, Data Science, ElasticSearch related industry use cases at the production scale. Big Data / Spark hands on-experience is mandatory Hands on experience in the performance tuning/troubleshooting of Hive and Spark based applications at production scale. Real time experience in JVM and Memory Management techniques such as Garbage collections, Heap/Thread Dump Analysis is preferred Working and hands-on experience with Data lakes and any SQL-based databases, Data Warehousing/ETL technologies like Informatica, DataStage, Oracle, Teradata, SQL Server, MySQL is preferred Hands-on experience with AWS or Azure or GCP is preferred Experience in implementing CI/CD, Monitoring/alerting for Production Systems Technical lead in design, implementation and support of large scale data and analytics solutions that are highly reliable, flexible, and scalable Experience in leading and managing end-to-end projects and have reported and escalated to top levels Experience in managing and leading teams in an organisation involving multiple reporting lines About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted