Hadoop Ecosystem Jobs in Bengaluru
234 Jobs Found
Mechatronics & Bigdata Scientist Developer
Bharat Fritz Werner
Position: Mechatronics & Big Data Scientist Developer Department: Research & Development Reporting To: General Manager Location: Bengaluru Key Responsibilities Machine Learning: Select features, build, and optimize classifiers using advanced machine learning techniques. Data Mining: Perform data mining using state-of-the-art methods to extract valuable insights from large datasets. Data Enhancement: Extend the company s datasets with third-party data sources when necessary to improve model accuracy and relevance. Data Collection & Processing: Improve data collection procedures to include all necessary information for building analytic systems. Data Cleansing & Integrity: Process, cleanse, and verify the integrity of data used for analysis to ensure reliable results. Ad-hoc Analysis: Perform ad-hoc analysis as needed, presenting the results in a clear, actionable manner. Anomaly Detection: Design and implement automated anomaly detection systems, tracking their performance over time to ensure accuracy. Behavioral Competencies Data-Driven: Strong inclination toward working with data and applying analytical thinking to solve complex problems. Detail-Oriented: Meticulous in data analysis and system development to ensure quality and precision in results. Skills and Expertise Machine Learning Algorithms Strong understanding of machine learning techniques and algorithms such as k-NN, Naive Bayes, SVM, Decision Forests, etc. Data Science Tools Experience with common data science toolkits like R, Weka, NumPy, and MatLab. Proficiency in at least one (preferably NumPy or R) is highly desirable. Data Visualization Skilled in data visualization tools such as D3.js, GGplot, or similar. Database Management Experience with query languages such as SQL, Hive, Pig, NiFi, or others depending on the company s stack. Familiarity with NoSQL databases like InfluxDB, MongoDB, Cassandra, HBase. Statistical Analysis Strong applied statistics skills, including distributions, statistical testing, and regression analysis. Programming Skills Good scripting and programming skills in languages like PHP, Slim, SQL, and Laravel. Big Data Technologies Knowledge of Hadoop, HDFS, NiFi, and other big data platforms and technologies. Qualifications Essential: MTech, MS, or equivalent in Mechatronics, Computer Science, or a related field. Experience: Minimum 2 years of hands-on experience in developing SDKs and working with Big Data platforms. Proven track record in machine learning, data mining, and data science projects. Qualification : MTech, MS, or equivalent in Mechatronics, Computer Science, or a related field
Data Engineering Lead
Fampay
Data Engineering Lead Bengaluru | Engineering | Full-Time About Fam (formerly FamPay) Fam is India s first payments app designed for everyone aged 11 and above. FamApp enables seamless online and offline payments through UPI and FamCard. Our mission is to empower over **250 million young Indians** to start their financial journey early, becoming financially aware and confident. Founded in 2019 by IIT Roorkee alumni, Fam is backed by top-tier investors including Elevation Capital, Y-Combinator, Peak XV (Sequoia Capital India), Venture Highway, and angels like Kunal Shah and Amrish Rao. About the Role We re looking for a visionary **Data Engineering Lead** to take **end-to-end ownership** of Fam s data ecosystem from data ingestion and storage to processing and delivering actionable insights. You ll **define the data strategy and architecture** that supports both batch and **real-time** use cases, ensuring scalability, reliability, and governance across the organization. You will be instrumental in enabling accurate, complete, and trusted data flow that powers business intelligence, analytics, and product decision-making. This role involves **leadership, strategic thinking**, and hands-on problem solving. What You ll Do Own the full data lifecycle: ingestion, organization, storage, processing, and presentation. Define and execute **data architecture and strategy** aligned with operational and analytical goals. Build **scalable, reliable, and observable data systems** supporting batch and near real-time processing. Ensure **data quality, governance, and compliance**, proactively resolving discrepancies. Collaborate with product, engineering, and business teams to define, track, and optimize key metrics. Anticipate data-related challenges and implement preventive solutions. Lead, mentor, and grow the data engineering team, fostering innovation and accountability. Must-Haves 10+ years experience in data engineering, including proven leadership of teams or projects. Expertise designing, building, and scaling end-to-end data pipelines and systems. Deep understanding of the data lifecycle, from ingestion through business reporting. Strong communication skills and ability to collaborate across technical and business teams. Solid knowledge of **data governance, quality assurance, and compliance standards**. Experience with observability and proactive monitoring for data systems. Proficiency in Python and SQL; familiarity with Scala or Java. Hands-on experience with streaming and batch data frameworks. Experience designing large-scale data lakes and warehouses with best practices for schema evolution and partitioning. Strong background with **cloud platforms (AWS, GCP, or Azure)**. Fintech or regulated industry experience is a plus. Good to Have Fintech-specific data experience, including regulatory compliance and reporting. Deployment experience with **real-time analytics** and event-driven architectures. Familiarity with containerization and infrastructure tools like Docker, Kubernetes, Terraform. Knowledge of data observability tools (Monte Carlo, Databand, etc.). Exposure to **ML pipelines** and model deployment. Solve challenging problems at the intersection of big data, real-time processing, and fintech. Lead impactful data initiatives at a rapidly growing startup. Collaborate with a world-class team of engineers, data scientists, and product leaders. Competitive compensation, equity, and benefits. Clear career growth opportunities in leadership and innovation. Perks That Go Beyond the Paycheck Relocation assistance for a smooth move. Free office meals (lunch & dinner). Generous leave policies (birthday, period, parental support, and more). Salary advances and loan policies for financial support. Quarterly rewards, recognition, and referral incentives. Access to the latest gadgets and tools. Comprehensive health insurance with mental health support. Tax benefits like food coupons, phone allowances, and leasing options. Retirement benefits including PF contribution, leave encashment, and gratuity. About FamApp FamApp focuses on financial inclusion for the next generation by offering UPI and card payments to users aged 11+. Our flagship product, FamX, integrates UPI and card payments seamlessly, helping users manage, save, and learn about their finances effortlessly. With over **10 million users**, FamApp is revolutionizing how young Indians transact eliminating the need to carry cash and offering customizable FamX cards with personal doodles for a fun, unique payment experience. Join Our Dynamic Team At Fam, we foster a people-first culture with flexible work schedules, generous leave, comprehensive health benefits, and mental health support. You ll be part of a passionate, talented, and fun team shaping the future of fintech for India s youth.
Data Architect
Growtharc Technologies
Position: Data Architect Location: Remote/Hybrid | Bengaluru, IND We're searching for a highly skilled and experienced Data Architect to join our team. If you have a deep understanding of big data technologies and extensive experience with Hadoop, Python, Snowflake, and Databricks, you're the ideal candidate. You'll be responsible for designing, implementing, and managing complex data architectures that support our critical business needs and objectives. What You'll Do: Design & Architecture Leadership: Design scalable and efficient data architecture solutions that meet current and future business data needs. Lead the development of data models, schemas, and databases, ensuring alignment with business requirements. Architect and implement robust data solutions on leading cloud platforms (AWS, Azure, or GCP). Data Management & Governance: Develop and maintain robust data pipelines and ETL processes using Hadoop, Databricks, and other essential tools. Oversee data integration and quality efforts to ensure consistency and reliability across the organization. Implement data governance best practices, focusing on data security, privacy, and compliance. Collaboration & Mentorship: Work closely with data engineers, data scientists, and business stakeholders to translate data requirements into effective technical solutions. Provide technical leadership and mentorship to junior data engineers and architects. Collaborate with cross-functional teams to ensure data solutions align perfectly with overall business goals. Optimization & Innovation: Optimize existing data architectures for peak performance, scalability, and cost-efficiency. Monitor and troubleshoot data systems to ensure high availability and reliability. Continuously evaluate and recommend new tools and technologies to improve our data architecture. What You'll Bring: Experience: 10+ years in data architecture, data engineering, or a related field. Big Data Expertise: Proven experience with Hadoop ecosystems (HDFS, MapReduce, Hive, HBase). Programming Prowess: Strong programming skills in Python for data processing and automation. Data Platform Mastery: Hands-on experience with Snowflake for data warehousing and Databricks for analytics. Cloud Fluency: Extensive experience with cloud platforms (AWS, Azure, GCP) and their data services. Data Modeling: Familiarity with data modeling tools and methodologies. Core Skills: Deep understanding of big data technologies and distributed computing. Strong problem-solving skills to design solutions for complex data challenges. Excellent communication skills, able to explain complex technical concepts clearly to diverse audiences. Proficient in SQL and database performance tuning. Experience with CI/CD pipelines and automation in data environments. Education: Bachelor's degree in Computer Science, Information Technology, or a related field. Preferred Qualifications: Advanced Degree: A Master's degree in a related field. Cloud Certifications: Certifications like AWS Certified Data Analytics, Google Professional Data Engineer, or Microsoft Certified: Azure Data Engineer Associate. Additional Languages: Experience with other programming languages like Java or Scala. Machine Learning Integration: Knowledge of machine learning frameworks and their integration with data pipelines.
Data Science Specialist
5c Network Pvt. Ltd.
Position: Data Science Specialist Employment Type: Full-time Location: Bangalore, On-site Experience Required: 1 to 6 years (Mindset > Experience) About the Role: We re seeking a mission-driven, hands-on Data Analytics Specialist passionate about impact. This is not a typical analytics role you will drive key business metrics across clinical operations, stakeholder reporting, and automation. Collaborate closely with leadership, radiologists, operations, product teams, and AI engineers to ensure every decision is data-driven, actionable, and scalable. If you thrive on building dashboards, automating processes, streamlining pipelines, and driving growth through insights, this is your playground. Key Responsibilities: Own Business KPIs: Take full ownership of company-wide metrics, ensuring their accuracy, relevance, and actionability through stakeholder alignment. Data Engineering: Build and optimize ETL/ELT pipelines integrating data from Postgres, ClickHouse, and other sources. Dashboards & Reporting: Design and maintain intuitive dashboards (Metabase, Power BI, Tableau, Google Data Studio) that stakeholders actively use. Backend Data Transformation: Write clean, reusable code to convert raw data into actionable insights. API Development: Build and maintain internal APIs to serve analytics data to frontend and production systems. Email Automation: Develop real-time and scheduled email reports delivering dynamic insights to stakeholders. Spreadsheet Expertise: Automate and manipulate complex data in Excel/Google Sheets for detailed analysis and reporting. AI & Automation: Collaborate with AI teams to integrate predictive algorithms and automate analytics workflows. Business Acumen: Develop deep understanding of the teleradiology ecosystem clinical workflows, operations, financials, and tech platforms. Proactive Collaboration: Identify data gaps, flag inconsistencies, recommend improvements, and work closely with leadership. Required Skills: Strong Python skills for data analytics (Pandas, NumPy, FastAPI, Jupyter, etc.) Proficient in SQL (ClickHouse, PostgreSQL) with experience optimizing queries End-to-end dashboard ownership from data modeling to UI presentation Experience building or integrating APIs for analytics Advanced spreadsheet skills and formula-driven reporting Experience working with AI/ML models in applied settings (preferred) Hands-on backend data transformations, version control, and automation Familiarity with full-stack development frameworks (bonus) Bonus / Nice to Have: Exposure to dbt, Airflow, ChromaDB, Streamlit, Plotly Knowledge of data privacy, compliance, and healthcare analytics Experience building analytics platforms for SaaS or health-tech companies Mindset We re Looking For: 10x Hustler: Ready to learn fast and go the extra mile to solve problems. Obsessed with Accuracy: Data must be flawless before release. Extreme Ownership: Proactive in driving results without waiting for instructions. Fast Learner: Able to pick up new tools and concepts quickly. Business First, Code Later: Focused on impact, not just coding. Be part of India s fastest-growing teleradiology platform Direct access to leadership and real-world business impact Build game-changing analytics systems and influence strategy Gain exposure to AI, automation, product, and operations in one role Clear ramp-up roadmap and growth plan Expectations in First 3 Months: Build an end-to-end dashboard from scratch Clean up at least one messy data pipeline Automate a stakeholder email update Validate accuracy across critical business metrics Understand full-stack platform structure and propose an improvement
Senior Associate Data Engineering L2
Publicis Sapient
Senior Associate Data Engineering L2 Location: Bengaluru, India Department: Engineering Data Employment Type: Full-Time About the Role As a Senior Associate Data Engineering (L2) at Publicis Sapient, you will lead technical solutions that drive digital transformation by building scalable, high-performance data platforms. You ll be responsible for translating business and technical requirements into modern, data-centric solutions using Big Data technologies, cloud services (Azure), and advanced data engineering practices. Key Responsibilities Design and implement data ingestion, integration, and transformation processes from multiple heterogeneous sources in both batch and real-time. Build scalable data platforms using Hadoop stack components such as HDFS, Kafka, Spark, Hive, NiFi, Oozie, Airflow, Flink, and Storm. Develop real-time analytics, aggregation, and search features to support various data-driven applications. Collaborate closely with cross-functional teams on data infrastructure, computation frameworks, and data visualization. Apply cloud-native principles and Azure services to build and deploy data pipelines. Ensure performance optimization and data pipeline tuning. Work with NoSQL and MPP platforms like MongoDB, Cassandra, Redshift, Azure SQL DW, HBase, BigQuery. Contribute to infrastructure, automation, and DevOps for data pipelines using CI/CD practices. Ensure data governance, lineage, and cataloging using tools like Collibra or Alation. Required Qualifications 6 8 years of professional experience in software/data engineering. Minimum of 3 years hands-on experience with Big Data technologies. Strong programming expertise in Java (preferred), Scala, or Python. Expertise in the Hadoop ecosystem and real-time stream processing tools (Kafka, Pulsar, Spark Streaming, etc.). Hands-on experience with Azure data services (e.g., Data Factory, Synapse, ADLS, Databricks). Experience working with modern ETL tools (Informatica, Talend, etc.) and traditional RDBMS platforms (Oracle, PostgreSQL, SQL Server, MySQL). Bachelor's degree in Computer Science, Engineering, or related field. Nice to Have Certifications in Azure Data Engineer, GCP Big Data, or related cloud specializations. Experience with distributed messaging frameworks (ActiveMQ, RabbitMQ, Solace). Familiarity with microservices architecture and search technologies (Elasticsearch). Performance tuning of distributed data processing systems. Exposure to data governance, security, and metadata management. Benefits and Culture at Publicis Sapient Gender-neutral workplace policies 18 paid holidays annually Generous parental leave + new parent transition support Flexible work arrangements Access to Employee Assistance Programs (wellness & well-being) A dynamic culture focused on learning, creativity, and collaboration Qualification : Bachelor's degree in Computer Science, Engineering, or related field.
Cloud Architect
Camsdata Technologies India Pvt. Ltd.
Cloud Architect Bangalore, India Location: Bangalore (Bengaluru) Experience: 8 to 15 Years Industry: IT Software / Cloud Computing Job Summary: We are seeking a seasoned Cloud Architect with deep expertise in designing and implementing secure, scalable cloud solutions across public and private cloud platforms. The ideal candidate will have strong knowledge of enterprise application and integration patterns, cloud-native microservices, and security architecture. Key Responsibilities: Architect and design cloud solutions leveraging AWS, Microsoft Azure, and Google Cloud Platform (GCP) Develop microservices-based applications using Docker and Kubernetes, and deploy them on cloud platforms Define security architectural requirements, including threat modeling, identity and access management, PKI, and secrets management Ensure cloud environments adhere to security protocols, compliance standards, and best practices for authentication and authorization Work with a wide range of cloud services including storage, networking, and security components Apply knowledge of Big Data ecosystems such as Hadoop and NoSQL databases to design scalable data processing architectures Deploy and manage cloud infrastructure using Terraform for infrastructure as code (IaC) Utilize configuration management tools like Puppet, Chef, and continuous integration tools such as Git and Jenkins Collaborate within Agile teams to deliver cloud architecture solutions efficiently Stay updated on emerging open-source technologies and integrate them into cloud architectures when applicable Required Skills & Qualifications: Extensive experience with both public and private cloud technologies Strong understanding of Enterprise Application Patterns and Integration Patterns Hands-on experience with containerization, microservices, and orchestration tools (Docker, Kubernetes) In-depth knowledge of cloud security, including threat modeling and compliance requirements Proficiency in managing cloud infrastructure on AWS, Azure, and/or GCP Familiarity with Big Data platforms and NoSQL databases Skilled in Infrastructure as Code (IaC) tools, especially Terraform Experience with automation and configuration management tools like Puppet, Chef, Git, and Jenkins Comfortable working in Agile development environments Preferred Qualifications: Bachelor s or Master s degree in Computer Science, Information Technology, or related field Relevant cloud certifications such as AWS Certified Solutions Architect, Azure Solutions Architect Expert, or Google Cloud Professional Architect Strong communication skills to articulate complex cloud architectures to diverse stakeholders Lead the design of innovative, secure, and scalable cloud architectures Work with cutting-edge cloud and container technologies in a dynamic environment Opportunity to grow professionally with access to training and certifications Qualification : Bachelors or Masters degree in Computer Science, Information Technology, or related field
Data Architect
Camsdata Technologies India Pvt. Ltd.
Data Architect Bangalore, India Location: Bangalore (Bengaluru) Experience: 10 to 15 Years Industry: IT & Data Systems Job Summary: We are seeking an experienced Data Architect with a strong background in designing and implementing enterprise-scale data solutions. The ideal candidate will have expertise in building data lakes, warehouses, and pipelines, with deep knowledge of cloud platforms, data management, and industry best practices. Key Responsibilities: Design, develop, and maintain complex data architectures including data lakes, data warehouses, data marts, and efficient schema design Build and optimize scalable data pipelines for extraction, transformation, and loading (ETL/ELT) processes Apply Agile methodologies in project delivery and collaborate within cross-functional teams Perform data profiling, cleansing, conversion, and ensure high-quality data management for both structured and unstructured data Implement CI/CD and Infrastructure as Code (IaC) practices using tools like GitHub, Jenkins, CloudFormation, and Azure Resource Manager Manage database systems and tools such as PostgreSQL, Oracle, Snowflake, Teradata, MongoDB, Hadoop, and others Utilize data modeling tools like Erwin, Power Designer, and Toad for effective data architecture design Leverage cloud platforms including AWS and Microsoft Azure, with hands-on experience in services like AWS Glue, DMS, Lambda, Azure Data Factory, Synapse, and Data Lake Storage Work with programming and scripting languages including SQL, PL/SQL, Python, Spark, YAML, and JSON Use containerization and automation tools such as Docker, Ansible, and NodeJS for efficient deployment Ensure compliance with cybersecurity principles and frameworks such as NIST Lead data governance initiatives and enforce best practices in data quality and security Preferred Qualifications: ITIL certification and experience with Agile methodology Knowledge of code review and version control best practices, especially in GitHub Familiarity with data science tools and AI/ML frameworks like R, Keras, or TensorFlow Experience with natural language processing (NLP) and machine learning concepts Background in regulated industries, with pharma manufacturing experience highly preferred Exposure to multi-site, global IT projects and manufacturing operations Lead innovative data architecture projects within a dynamic and fast-paced environment Work with cutting-edge cloud technologies and big data ecosystems Collaborate with global teams on impactful enterprise solutions Access to professional growth opportunities in data governance, AI, and cloud technologies
Data Scientist
Cognite
Data Scientist Location: Bengaluru Department: Global Strategic Services Data Science EMEA Type: Full-Time | Hybrid About Cognite Cognite is a global SaaS leader harnessing AI and data to solve complex business challenges across industries such as Oil & Gas, Chemicals, Pharma, Manufacturing, and Energy. Our flagship solutions include Cognite Atlas AI, an industrial agent workbench, and the Cognite Data Fusion (CDF) platform. Recognized as a 2022 Technology Innovation Leader and the 2024 Microsoft Energy and Resources Partner of the Year, Cognite is at the forefront of industrial digital transformation. Our Values Impact: We focus on meaningful results. Ownership: We take responsibility, foster inclusivity, and share success. Relentless: We pursue innovation with determination and resilience. Who You Are 3+ years of industry experience (or academic equivalent) in Oil & Gas, Manufacturing, or Power & Utilities, developing analytical solutions for production optimization, predictive maintenance, and related applications (e.g., well monitoring, event prediction, production planning). 1+ years serving as a domain expert on internal or customer projects. Proficient in Python and its data ecosystem (pandas, numpy), and ML libraries such as scikit-learn, Keras, etc. Skilled in data visualization tools like PowerBI, Grafana, Tableau, or web frameworks like Plotly Dash, Streamlit. Advocates for software best practices, including automated testing and documentation. Experienced with version control systems such as Git. Comfortable contributing to large-scale software projects. Pragmatic thinker able to balance short- and long-term tradeoffs. Trusted advisor on machine learning applications, with excellent communication skills for technical stakeholders and customers. Experienced mentor and quality assurer for junior team members. Cloud experience, including building streaming models and deploying serverless functions. Track record working across diverse industries. Eager to stay current with advances in generative AI and explore its practical applications for customers. Be part of a diverse global team with 70+ nationalities and a strong commitment to Diversity, Equality, and Inclusion (DEI). Work from our Bengaluru office (Rathi Legacy, Hoodi) in a modern hybrid environment. Benefit from a flat organizational structure with direct access to decision-makers. Collaborate on impactful, ambitious projects alongside talented professionals across industries. Engage with our community through internal HUBs and open conversations. Make an Impact Join Cognite and help transform industrial sectors by enabling better decisions through AI and data. We encourage candidates of all backgrounds to apply. If you re ready to grow and innovate with us, apply today!
Data Scientist Ii
Meesho
Data Scientist II Join the Meesho Tech Team Location: Bangalore, Karnataka | Department: Technology Empower the Future of E-Commerce with Meesho We are looking for an experienced Data Scientist II to join our Avengers-like Data Science team at Meesho. As part of our mission to revolutionize e-commerce in India, we solve complex problems related to fraud detection, inventory optimization, and localized platform experiences that impact millions of users daily. As a mid-level Data Scientist, you ll partner with stakeholders across teams to create data-driven strategies, improve platform performance, and lead initiatives that enable better business outcomes for resellers and end users across Bharat. What You ll Do Design and build predictive models and machine learning solutions using large-scale data Conduct experiments and extract insights from high-volume structured and unstructured data Enhance product usability and surface new growth opportunities through advanced analytics Analyze reseller behavior to deliver personalized product recommendations Design effective discount programs to increase reseller sales performance Identify customer preferences to help resellers better serve their buyers and drive revenue Uncover supply chain inefficiencies and support SLA adherence through data insights Forecast seasonal and regional demand to optimize key business metrics Mentor junior data scientists and contribute to team knowledge-sharing What You ll Need Bachelor s or Master s degree in Computer Science, Data Science, or a related technical field 2 4 years of experience in a Data Scientist role, ideally within a fast-moving B2C tech environment Strong knowledge of Machine Learning algorithms, Neural Networks, and data modeling techniques Hands-on experience with SQL, Python, and R for data manipulation and analysis Strong understanding of statistics, linear algebra, and A/B testing methodologies Demonstrated ability to turn business problems into data science solutions Experience working with product and engineering teams to deliver scalable systems Preferred Qualifications Prior experience solving personalization or recommendation engine problems Familiarity with Big Data technologies like Apache Spark, Hadoop, or Amazon Redshift About Meesho Meesho is India's top-rated e-commerce workplace, revolutionizing online commerce for the next billion users. We empower over 1.75 million sellers through an inclusive, technology-driven platform that offers zero-commission selling, the lowest logistics costs, and access to a massive customer base across every serviceable pincode in India. Our Mission Democratizing Internet Commerce for Everyone Meesho (Meri Shop) aims to help 100 million small businesses succeed online. With relatable merchandise and localized user experiences, we re building a truly inclusive digital shopping ecosystem for underserved markets in India. Our Culture and Benefits Meesho fosters a high-performance and people-first culture. Our workplace philosophy is shaped by 11 guiding principles ("Mantras") and practices like Reflections, Listen or Die, and a robust Internal Mobility Program. Our Total Rewards Include: Competitive compensation packages including performance-based equity Comprehensive wellness through our MeeCare Program covering medical, mental, and financial well-being Flexible leave policies and parental support benefits to ensure work-life balance Relocation support, salary advances, and learning & development assistance Fun, collaborative culture with team engagement events, gifts, and recognition programs We are not just building a company; we re creating a movement. If you re driven by data and inspired to make a difference, come be a part of Meesho s journey to redefine e-commerce in India. Apply today and let's build the future of commerce, together. Qualification : Bachelors or Masters degree in Computer Science, Data Science, or a related technical field
Data Scientist III
Meesho
Data Scientist III Advanced Analytics Role at Meesho Location: Bangalore, Karnataka | Department: Tech About the Data Science Team at Meesho Our Data Science team is the engine driving intelligent decision-making at Meesho. Known internally as the Avengers to our S.H.I.E.L.D, we tackle mission-critical challenges using cutting-edge data science to empower millions of users and small businesses across Bharat. From fraud detection and inventory optimization to platform vernacularization, we re shaping the future of e-commerce in India through data innovation. As a Data Scientist III, you'll be solving complex problems in a rapidly evolving market. You ll lead initiatives that translate data into strategic insights and scalable models, impacting business across various functions. About the Role If you thrive on solving high-impact data problems and love deriving actionable solutions from large datasets, you may be the next Data Scientist III at Meesho. Your primary responsibility will be to increase the utility of business data through experimentation, modeling, and strategic insights. This role involves significant collaboration with cross-functional teams, mentorship, and ownership of critical analytics workflows. What You ll Do Build and deploy machine learning models to extract insights from complex data Run experiments to test hypotheses and evaluate product performance Drive product usability improvements and uncover new growth opportunities Analyze reseller preferences and personalize product offerings Create effective discount strategies to support reseller success Enhance understanding of end-user behavior for improved targeting Uncover supply chain inefficiencies and improve SLA performance Predict seasonal and regional demand to guide planning decisions Mentor junior data scientists and contribute to team development What You ll Need Bachelor s or Master s degree in Computer Science, Data Science, or a related technical field 4 7 years of experience as a Data Scientist, ideally in a fast-paced B2C company Proficiency with Neural Networks, supervised/unsupervised learning, and ML frameworks Expertise in Python, SQL, and R for data analysis and model development Strong knowledge of statistics, linear algebra, and model validation techniques Experience in designing, running, and analyzing A/B tests Exceptional problem-solving and communication skills Experience working closely with product and engineering teams Bonus Points For: Work on personalization systems or recommendation algorithms Familiarity with Big Data technologies like Apache Spark, Hadoop, or Redshift About Meesho Meesho is India s leading social commerce platform, empowering over 1.75 million sellers to grow their businesses online. We offer a unique business model with zero commission fees and the lowest shipping costs, supported by a powerful tech ecosystem and pan-India logistics network. Our platform is built for first-time internet users and small business owners alike, democratizing e-commerce across every corner of the country. Our Mission Democratizing Internet Commerce for Everyone Meesho s goal is to enable 100 million small businesses to succeed online. With relatable, affordable merchandise and innovative seller support tools, we help entrepreneurs reach new customers and scale efficiently. Our Culture and Total Rewards At Meesho, we cultivate a high-performance, people-first culture supported by our 11 guiding Mantras. We prioritize continuous learning, collaboration, and transparency, and back it up with industry-leading employee experiences. Our Total Rewards Include: Market-leading compensation with both cash and equity components Access to MeeCare Program for physical, mental, financial, and social wellness Comprehensive medical coverage for employees and families Parental benefits, generous leave policies, and retirement plans Support for learning and career development through structured programs Employee engagement, flexible benefits, relocation assistance, and much more We re more than just a company we re a movement redefining online commerce in India. If you re a passionate data expert ready to lead high-impact initiatives, apply now and become a part of Meesho s incredible journey. Qualification : Bachelors or Masters degree in Computer Science, Data Science, or a related technical field
Principal Applied Scientist
Microsoft
Principal Applied Scientist Location: Bangalore, Karnataka, India Employment Type: Full-Time Overview Join the Computational Advertising Team within the AI & Research organization at Microsoft. We are seeking a passionate Principal Applied Scientist with expertise in machine learning, deep learning, natural language processing, and optimization. The team works on some of the most exciting problems in machine learning, powering the search advertising ecosystem of Bing, which holds a 30% share of desktop search in the US and a significant presence worldwide. The Principal Applied Scientist will contribute to building large-scale machine learning systems that model user responses, ranking, and various other business applications. Your work will have direct business impact, collaborating with top-tier machine learning scientists and engineers to deliver innovative solutions. Key Responsibilities Design & Implement ML Algorithms: Develop, tune, and analyze complex algorithms for large datasets and real-time systems in the advertising ecosystem. Productionize Machine Learning Models: Focus on deploying machine learning models in large-scale production environments, ensuring high-quality and sustainable performance. Collaborate with Experts: Work closely with cross-functional teams of scientists and engineers to integrate advanced machine learning techniques into production systems. Research & Development: Contribute to ongoing research in machine learning, optimization, and information retrieval to advance the core technologies in search advertising. Mentorship & Thought Leadership: Share your expertise with the team, mentor junior scientists, and help shape the direction of research efforts within the team. Qualifications Required Qualifications: Bachelor's degree in Statistics, Econometrics, Computer Science, Electrical Engineering, or related field AND 6+ years of relevant experience (e.g., statistics, predictive analytics, research), OR Master's degree in a related field AND 4+ years of relevant experience, OR Doctorate in a related field AND 3+ years of relevant experience, OR Equivalent experience in machine learning, research, or applied statistics. Additional Requirements: Strong understanding of probability, statistics, machine learning, A/B testing, and optimizing ML models for accuracy. Experience with distributed computing systems (e.g., Hadoop, Spark) for large-scale training and prediction with ML models. Hands-on experience implementing machine learning algorithms from research papers to production systems. Expertise in TensorFlow or PyTorch and deep learning models. Preferred Qualifications: Solid understanding of natural language processing (NLP), information retrieval, and optimization techniques. Experience working with large-scale, real-time systems and databases. Strong track record of delivering machine learning systems to production and optimizing them at scale. Microsoft s mission is to empower every person and organization on the planet to achieve more. At Microsoft, you ll have the opportunity to work with the brightest minds, contribute to cutting-edge AI advancements, and make a significant impact in a growing $100 billion global market. Employee Benefits Industry-leading healthcare coverage Generous paid time off and family leave policies Access to continuous learning and professional development resources Employee discounts and savings programs Opportunities to collaborate with the world s leading AI experts Global networking and community engagement opportunities Microsoft is an equal opportunity employer. We are committed to fostering an inclusive environment where all individuals are treated with respect, regardless of race, gender, religion, disability, or any other characteristic protected by law. Qualification : Bachelor's degree in Statistics, Econometrics, Computer Science, Electrical Engineering, or related field AND 6+ years of relevant experience (e.g., statistics, predictive analytics, research), OR Master's degree in a related field AND 4+ years of relevant experience, OR Doctorate in a related field AND 3+ years of relevant experience, OR Equivalent experience in machine learning, research, or applied statistics.
Data Engineer - Platform Generative Ai
Mckinsey & Company
Your Impact We are seeking a passionate Data Engineer with expertise in Python development who is excited about cloud-based data engineering using AWS services. You will be an integral part of a dynamic, multi-disciplinary team, working closely with digital product professionals, data scientists, cloud engineers, and other stakeholders. As a key member of a global team working on our generative AI initiative, you will be based in one of our European offices. McKinsey s Tech Ecosystem function is responsible for developing and delivering all technology solutions for the firm s internal use, and your role will be crucial in driving the development of data solutions to support generative AI applications. You will work with a team of data engineers to develop robust data ingestion pipelines and enhance data processing capabilities that integrate data into systems used by AI applications. Your responsibilities will include writing Python code, creating tests, developing and maintaining GitHub Action CICD pipelines, and managing AWS-based infrastructure and Docker containers. Your Growth As a member of the global team working on our generative AI initiative, you will play a key role in shaping and accelerating the delivery of McKinsey's target state data platform, which will enable AI use-cases. You will be part of our cloud-first approach, transforming data platforms and analytical applications across the firm. Working closely with multidisciplinary teams, you will contribute to building cutting-edge data solutions in a fast-paced, innovative environment. McKinsey s Tech Ecosystem function is responsible for developing all technology solutions for the firm s internal needs, and you ll have the opportunity to shape how these solutions evolve. Your Qualifications and Skills 3+ years of professional experience as a Data Engineer, with a focus on cloud-based data engineering using AWS services Expertise in Python development and a strong understanding of clean code, modularity, error handling, and test automation Extensive experience with relational databases and data pipeline performance Hands-on experience with Docker and CI/CD pipelines (e.g., GitHub Actions) Strong execution focus, with the ability to work independently in complex, fast-paced environments and deliver results Demonstrable experience in solving data pipeline performance issues and diagnostics Interest in generative AI and machine learning topics Experience with Kedro framework is a plus Opinionated and confident in sharing ideas, willing to speak up at all levels Familiarity with Agile principles and product development methodologies Excellent problem-solving skills and the ability to analyze and resolve complex data engineering challenges Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams
Data Engineer: Data Warehouse
International Business Machines Corporation
Job Title: Application Developer - ETL and Data Management Introduction: In this role, you ll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we provide deep technical and industry expertise to a wide range of public and private sector clients globally. Our delivery centers leverage locally-based skills to help clients drive innovation and the adoption of new technologies. A career in IBM Consulting is built on long-term relationships and close collaboration with clients across the world. You will work with leaders across industries to improve the hybrid cloud and AI journeys for some of the most innovative and valuable companies worldwide. Your ability to make a meaningful impact for clients is enabled by our strategic partner ecosystem and robust technology platforms, including Software and Red Hat. Curiosity and a constant quest for knowledge are key to success in IBM Consulting. In your role, you ll be encouraged to challenge the norm, explore new ideas, and come up with creative solutions that result in groundbreaking impact for a wide network of clients. Our culture is built on evolution and empathy, focusing on long-term career growth and development in an environment that values your unique skills and experience. Your Role and Responsibilities: ETL Workflow Development: Develop and implement ETL workflows by creating ETL jobs, and data models in datamarts using technologies such as Snowflake, DBT, Unix, and SQL. Batch Processing Redesign: Redesign Control M Batch processing for ETL job builds to run efficiently in a production environment. System Evaluation and Improvement: Study the existing system to evaluate its effectiveness and design new systems to improve workflow efficiency. Business Program Analysis & Support: Perform requirements identification, business program analysis, testing, and system enhancements while providing production support. Agile Environment: Work effectively in an Agile environment and gain familiarity with tools such as JIRA and SharePoint. Client Interaction: Good written and verbal communication skills are essential as you will interact directly with client counterparts to understand requirements and provide solutions. Required Technical and Professional Expertise: Experience: A minimum of 3 years of experience in developing ETL applications, implementing workflows, and creating data models using Snowflake, DBT, Unix, and SQL technologies. Agile Environment: Strong understanding of working in an Agile environment and proficiency in tools like JIRA and SharePoint. Problem-Solving Skills: Ability to manage change and proven time management skills. Strong interpersonal skills to contribute effectively to team efforts. Continuous Learning: Stay up-to-date with technical knowledge by attending educational workshops and reviewing relevant publications. Preferred Technical and Professional Expertise: ETL Development: Experience in developing triggers, functions, and stored procedures to support ETL workflows. Impact Analysis: Assist with impact analysis of changing upstream processes on the Data Warehouse and reporting systems. ETL & Reporting Support: Participate in the design, testing, support, and debugging of new and existing ETL and reporting processes. Data Profiling & Troubleshooting: Perform data profiling and analysis using a variety of tools, troubleshoot and support production processes, and maintain documentation. Innovation: Be part of a team that drives global change and leverages cutting-edge technologies to solve complex problems. Growth: Gain access to continuous learning and career development opportunities to further your expertise in data management and cloud technologies. Collaboration: Work with a diverse team in a collaborative environment that values new ideas and creative solutions. Global Impact: Your work will contribute to improving business operations and technological advancements for clients around the world. If you're passionate about driving innovative solutions, working with a variety of clients, and continuously evolving your skills, IBM Consulting is the perfect place for you to advance your career.
Staff Data Engineer
Intuit
Intuit is a global leader in financial technology, dedicated to helping individuals and businesses thrive. Our suite of products, including TurboTax, Credit Karma, QuickBooks, and Mailchimp, serves approximately 100 million customers worldwide. At Intuit, we believe in providing everyone with the tools and resources they need to achieve financial success. We are constantly innovating to make financial empowerment a reality for all. Job Overview Join the Intuit Data Platform (IDP) team as a Staff Engineer and help us transform the way we handle big data! The IDP team is responsible for the Intuit Analytics Platform, which powers real-time data ingestion, cataloging, analytics, and machine learning across the entire organization. As Intuit s customer base grows, so does the volume of data we process. Our engineering excellence ensures that we can scale and leverage this data to drive machine learning and product innovations. We re in the process of building the next-generation real-time and batch ingestion engine, capable of indexing, cataloging, and organizing data and metadata. We are passionate about using open-source technologies to solve challenges and contributing back to the community. If you're excited about building a platform that will directly impact data scientists and analysts and have a desire to shape the future of data at Intuit, then come join us! Key Responsibilities Architect & Design: Build fault-tolerant and scalable big-data platforms using open-source technologies to handle massive datasets. Data Solutions: Create architecture solutions that address complex use cases like data normalization, lineage, governance, ontology, and discoverability. Cross-Team Collaboration: Work with analysts and data scientists to understand data requirements for building operational propensity models and gaining deep customer insights. Hands-On Coding: Lead development efforts within the Hadoop ecosystem using technologies such as Java MapReduce, Spark, Scala, HBase, and Hive to build and optimize data pipelines for both real-time and batch applications. Database Management: Work with NoSQL, SQL, and in-memory databases to design high-performance data systems. Code Reviews: Ensure code quality, consistency, and adherence to best practices through regular code reviews. Architectural Alignment: Ensure alignment between enterprise architecture and business requirements. Prove Feasibility: Conduct proof-of-concept (POC) experiments for new technologies or approaches and drive them to production. Collaboration with Data Cataloging Team: Work closely with data catalog teams and architects to index and catalog all data sources at Intuit. Agile Leadership: Lead fast-paced development teams using agile methodologies and promote best practices in software development, testing, and incident response. Design & Model: Build dimension models suited for customer business use cases and ensure seamless integration of business and technical requirements. Qualifications Experience: 12+ years of relevant experience, with at least 5+ years specializing in the big data domain. Big Data Architecture: Proven experience in architecting end-to-end ecosystems for big data and analytics platforms. Expert Knowledge: Deep expertise in building fault-tolerant, scalable big data solutions, especially using the Hadoop ecosystem (Hive, HBase, Spark, Kafka, MapReduce, etc.). Programming Expertise: Mastery of Java and Scala, with a focus on building high-throughput data services. Machine Learning: Knowledge of machine learning principles and AI applications in big data. Big-Data Technologies: Familiarity with tools such as HDFS, Storm, Zookeeper, Cassandra, Redshift, GraphDB, and others. Understanding both real-time and batch processing in the Hadoop ecosystem. Communication: Strong communication skills, with an ability to explain complex technical topics to both technical and non-technical audiences. Programming Skills: Intermediate experience in Python or R for data processing. Education: BE/BTech/MS in Computer Science or a related field (or equivalent experience). Collaboration: Demonstrated ability to work cross-functionally and lead change through influence and example. At Intuit, you ll be part of a talented, passionate team working on innovative solutions that shape the future of data analytics and machine learning. As a Staff Engineer, you ll have the chance to work with cutting-edge technologies, build scalable systems, and help revolutionize how Intuit leverages data to drive product innovation. If you're looking for a dynamic environment where you can have a meaningful impact, come join us at Intuit! Qualification : BE/BTech/MS in Computer Science (or equivalent)
Sr. Solutions Engineer
Databricks
Job Title: Senior Solutions Engineer (Analytics, AI, Big Data, Public Cloud) Job Summary As a Senior Solutions Engineer (Analytics, AI, Big Data, Public Cloud), you will guide the technical evaluation phase in a hands-on environment throughout the sales process. You will be a technical advisor internally to the sales team and work with the product team as an advocate of your customers in the field. You will help our customers achieve tangible data-driven outcomes through the use of our Databricks Lakehouse Platform, helping data teams complete projects and integrate our platform into their enterprise Ecosystem. You'll grow as a leader in your field while finding solutions to our customers' biggest challenges in big data, analytics, data engineering, and data science problems. You will report to the Solutions Architect (SA) Manager. The Impact You Will Have You will be a Big Data Analytics expert on aspects of architecture and design. Engage with the technical community by leading presentations, workshops, seminars, and meet-ups. Lead your clients through evaluating and adopting Databricks, including hands-on Spark programming and integration with the wider cloud ecosystem. Support your customers by authoring reference architectures, how-tos, and demo applications. Integrate Databricks with 3rd-party applications to support customer architectures. Together with your Account Executive, you will form successful relationships with clients throughout your assigned territory to provide technical and business value. What We Look For Consulting, pre-sales, or post-sales experience working with external clients across a variety of industry markets. Core strength in either data engineering or data science is advantageous. 5+ years of experience demonstrating technical concepts, including presenting and white-boarding. 4+ years of experience designing architectures within a public cloud (AWS, Azure, or GCP). 4+ years of experience with Big Data technologies, including Spark, AI, Data Science, Data Engineering, Hadoop, Cassandra, and others. Solid coding experience in Python, R, Java, Spark, or Scala. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics, and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake, and MLflow. To learn more, follow Databricks on Twitter, LinkedIn, and Facebook.
Sr. Data Engineer
Databricks
Job Title: Data Engineer Job Summary As a Data Engineer in the IT team, you will work on various big data challenges using the Databricks platform. You will provide data engineering, data science, and cloud technology projects which require integrating with client systems, training, and other technical tasks to help business to get most value out of their data. The Impact You Will Have You will work on a variety of impactful Big Data projects which may include building reference architectures, how-to's and production grade MVPs. Work on implementing transformational big data projects, 3rd party migrations, including end-to-end design, build and deployment of industry-leading big data and AI applications. Work on architecture and design; bootstrap or implement strategic projects. What We Look For 7+ years experience with Big Data Technologies such as Apache Spark , Kafka, Cloud Native and Data Lakes. 4+ years of experience working on Big Data Architectures independently. Preferred experience working in the Databricks ecosystem. Comfortable writing code in either Python or Scala. Experience working across Cloud Platforms (GCP / AWS / Azure). Documentation and white-boarding skills. Build skills in technical areas which support the deployment and integration of Databricks-based solutions to complete customer projects. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics, and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake, and MLflow. To learn more, follow Databricks on Twitter, LinkedIn, and Facebook. Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks.
Senior Data Scientist
Gameskraft
Senior Data Scientist Experience: 1-4 Years | Location: Bengaluru About Gameskraft: Founded in 2017, Gameskraft has rapidly become one of India s fastest-growing online gaming companies. Our mission is to build a safe, secure, and responsible gaming ecosystem while delivering unmatched experiences through cutting-edge technology and design. We are the only gaming company in the industry with ISO 27001 and ISO 9001 certifications. About the Role: As a Senior Data Scientist, you will play a key role in driving data-driven decision-making by developing and implementing advanced analytics and machine learning solutions. You will work on high-impact projects across different business units, collaborating with stakeholders to optimize business performance and user experience. Key Responsibilities: Data Strategy & Governance: Develop and execute a data strategy aligned with business goals. Identify and prioritize high-value data science opportunities. Ensure data accuracy, integrity, and governance by working with data engineering teams. Advanced Analytics & Machine Learning: Design, develop, and deploy machine learning models for regression, classification, clustering, and time-series forecasting. Stay updated with state-of-the-art algorithms and implement innovative solutions. Optimize data pipelines and model performance for scalable deployment. Collaboration & Innovation: Work closely with business leaders, product teams, and IT to integrate data science solutions into business processes. Foster a culture of innovation by exploring emerging technologies and methodologies. Translate complex technical concepts into actionable business insights for non-technical stakeholders. Performance Monitoring & Reporting: Define and track KPIs to measure the impact of data science initiatives. Continuously optimize models for improved accuracy and efficiency. Present insights and recommendations to senior management. What You Bring to the Table: Education: Master s or Ph.D. in Computer Science, Statistics, Machine Learning, or a related field. Experience: 2-4 years of hands-on experience in building and deploying data science solutions. Proven expertise in machine learning, statistical modeling, and analytics. Experience working with large-scale datasets and high-performance computing frameworks. Technical Skills: Proficiency in Python and libraries like scikit-learn, TensorFlow, Keras, PyTorch, Pandas, NumPy, Matplotlib. Experience with Big Data technologies (Hadoop, Spark, distributed computing frameworks). Familiarity with Cloud Platforms (AWS, Azure, GCP) and services like S3, Redshift, Databricks. Deep learning expertise is a plus. Soft Skills & Leadership: Strong communication and stakeholder management skills. Ability to simplify complex data science concepts for non-technical audiences. A proactive problem-solver and team player. Why Join Us? Work on cutting-edge AI & ML projects in one of India s fastest-growing tech companies. Collaborate with top industry leaders and work in a fast-paced, dynamic environment. Competitive compensation and growth opportunities. A diverse, inclusive, and innovation-driven work culture. If you re passionate about AI, Data Science, and driving real-world impact, we d love to hear from you! Qualification : Masters or Ph.D. in Computer Science, Statistics, Machine Learning, or a related field.
Python Data Engineer Architect
Gramener
What Gramener Offers You Gramener provides an inviting workplace, talented colleagues from diverse backgrounds, steady career growth prospects, and plenty of opportunities for innovation. Our goal is to create an ecosystem of easily configurable data applications focused on data storytelling for both public and private use. Roles and Responsibilities Design and build data models to address the organization s strategic data needs. Collaborate with technical and business stakeholders to gather requirements and design scalable solutions. Perform requirement analysis and create detailed architectural models for proposed solutions. Identify and troubleshoot operational issues, recommending strategies for improvement. Effectively communicate technical solutions to business users and address their concerns. Ensure solutions are compliant with corporate standards and regulatory requirements. Develop technical design specifications for engineers and systems teams. Assess the impact of proposed solutions, including resource allocation and implementation feasibility. Leverage Python libraries for data acquisition and integration tasks, such as API interaction, database connectivity, and web scraping. About Us At Gramener, we empower organizations to make data-driven decisions through strategic data consulting. We provide a roadmap for data transformation, helping businesses turn data into a strategic asset. Our services include data analysis, visualization, and delivering insights through a wide range of products and solutions to enable smarter decision-making.
Senior Manager - Technical Solutions (spark)
Databricks
As a Senior Manager of the Spark Technical Solutions team, you will lead & manage a team of Technical Solution Engineers (Spark) and be responsible for driving deep dive technical solutions for any issues reported by Databricks customers. We expect the manager to resolve challenges with comprehensive technical and customer communication skills. You will assist our customers in their Databricks journey and provide them with the guidance, knowledge, and expertise that they need to realise value and achieve their strategic objectives using our products. The impact you will have: As a manager and member of the leadership team, you will be directly responsible for the management of Technical solution engineers, team leads and operations personnel Responsible for directly monitoring, reporting, and driving improvements to team-level metrics and KPIs, acting as an escalation point with customers and internal teams, and optimising and developing support processes and tools Responsible for working across multiple cross functional teams that include Engineering, product management, sales and customer success; manage Hiring, mentoring and onboarding new support engineers Regularly meet one-on-one with your direct reports, conducting annual reviews and career development discussions throughout the year Be a hands on manager to assist the team members in resolving issues related to Spark core internals, Spark SQL, Structured Streaming, Delta, Lakehouse and other databricks runtime features Manage and drive best practices guidance around Spark runtime performance and usage of Spark core libraries and APIs for custom-built solutions developed by Databricks customers; contribute in the development of tools/automation initiatives Own Engineering JIRA tickets and proactively work to bring quicker resolutions to customer reported issues; participate in creation of knowledge base articles Participate in weekend and weekday on-call rotation and run escalations during databricks runtime outages, incident situations, ability to multitask and plan day 2 day activities and provide escalated level of support for critical customer operational issues, etc What we look for: Min 10-12 years of experience in designing, building, testing, and maintaining Python/Java/Scala/Spark based applications in a typical project delivery and consulting environments with 4+ years working as a Manager 5+ years of hands-on experience in developing and leading any two or more of the Big Data, Hadoop, Spark,Machine Learning, Artificial Intelligence, Streaming, Kafka, Data Science, ElasticSearch related industry use cases at the production scale. Big Data / Spark hands on-experience is mandatory Hands on experience in the performance tuning/troubleshooting of Hive and Spark based applications at production scale. Real time experience in JVM and Memory Management techniques such as Garbage collections, Heap/Thread Dump Analysis is preferred Working and hands-on experience with Data lakes and any SQL-based databases, Data Warehousing/ETL technologies like Informatica, DataStage, Oracle, Teradata, SQL Server, MySQL is preferred Hands-on experience with AWS or Azure or GCP is preferred Experience in implementing CI/CD, Monitoring/alerting for Production Systems Technical lead in design, implementation and support of large scale data and analytics solutions that are highly reliable, flexible, and scalable Experience in leading and managing end-to-end projects and have reported and escalated to top levels Experience in managing and leading teams in an organisation involving multiple reporting lines About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.
Spark Backline Engineer
Databricks
Mission As a Spark Backline Engineer you will help our customers to be successful with the Databricks Data Intelligence platform by resolving important technical customer escalations and the support team. You will be the technical bridge between support and engineering and the first line of defense for engineering. You will ensure that all issues are vetted by you before it reaches the engineering team. You will report to the Senior Backline Manager of the Backline Escalations Team. Outcomes Troubleshoot, resolve and suggest deep code-level analysis of Spark to address complex customer issues related to Spark core internals, Spark SQL, Structured Streaming and Databricks Delta. Provide best practices guidance around Spark runtime performance and usage of Spark core libraries and APIs for custom-built solutions developed by Databricks customers. Help the support team with detailed troubleshooting guides and runbooks. Contribute to automation and tooling programs to make daily troubleshooting efficient. Work with the Spark Engineering Team and spread awareness of upcoming features and releases. Identify Spark bugs and suggest possible workarounds. Demonstrate ownership and coordinate with engineering and escalation teams to achieve resolution of customer issues and requests Participate in weekend and weekday on call rotation. Competencies Minimum 5 years' experience developing, testing, and sustaining Python or Java or Scala-based applications. Comfortable with compiling, building and navigating the Apache Spark source code. Comfortable with identifying and applying patches/bug fixes to the Apache Spark source code. Experience in Big Data/Hadoop/Spark/Kafka/Elasticsearch data pipelines. Hands-on experience with SQL-based database systems. Experience in JVM, GC, Thread dump-based troubleshooting is required. Experience with AWS or Azure related services. Bachelor's degree in Computer Science or a related field is required. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake and MLflow. To learn more, follow Databricks on Twitter,LinkedIn and Facebook . Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Compliance If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone. Qualification : Bachelor's degree in Computer Science or a related field is required.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted