Apache Airflow Engineer Jobs in Bengaluru
1068 Jobs Found
Data Engineering Lead
Fampay
Data Engineering Lead Bengaluru | Engineering | Full-Time About Fam (formerly FamPay) Fam is India s first payments app designed for everyone aged 11 and above. FamApp enables seamless online and offline payments through UPI and FamCard. Our mission is to empower over **250 million young Indians** to start their financial journey early, becoming financially aware and confident. Founded in 2019 by IIT Roorkee alumni, Fam is backed by top-tier investors including Elevation Capital, Y-Combinator, Peak XV (Sequoia Capital India), Venture Highway, and angels like Kunal Shah and Amrish Rao. About the Role We re looking for a visionary **Data Engineering Lead** to take **end-to-end ownership** of Fam s data ecosystem from data ingestion and storage to processing and delivering actionable insights. You ll **define the data strategy and architecture** that supports both batch and **real-time** use cases, ensuring scalability, reliability, and governance across the organization. You will be instrumental in enabling accurate, complete, and trusted data flow that powers business intelligence, analytics, and product decision-making. This role involves **leadership, strategic thinking**, and hands-on problem solving. What You ll Do Own the full data lifecycle: ingestion, organization, storage, processing, and presentation. Define and execute **data architecture and strategy** aligned with operational and analytical goals. Build **scalable, reliable, and observable data systems** supporting batch and near real-time processing. Ensure **data quality, governance, and compliance**, proactively resolving discrepancies. Collaborate with product, engineering, and business teams to define, track, and optimize key metrics. Anticipate data-related challenges and implement preventive solutions. Lead, mentor, and grow the data engineering team, fostering innovation and accountability. Must-Haves 10+ years experience in data engineering, including proven leadership of teams or projects. Expertise designing, building, and scaling end-to-end data pipelines and systems. Deep understanding of the data lifecycle, from ingestion through business reporting. Strong communication skills and ability to collaborate across technical and business teams. Solid knowledge of **data governance, quality assurance, and compliance standards**. Experience with observability and proactive monitoring for data systems. Proficiency in Python and SQL; familiarity with Scala or Java. Hands-on experience with streaming and batch data frameworks. Experience designing large-scale data lakes and warehouses with best practices for schema evolution and partitioning. Strong background with **cloud platforms (AWS, GCP, or Azure)**. Fintech or regulated industry experience is a plus. Good to Have Fintech-specific data experience, including regulatory compliance and reporting. Deployment experience with **real-time analytics** and event-driven architectures. Familiarity with containerization and infrastructure tools like Docker, Kubernetes, Terraform. Knowledge of data observability tools (Monte Carlo, Databand, etc.). Exposure to **ML pipelines** and model deployment. Solve challenging problems at the intersection of big data, real-time processing, and fintech. Lead impactful data initiatives at a rapidly growing startup. Collaborate with a world-class team of engineers, data scientists, and product leaders. Competitive compensation, equity, and benefits. Clear career growth opportunities in leadership and innovation. Perks That Go Beyond the Paycheck Relocation assistance for a smooth move. Free office meals (lunch & dinner). Generous leave policies (birthday, period, parental support, and more). Salary advances and loan policies for financial support. Quarterly rewards, recognition, and referral incentives. Access to the latest gadgets and tools. Comprehensive health insurance with mental health support. Tax benefits like food coupons, phone allowances, and leasing options. Retirement benefits including PF contribution, leave encashment, and gratuity. About FamApp FamApp focuses on financial inclusion for the next generation by offering UPI and card payments to users aged 11+. Our flagship product, FamX, integrates UPI and card payments seamlessly, helping users manage, save, and learn about their finances effortlessly. With over **10 million users**, FamApp is revolutionizing how young Indians transact eliminating the need to carry cash and offering customizable FamX cards with personal doodles for a fun, unique payment experience. Join Our Dynamic Team At Fam, we foster a people-first culture with flexible work schedules, generous leave, comprehensive health benefits, and mental health support. You ll be part of a passionate, talented, and fun team shaping the future of fintech for India s youth.
Senior Data Engineer
Okta
Senior Data Engineer Enterprise Data Platform Location: Bengaluru Department: Business Technology Data Engineering Experience: 5+ Years Employment Type: Full-Time About Okta Okta is The World s Identity Company. We empower people to securely use any technology, anywhere, on any device. Through our Okta and Auth0 platforms, we provide secure access, authentication, and automation placing identity at the center of security and growth for thousands of organizations. We value diverse perspectives and lifelong learners. We re not looking for someone who checks every box we re looking for someone who will make us better with their unique experiences. Team: Business Technology Data Engineering The Data Engineering team at Okta supports cross-functional partners by building scalable, secure, and high-performing platforms. These platforms power decision-making and business processes across sales, marketing, engineering, finance, product, and operations. As part of this team, you ll contribute to data solutions that fuel Okta s hyper-growth. You will have the opportunity to work with cutting-edge technologies in cloud infrastructure, data lakes, automation, and CI/CD pipelines. The Role: Senior Data Engineer As a Senior Data Engineer, you will design, build, and manage modern data pipelines, infrastructure, and automation frameworks. You ll help scale our enterprise data platform using tools such as Snowflake, dbt, Airflow, Databricks, and AWS, while ensuring security, observability, and performance. You ll also contribute to CI/CD pipelines, infrastructure as code (IaC), and secure development lifecycle practices, enabling consistent, efficient, and secure delivery of data solutions. Key Responsibilities Platform Development & Infrastructure Design and maintain scalable data pipelines and platforms using Snowflake, AWS, Databricks, dbt, and Airflow. Manage infrastructure with Terraform, enabling repeatable and consistent deployments. Develop and maintain robust CI/CD pipelines using GitHub Actions, GitLab, or Jenkins. Containerize data services using Docker for better scalability and portability. Security & Compliance Implement and enforce secure development lifecycle practices, integrating tools like DAST, SAST, SCA, and Secret Scanning into pipelines. Conduct vulnerability scanning and apply patches to ensure system integrity. Ensure data security and compliance with industry standards and regulations. Collaboration & Innovation Collaborate with data engineers, data scientists, and analysts across business units to ensure data availability and integrity. Identify opportunities for automation and optimization within the data platform. Stay updated on emerging technologies and drive adoption of best practices. Must-Have Skills Bachelor s degree in Computer Science, Engineering, or a related technical field. 5+ years of experience in data engineering, including: Advanced SQL and ETL development with Airflow and dbt. Experience with data warehouses such as Snowflake, Redshift, or BigQuery. Strong hands-on experience with AWS (S3, Lambda, EC2, EMR, EKS). 2+ years of experience managing CI/CD pipelines using tools like GitHub Actions, GitLab, Jenkins, or ArgoCD. Experience with Terraform and Docker. Proficiency in backend languages such as Python, Java, or Go. Preferred Skills Experience with lakehouse architectures like Databricks, including knowledge of Delta Lake and Apache Iceberg. Background in infrastructure security, vulnerability management, and observability tooling. High Impact: Help build and scale the data platform that powers Okta s global business. Cutting-Edge Stack: Work with best-in-class technologies like AWS, Snowflake, dbt, Terraform, and Databricks. Collaborative Culture: Join a diverse, inclusive, and globally distributed team that values knowledge sharing and continuous learning. Career Growth: Shape the future of Okta s data engineering practice while expanding your technical and leadership skills. Bring your passion for data, cloud, and automation and let s shape the future of secure, scalable enterprise data platforms together. Qualification : Bachelors degree in Computer Science, Engineering, or a related technical field
ML Ops Engineer
Mpokket Financial Services Private Limited
Job Title: ML Ops Engineer Location: Bangalore Department: Data Science Employee Type: Full-time Experience Required: 3 5 years Position Overview We are seeking an experienced and motivated ML Ops Engineer to join our Data Science team. In this role, you will be responsible for deploying, monitoring, and maintaining machine learning models in production environments. You will work closely with data scientists, engineers, and product teams to ensure models are scalable, reliable, and aligned with business objectives. This role is ideal for professionals who are passionate about building robust ML pipelines and bringing machine learning solutions into real-world applications at scale. Key Responsibilities Deploy and manage machine learning models in production environments, ensuring scalability, reliability, and performance. Build and maintain MLOps pipelines using platforms like Databricks and MLflow. Monitor model performance, accuracy, and health; implement alerting and diagnostics as needed. Develop and maintain RESTful APIs using Python frameworks such as Flask or Django to serve ML models. Optimize data workflows and collaborate with engineering teams to improve model integration and performance. Design strategies for automated model retraining, deployment, and version control. Write clean, maintainable, and efficient code using Python, adhering to OOP principles and best practices. Write complex queries using SQL and work with NoSQL databases to support data pipelines and feature stores. Leverage Python libraries such as PySpark, Pandas, scikit-learn, SQLAlchemy, and Requests. Minimum Qualifications Bachelor s or Master s degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field. 3 5 years of experience in building, deploying, and monitoring machine learning solutions in production. Must-Have Skills Experience with Databricks and MLflow for model training and deployment. Proven expertise in machine learning model deployment and monitoring in live environments. Strong programming skills in Python, with solid understanding of data structures, algorithms, and OOP concepts. Experience developing RESTful APIs using Flask or Django. Proficient in SQL and NoSQL database operations. Hands-on knowledge of libraries such as Pandas, PySpark, scikit-learn, SQLAlchemy, and Requests. Strong analytical, problem-solving, and debugging skills. Good-to-Have Skills Experience with Kafka streaming and batch processing. Familiarity with CI/CD pipelines and version control systems like Git. Understanding of Python multiprocessing, worker/queue systems, and asynchronous/event-driven programming. This is a unique opportunity to work at the intersection of machine learning and DevOps. You'll play a critical role in operationalizing AI models and making them a core part of our product offerings. If you enjoy building scalable systems and solving real-world ML engineering challenges, we d love to meet you. Qualification : Bachelors or Masters degree in Computer Science, Statistics, Econometrics, Operations Research, or a related technical field
Business Technology Data Engineer
Samsara Inc
Position: Business Technology Data Engineer Location: Bengaluru, India (Hybrid 3 days onsite) Company: Samsara Technologies India Pvt. Ltd. About Samsara Samsara (NYSE: IOT) is a leader in the Connected Operations Cloud, enabling businesses across industries like transportation, logistics, manufacturing, and field services to harness IoT data for safety, efficiency, and sustainability improvements. Samsara helps organizations digitize physical operations at scale, improving outcomes that impact global infrastructure. Role Overview Samsara is seeking a Business Technology Data Engineer to join its Data & Analytics team within the Business Technology division. In this role, you will design, build, and optimize end-to-end data pipelines and infrastructure for various business-critical systems across CRM, marketing, support, and product platforms. You'll collaborate with teams across the company to build reliable and scalable data solutions that power reporting, automation, and analytics. This hybrid role requires working 3 days per week from the Bengaluru office and 2 days remotely, with working hours aligned to India Standard Time (IST). Key Responsibilities Data Engineering & Platform Development Design and maintain ETL/ELT pipelines that integrate and transform data across business systems. Build scalable data infrastructure to support advanced analytics and real-time reporting needs. Write Python and SQL scripts for data ingestion, transformation, and validation. Data Integration & Enablement Work with diverse data sources: CRM, product telemetry, marketing automation, support ticketing, and order flow systems. Develop and support data lake and data warehouse solutions using Snowflake, Redshift, Databricks, or BigQuery. Ensure interoperability between applications and data layers. Performance & Quality Monitor and optimize pipeline performance, implement observability and alerting. Improve data quality, lineage, and governance across systems. Partner with internal stakeholders (e.g., Sales Ops, Marketing Ops, Analytics) to deliver reliable data products. Minimum Qualifications Bachelor s degree in Computer Science, Data Engineering, or related field. 5+ years of professional experience in data engineering. 3+ years experience building and maintaining end-to-end pipelines in a modern data stack. Strong in SQL and Python. Hands-on experience with: ETL tools: Fivetran, dbt Cloud: AWS (preferred), GCP, or Azure Databases: MySQL, PostgreSQL, Oracle, or similar Data Warehouses: Snowflake, Redshift, BigQuery, Databricks Preferred Qualifications Familiarity with API-based ingestion, serverless architecture (Lambda, API Gateway, SQS, etc.). Experience with monitoring tools (DataDog, CloudWatch, Splunk). Comfortable engaging stakeholders to translate business needs into data solutions. Proficiency in Docker, Kubernetes, or AWS Fargate is a plus. Qualification : Bachelors degree in Computer Science, Data Engineering, or related field
Engineering Manager
Themathcompany
Job Title: Engineering Manager Data Engineering Location: Bengaluru, Karnataka, India Department: Engineering Experience: 6 to 8 years Open Positions: 2 About the Role As an Engineering Manager - Data Engineering, you will lead a team of skilled data engineers who design, build, and maintain scalable data pipelines and infrastructure. You will collaborate with cross-functional teams and client stakeholders to deliver high-quality data systems that meet business goals. Your leadership will be pivotal in mentoring your team, driving project execution, and advancing data engineering capabilities across the organization. Key Responsibilities Lead, mentor, and develop a team of data engineers, fostering a collaborative and inclusive work environment. Conduct performance reviews, provide constructive feedback, and set clear goals for team members. Identify skill gaps and create opportunities for continuous professional growth. Plan, execute, and deliver data engineering projects on schedule and within scope. Coordinate with stakeholders to gather requirements, prioritize tasks, and define project timelines. Ensure all projects align with broader business objectives and data strategies. Oversee design, development, and maintenance of data pipelines, ETL processes, and data warehouses. Guarantee data quality, integrity, and security in all data engineering initiatives. Identify and drive process improvements to enhance efficiency and effectiveness in data operations. Manage client conversations to understand requirements and translate them into technical deliverables. Build and promote reusable frameworks to drive efficiency in data systems. Lead multiple projects involving streaming, batch, and large-scale data pipelines. Required Technical Skills Strong execution knowledge of data modeling, relational and non-relational databases (SQL and NoSQL). Expertise with ETL and orchestration tools such as IICS, Metatron, Airflow, Azure Data Factory, AWS Glue, or GCP Composer. Experience working with data warehouses like Snowflake, Redshift, Hive, or BigQuery. Proficiency in Apache Spark and optimization of Spark jobs. Strong programming skills in Python (mandatory), with knowledge of Scala, Rust, or Java as a plus. Understanding of Medallion architecture patterns. Advanced SQL skills with query optimization expertise. Experience with software development lifecycle, unit testing, and functional programming concepts. Required Non-Technical Skills Strong problem-solving skills with the ability to assess financial impacts of decisions. Excellent written and verbal communication skills, capable of engaging with mid-management client stakeholders. Ability to balance pragmatic solutions against perfect ones, driving team consensus and business value. Exceptional people management skills, including conflict resolution, empathy, negotiation, and active listening. Proven leadership and mentorship abilities, providing technical guidance to delivery teams. Self-driven, with a strong sense of ownership and accountability. Preferred Educational Qualifications Bachelor s degree in Engineering (B.E./B.Tech), MCA, or M.Sc. (Mathematics, Statistics). Lead and mentor a talented team working on cutting-edge data engineering projects. Collaborate closely with clients and cross-functional teams in a dynamic, fast-growing company. Drive innovation with scalable, high-impact data solutions. Grow your leadership and technical skills in a supportive, inclusive environment. Qualification : Bachelors degree in Engineering (B.E./B.Tech), MCA, or M.Sc. (Mathematics, Statistics).
Senior Data Engineer
Synechron
Position Title: Senior Data Engineer Databricks, PySpark, Cloud Platforms Location: Bengaluru Bellandur (GTP) Employment Type: Full-time Job Summary Synechron is looking for a Senior Data Engineer to join our advanced analytics team in Bengaluru. In this role, you will architect and build scalable, high-performance data pipelines that power data science, analytics, and business intelligence initiatives. You ll work with modern tools including Databricks, PySpark, and cloud data platforms, while collaborating across teams to ensure high-quality, secure, and efficient data solutions. Key Responsibilities Design, develop, and maintain large-scale, secure, and efficient data pipelines using Databricks, PySpark, and cloud-native tools. Partner with data scientists, analysts, and business stakeholders to translate requirements into robust data solutions. Integrate data from various structured, semi-structured, and streaming sources. Ensure high standards for data quality, performance optimization, security, and cost efficiency. Drive data pipeline automation, orchestration, and monitoring using tools like Airflow. Lead troubleshooting efforts, performance tuning, and enhancements of existing pipelines. Stay informed about emerging data technologies and recommend adoption where relevant. Technical Skills Core Expertise Programming: Python (expert), SQL (advanced), PySpark. Platforms: Databricks (clusters, notebooks, workflows), AWS/Azure/GCP. Data Orchestration: Apache Airflow (or similar). Data Warehousing: Snowflake (preferred), data modeling, ETL/ELT pipelines. Streaming: Kafka or other stream processing tools. DevOps: CI/CD (GitLab CI, Jenkins), version control (Git), containerization (Docker/Kubernetes preferred). Security: Familiarity with encryption, access controls, and compliance best practices. Experience 8+ years of experience in data engineering or related roles. Proven expertise in developing and deploying scalable data pipelines using Databricks, PySpark, and SQL. Hands-on experience with cloud platforms (AWS, Azure, or GCP). Strong background in data warehousing, especially with Snowflake. Exposure to real-time data processing and orchestration tools. Experience implementing CI/CD pipelines for data workflows is a plus. Daily Responsibilities Build and optimize data ingestion, transformation, and storage workflows. Collaborate with cross-functional teams to align data solutions with business objectives. Monitor, troubleshoot, and continuously improve pipeline performance. Conduct data quality checks, ensure governance and compliance standards. Contribute to technical documentation, code reviews, and team knowledge sharing. Qualifications Bachelor s or Master s degree in Computer Science, IT, or related field. Relevant certifications (e.g., Databricks Certified Data Engineer, AWS Certified Data Analytics) are preferred. Professional Competencies Strong problem-solving and analytical mindset. Effective communicator with ability to collaborate across technical and non-technical teams. Time management and prioritization skills under tight deadlines. Proactive leadership and a passion for innovation. Commitment to ethical data use and data security. Diversity & Inclusion at Synechron Synechron is committed to building an inclusive, diverse, and equitable workplace. Through our global Same Difference DEI initiative, we celebrate and support people from all backgrounds, including race, gender, sexual orientation, religion, age, disability, and more. We offer flexible work arrangements, continuous learning, internal mobility, and mentoring programs to support every employee s growth. Qualification : Bachelors or Masters degree in Computer Science, IT, or related field
Senior Analyst - Data Engineering
Latentview Analytics
Role: Senior Analyst Data Engineering Location: Bengaluru, Karnataka, India Experience: 3 6 Years Employment Type: Permanent, Full-Time About the Role We are looking for a results-driven Senior Data Engineer to join our high-performing data team in Bengaluru. The ideal candidate will have 3 6 years of experience in data engineering, AI/ML implementation, and working with large-scale databases like Snowflake and Teradata. If you're passionate about driving data-powered insights, building scalable solutions, and applying advanced machine learning and AI techniques, we want to hear from you. Key Responsibilities Design, develop, and implement machine learning models to solve complex business challenges. Apply AI techniques, including generative AI, NLP, and computer vision, to improve analytics capabilities. Use Tableau, Power BI, and other tools to develop insightful, interactive data dashboards. Manage and optimize large datasets using platforms like Snowflake, Teradata, and SQL/NoSQL databases. Collaborate with business and technical teams to translate requirements into robust data engineering solutions. Guide junior data professionals and foster a culture of learning and innovation. Communicate analytical findings clearly to non-technical stakeholders. Stay current with the latest in data science, machine learning, cloud platforms, and big data technologies. Key Skills & Technologies Machine Learning & AI Techniques: Supervised & unsupervised learning, deep learning, neural networks Reinforcement learning, decision trees, random forests, clustering NLP, computer vision, GANs, transfer learning Data Visualization Tools: Tableau, Power BI, Matplotlib, Seaborn, Plotly Programming Languages & Libraries: Python (essential), R, SQL TensorFlow, PyTorch, scikit-learn, pandas, NumPy, Keras, SciPy Databases & Data Management: Snowflake, Teradata, SQL/NoSQL, ETL, data lakes, data warehousing Cloud Platforms: AWS, Azure, Google Cloud Platform (GCP) Big Data Technologies: Apache Spark, Hadoop
Data Engineer
Cognite
Data Engineer Location: Bengaluru Department: Global Strategic Services Data Engineering EMEA Type: Full-Time | Hybrid About Cognite Cognite is a global SaaS company at the forefront of industrial digital transformation. Our core platforms Cognite Data Fusion (CDF) and Cognite Atlas AI leverage AI and data to solve complex challenges across oil & gas, chemicals, pharma, manufacturing, and energy sectors. Recognized globally for innovation, Cognite helps businesses improve efficiency, reduce costs, and operate more sustainably. Our Values Impact: We focus on outcomes that truly make a difference. Ownership: We take responsibility and work inclusively to deliver success. Relentless: We are persistent problem-solvers who view challenges as growth opportunities. About the Team You ll join Cognite s Global Customer Success (GCS) organization spanning offices in Oslo, Austin, Houston, Tokyo, and now Bengaluru. The GCS team includes Project Managers, Data Engineers, Data Scientists, and Solution Architects, all working on high-impact industrial use cases across regions including MENA and SEA. About the Role As a Data Engineer, you'll play a key role in solving real-world industrial problems by designing, building, and optimizing scalable data architectures. You ll create robust pipelines to integrate and contextualize data using Cognite Data Fusion, working closely with cross-functional teams. This is also a growth role with the opportunity to step into a Tech Lead position. Key Responsibilities Design and implement scalable data solutions using Cognite Data Fusion. Integrate data sources using Cognite connectors, SQL, Python/Java, and REST APIs. Develop custom data models for cleansing, discovery, and mapping. Work with data scientists, architects, and PMs to deliver client outcomes. Conduct code reviews and apply best practices in maintainability and quality. Support customers and partners with data engineering tasks and challenges. Collaborate with Engineering and Product teams to influence product roadmap. What You ll Bring Bachelor's or Master s in Computer Science or related field. 3 5 years of experience in data-focused, customer-facing roles. Strong experience with Python, SQL, and REST APIs for production-grade pipelines. Familiarity with data from industrial domains like oil & gas or manufacturing is a plus. Experience with Kubernetes, Azure/GCP, and modern DevOps practices. Comfortable with Git, CI/CD, and cloud-native deployment workflows. A proactive, collaborative mindset with a passion for problem-solving and learning. Join a diverse global team with 70+ nationalities and a strong commitment to DEI. Work in a modern hybrid environment at our Bengaluru office (Rathi Legacy, Hoodi). Be part of a flat, transparent culture with direct access to leadership. Build cutting-edge solutions and influence how industries evolve digitally. Learn from top engineers and contribute to ambitious, high-impact projects. Enjoy a respectful, fun, and collaborative workplace where your voice matters. If you re excited to build data infrastructure that powers real change in heavy industries, Cognite offers you a platform to grow, lead, and innovate. We welcome candidates from all backgrounds and identities apply today and be part of something meaningful. Qualification : Bachelor's or Masters in Computer Science or related field.
Senior Data Engineer
Cognite
Senior Data Engineer Location: Bengaluru Department: Global Strategic Services Data Engineering EMEA Type: Full-Time | Hybrid About Cognite Cognite is a global SaaS leader driving industrial digital transformation. Our platforms Cognite Data Fusion and Cognite Atlas AI enable companies across Energy, Utilities, Manufacturing, and Chemicals to solve complex challenges using AI, contextual data, and automation. Cognite is backed by top-tier investors and recognized with global innovation awards. Our Values Impact: We deliver results that matter. Ownership: We take initiative, act inclusively, and embrace accountability. Relentless: We pursue excellence and innovation with resilience. About the Role As a Senior Data Engineer, you ll lead the development of scalable data solutions that empower critical industries to make informed decisions. You ll work on high-impact projects across regions, collaborating with solution architects, data scientists, and product teams. This is a growth role with room to influence product direction and mentor junior engineers. Key Responsibilities Architect and implement robust data pipelines using Cognite Data Fusion, Python, SQL, and REST APIs. Lead integrations, data modeling, and transformation tasks using cloud-native technologies. Design custom data models for discovery, mapping, and cleansing industrial data. Collaborate closely with cross-functional teams to deliver digital solutions. Conduct code reviews and champion engineering best practices. Contribute to Cognite s SDKs and internal tools. Translate customer use cases into scalable, reusable data engineering frameworks. Mentor team members and support customer onboarding when needed. What You ll Bring Bachelor s or Master s in Computer Science or related field (or equivalent experience). 5+ years in a data-intensive, customer-facing engineering role. Expertise in Python, SQL, REST APIs, and pipeline orchestration. Experience with distributed computing, Kubernetes, and cloud platforms (Azure, GCP). Familiarity with data from industrial domains like oil & gas or manufacturing is a plus. Strong DevOps mindset with hands-on experience in Git, CI/CD, and deployments. Proactive and collaborative; able to work independently and solve complex challenges. A growth mindset with willingness to ask for help and share knowledge openly. Be part of a global team spanning 70+ nationalities with strong DEI focus. Work at our Bengaluru office (Rathi Legacy, Hoodi) in a modern hybrid environment. Enjoy flat hierarchy, fast decision-making, and high ownership culture. Collaborate with world-class professionals on industry-transforming projects. Shape the future of industrial data and drive real-world impact at scale. If you re passionate about solving meaningful problems with cutting-edge data technologies, apply today. We welcome applicants from all backgrounds and experiences you might be the perfect fit, even if you don t meet every single requirement. Qualification : Bachelors or Masters in Computer Science or related field (or equivalent experience).
Data Engineer
Colan Infotech
Data Engineer Experience: 5+ Years Location: Bangalore, Karnataka, India Job Type: Full Time Job Summary We are looking for a talented Data Engineer with over 5 years of experience and a strong foundation in machine learning development to join our team in Bangalore. The ideal candidate will have hands-on expertise in Python programming, machine learning basics, and computer vision techniques like custom object detection and OCR. Key Responsibilities Develop and maintain data pipelines supporting machine learning models and data analytics. Implement custom object detection algorithms and OCR solutions using computer vision techniques. Utilize Python ML libraries such as OpenCV, SciPy, NumPy, Matplotlib, Pandas, Scikit-learn, Keras, PyTorch, and TensorFlow. Collaborate with data scientists and software engineers to optimize data workflows and ML model deployment. Ensure data quality, integrity, and scalability within data infrastructure. Troubleshoot and improve existing machine learning systems and pipelines. Required Skills Minimum of 5 years of experience in data engineering or related roles. At least 2 years of hands-on experience as a Machine Learning developer. Strong programming skills in Python. Solid understanding of machine learning fundamentals. Practical experience with custom object detection and OCR applications. Proficiency in key Python libraries for machine learning and data processing. Ability to work collaboratively in cross-functional teams. Qualifications Any graduate degree from a recognized university. Work with cutting-edge machine learning technologies in a supportive, innovative environment in Bangalore. Grow your career by solving complex problems and building impactful data solutions with a passionate team. Qualification : Any graduate degree from a recognized university.
Engineering Leader - Machine Learning
Eightfold
Engineering Leader - Machine Learning Location: Bangalore, Karnataka, India Employment Type: Full-Time | Hybrid Work Model About Eightfold.ai At Eightfold.ai, we re revolutionizing the future of employment by leveraging artificial intelligence to connect individuals to the right career opportunities based on their skills, not just their network. Our AI-powered Talent Intelligence Platform is transforming how organizations plan, hire, develop, and retain a diverse workforce. With $410M+ in funding and a $2B+ valuation, we re growing rapidly, and if you're passionate about solving one of society's most critical challenges employment then Eightfold is the place for you. Led by visionary leaders like Ashutosh Garg (former Google Search and Personalization leader) and Varun Kacholia (former Facebook and Youtube leader), we are shaping the future of AI-powered talent management. About the AI Platform Team Our AI/ML team is the heart of Eightfold, pushing the boundaries of applied machine learning. We re working with massive datasets, solving complex problems, and building cutting-edge AI models that are reshaping how companies approach talent management. Join us if you re eager to work in a team where every day presents a new challenge and opportunity. What You ll Do As the Engineering Leader - Machine Learning, you ll be responsible for leading and mentoring a high-performing team of engineers, driving the success of AI-driven products at Eightfold. Your primary responsibilities will include: Team Leadership: Coach, mentor, and manage a talented team of engineers to foster a culture of innovation, collaboration, and high performance. ML Model Ownership: Lead the development and deployment of cutting-edge deep learning models across all Eightfold products, ensuring reliability, scalability, and high-quality performance. Platform Development: Help build high-performance, flexible infrastructure that supports a variety of deep learning techniques and modeling approaches. End-to-End ML Pipeline: Oversee the end-to-end process of building and deploying machine learning models, including creating robust data pipelines that can process unstructured data. ML Framework Implementation: Design and implement an intuitive ML development framework that ensures efficiency and ease of use for data scientists and engineers. Model Fairness: Work with our internal model fairness platform to ensure that we are providing equal opportunity for everyone through responsible ML practices. Cross-Team Collaboration: Work closely with product teams to apply deep learning techniques to solve complex business problems across various domains. What You Should Already Know To be successful in this role, you should have: Strong Foundation in ML & Deep Learning: Expertise in applying Natural Language Processing (NLP) and deep learning solutions to solve real-world problems. Experience with Language Models: Familiarity with advanced language models such as BERT, GPT-3, T5, and others. Academic Background: A BS, MS, or PhD in Computer Science, Data Science, Mathematics, or related fields. Proven ML Experience: Hands-on experience building and deploying machine learning models at scale, particularly in production environments. Programming Expertise: Strong knowledge of ML languages such as Python, C++, Java, R, Scala, and experience with scientific libraries like numpy, pandas, and frameworks like TensorFlow, PyTorch, scikit-learn, etc. Strong ML Theory Knowledge: In-depth understanding of ML theory and experience working with large-scale datasets, data ingestion, and processing systems. Experience with Distributed Systems: Familiarity with distributed systems, including REST APIs, microservices, and data processing frameworks. Metrics-Focused: A passion for building high-quality models that deliver results and metrics-driven outcomes. Nice to Have Real-Time Tech Problems: Experience with real-time processing or low-latency systems. Cloud Environments: Familiarity with cloud platforms like AWS, and experience using cloud-based ML tools. MLOps Tools & Pipelines: Experience with MLflow, Metaflow, or similar tools to streamline ML workflows and operations. Advanced Tech Stack: Familiarity with tools like Spark, MLlib, Databricks, Apache Airflow, etc. Impactful Work: Join a company dedicated to solving one of society's most pressing issues employment. Your work will have a direct impact on individuals' careers around the world. Innovation at Scale: Work with cutting-edge AI and ML technologies to shape the future of talent management. Competitive Compensation: Receive an attractive salary, equity, and comprehensive benefits package (including family medical, vision, and dental coverage). Collaborative Environment: Work in a culture that values transparency, ownership, and collaboration across teams. Hybrid Work Model: Enjoy a hybrid work environment, with flexibility for remote work and in-office collaboration at our Bangalore office. Growth Opportunities: Be part of a rapidly scaling company with vast opportunities for career development and leadership roles. Equal Opportunity Employer Eightfold.ai is an Equal Opportunity Employer. We do not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, or disability. If you are an experienced and visionary leader in Machine Learning, eager to make a lasting impact while solving one of society's most important challenges, we would love to hear from you. Qualification : BS or MS or PhD degree in Computer Science, Data Science or Mathematics
Python/apache Airflow Developer
Wipro Limited
Python/Apache Airflow Developer Location: Bengaluru, India Company: Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) Company Overview Wipro Limited is a leading technology services and consulting company focused on building innovative solutions that address clients' most complex digital transformation needs. Leveraging our holistic portfolio of capabilities in consulting, design, engineering, and operations, we help clients realize their boldest ambitions and build future-ready, sustainable businesses. With over 230,000 employees and business partners across 65 countries, we deliver on the promise of helping our customers, colleagues, and communities thrive in an ever-changing world. For more information, visit www.wipro.com. Role Purpose The purpose of this role is to design, test, and maintain software programs for operating systems or applications to be deployed at the client end and ensure they meet 100% quality assurance parameters. Position Overview We are seeking a skilled Python and Apache Airflow Developer with a strong background in data engineering to help develop and maintain robust data pipelines and support business intelligence initiatives. Key Responsibilities Develop and maintain data pipelines using Apache Airflow and Python. Collaborate with business managers to understand data requirements. Optimize existing data workflows for performance and reliability. Manage SQL and NoSQL databases for business intelligence. Ensure data quality and security. Troubleshoot data pipeline issues and ensure smooth data flow. Required Qualifications Proven experience in Python programming and data engineering. Hands-on experience with Apache Airflow. Strong understanding of SQL and experience with SQL databases. Familiarity with NoSQL databases. Experience in data modeling and ETL processes. Must have: Python and Apache Airflow, with preferably 5+ years of experience. About Wipro Wipro is constantly evolving, and we re looking for passionate individuals who want to build a modern digital transformation partner with bold ambitions. Join Wipro, where we empower you to design your own reinvention. Realize your ambitions and take part in our global, purpose-driven business. Applications from people with disabilities are explicitly welcome.
Ai Platform Architect
Adobe
AI Platform Architect Location: Bangalore, Karnataka, India Employment Type: Full-Time About Adobe Adobe is changing the world through digital experiences. Whether you're an emerging artist or a global brand, our tools empower creativity and innovation across every screen. From powerful imaging and video solutions to immersive web and app design, Adobe s mission is to help people and businesses deliver exceptional digital experiences. We are committed to creating an inclusive workplace where everyone is respected and given equal opportunity. Innovation can come from anywhere and the next big idea could be yours. Job Description We are looking for a visionary AI Platform Architect with deep expertise in building and scaling cloud-native, AI-powered platforms. The ideal candidate will have experience deploying large-scale, customer-facing AI solutions and a deep understanding of modern cloud architecture, data systems, MLOps, and LLMOps. Responsibilities Design and develop scalable AI/ML platforms and pipelines across AWS, Azure, and GCP. Architect end-to-end LLM pipelines including model training, fine-tuning, serving, inference APIs, and monitoring. Lead cross-functional teams in delivering AI solutions from experimentation to production. Implement MLOps and LLMOps best practices using tools like MLFlow, SageMaker, Langchain, and LangGraph. Design GPU-optimized architectures for training and inference of LLMs using DeepSpeed, vLLM, and other modern frameworks. Support infrastructure automation and container orchestration with Kubernetes, Docker, and CI/CD pipelines. Collaborate with internal stakeholders and clients to understand requirements, evangelize platform solutions, and ensure successful delivery. Key Skills and Expertise Cloud and DevOps: Expertise in AWS, Azure, GCP especially VPC design, cloud databases, and serverless architecture. Certified in AWS Professional Solution Architect, AWS ML Specialty, or Azure Solutions Architect Expert (preferred). Proficient with Kubernetes, Docker, FluentD, Kibana, Grafana, Prometheus. Data and Streaming: Experience with OLTP/OLAP databases and cloud-native data warehouses like BigQuery, Aurora, Spanner. Hands-on with Kafka, Apache Flink, Spark, Airflow, Databricks, Apache Iceberg, Presto. AI/ML & LLM Expertise: In-depth understanding of LLMs (GPT, Gemini, Claude, Mixtral, Llama, Hugging Face OSS models). LLMOps frameworks: Langchain, Langgraph, Langflow, Flowise, LLamaIndex. ML lifecycle tools: MLFlow, SageMaker, Vertex AI, Azure AI, AWS Bedrock. Proven experience in model optimization, fine-tuning, and high-throughput inference systems. Programming Languages: Proficient in Python, SQL, and JavaScript. Preferred Qualifications 10+ years in cloud and AI/ML platform architecture roles. Experience delivering AI solutions for enterprise-scale clients. Hands-on experience with GPU architecture and parallel/distributed training. Strong communication skills with ability to influence technical and business stakeholders. Work on cutting-edge AI technologies and shape future product experiences used by millions. Collaborate with world-class engineers and scientists in a diverse, inclusive culture. Be part of a company that values creativity, innovation, and employee well-being. Adobe is proud to be an Equal Opportunity Employer. We welcome and encourage candidates from all backgrounds to apply.
Sr. Data Engineer- Aws- Big Data
Infocepts
Sr. Data Engineer - AWS - Big Data Location:Bangalore Type of Employment: Full-Time Experience Required: 7 to 10 years Job Overview: We are seeking a highly skilled Sr. Data Engineer with expertise in AWS cloud technologies and Big Data to join our Cloud Data Architect Team at Infocepts. In this critical role, you will design and implement robust data solutions using technologies like EMR, Athena, PySpark, AWS Lambda, S3, and other AWS services. The ideal candidate will have a strong foundation in database concepts and SQL and will be responsible for building scalable data pipelines to support high-performance data processing. Key Responsibilities: Technology Assessment and Design: Study the existing technology landscape and evaluate current data integration frameworks. Assist in designing complex Big Data use cases leveraging AWS services. Documentation and Stakeholder Communication: Prepare and maintain comprehensive project documentation, adhering to quality guidelines and schedules. Work closely with Architects and Project Managers to provide accurate estimations, scoping, and scheduling assistance. Clearly communicate design decisions and conduct Proof-of-Concepts to validate new solutions before implementation. Process Improvement and Automation: Identify areas for process automation to improve efficiency and team productivity. Provide expert guidance and troubleshooting support to junior Data Engineers. Training and Knowledge Sharing: Develop and deliver technology-focused training sessions for the team, ensuring continuous knowledge sharing. Share expertise through Expert Knowledge Sharing sessions with Client Stakeholders. Essential Skills: AWS Services Expertise: In-depth knowledge of S3, EC2, EMR, Athena, AWS Glue, and Lambda. Big Data Technologies: Proficiency with Apache Spark, Databricks, and Big Data table formats such as Delta Lake (open-source). Data Warehousing: Strong understanding of data warehousing concepts and architectures. Programming Skills: Advanced programming skills in Python for building data pipelines. SQL Expertise: Strong SQL skills for data transformation, aggregation, and querying large datasets. ETL Workflow Development: Expertise in creating ETL workflows with complex transformations (e.g., SCD, deduplication, aggregation). Orchestration Tools: Familiarity with orchestration tools like Apache Airflow. MPP Databases: Experience with at least one MPP database (e.g., AWS Redshift, Snowflake, SingleStore). Cloud Databases: Exposure to cloud databases like Snowflake or AWS Aurora. Desirable Skills: Cloud Databases: Familiarity with Snowflake, AWS Aurora. Big Data Technologies: Experience with Hadoop and Hive. AWS Certification: Associate or Professional Level AWS Certification. Advanced Knowledge of Big Data Solutions: Exposure to big data tools and frameworks on cloud platforms. Qualifications: Experience: 7+ years of overall IT experience, with 5+ years specifically focused on AWS-related projects. Educational Background: Bachelor's degree in Computer Science, Engineering, or a related field (Master's degree is a plus). Technical Certifications: Demonstrated commitment to continuous learning through certifications or relevant training. Qualities: Strong analytical and problem-solving skills to deep dive into complex technical challenges.
Cloud Data Engineer - AWS Big Data
Infocepts
Position: Cloud Data Engineer AWS Big Data Location: Bangalore, India Employment Type: Full-time Experience Required: 5 to 8 years Purpose of the Position: Join the Infocepts Cloud Data Architect Team as a Cloud Data Engineer and help design and implement cutting-edge big data solutions on AWS. You will leverage your expertise in EMR, Athena, PySpark, S3, AWS Lambda, and SQL to develop robust and scalable data platforms. Key Responsibilities: Technology Assessment and Design: Assess existing technology landscape and data integration frameworks. Design complex Big Data use cases using AWS services under guidance of the Architect. Support architectural decision-making by evaluating trade-offs in cost, performance, and durability. Recommend optimizations to existing data infrastructure. Documentation and Stakeholder Communication: Create project documentation adhering to quality and delivery standards. Collaborate closely with Architects and Project Managers for scoping, estimation, and planning. Present design decisions to technical and business stakeholders clearly. Conduct PoCs and design review sessions. Process Improvement and Automation: Identify and suggest opportunities for automation and process enhancements. Mentor junior engineers and support technical problem solving. Training and Knowledge Sharing: Prepare and deliver internal training on AWS and Big Data topics. Lead client knowledge sharing sessions and contribute to case studies. Essential Skills: In-depth experience with AWS services: S3, EC2, EMR, Athena, Glue, Lambda Familiarity with MPP databases like Redshift, Snowflake, or SingleStore Proficiency in Apache Spark and Databricks Strong programming skills in Python Experience building data pipelines using AWS and Databricks Knowledge of Big Data file formats such as Delta Lake Advanced SQL skills for large-scale data manipulation Hands-on experience with Apache Airflow or similar orchestration tools Strong understanding of ETL workflows and data warehousing concepts Desirable Skills: Cloud databases: AWS Aurora, Snowflake Experience with Hadoop and Hive AWS Certifications (Associate or Professional level) are a plus Qualifications: Bachelor s degree in Computer Science, Engineering, or related field (Master s preferred) Overall 5+ years of IT experience with at least 3 years in AWS Big Data projects Ongoing learning and technical certifications are strongly encouraged Key Qualities: Strong problem-solving and analytical thinking Self-driven with a passion for emerging data technologies Excellent communication and client presentation skills Ability to work in cross-functional, agile teams Apply now to be part of a high-impact data transformation team working on large-scale cloud data projects! Qualification : Bachelors degree in Computer Science, Engineering, or related field (Masters preferred)
Data Engineer 1
Loadshare Networks
Job Title: Data Analyst Location: Bengaluru Company: LoadShare Networks About LoadShare Networks At LoadShare, we re building India s largest intra-city logistics marketplace. Founded in 2017, we are a Series C startup backed by a strong set of investors (Tiger Global, Matrix Partners, BII, Stellaris, BeeNext, Filter Capital), focused on building a profitable and impactful business. We work with all major enterprise clients covering a wide range of intra-city use cases, including: Quick Commerce: Blinkit, Zepto, BBNow Food Delivery: Swiggy, Zomato Hyperlocal: Apollo, Licious Grocery: Reliance, Spencers, ONDC eCommerce: Flipkart, Amazon, Meesho, ShipRocket Bike Taxi: Uber, Ola Scale & Reach: Delivering over 500K orders per day across food, e-commerce, grocery, etc. Operating in 500+ towns nationwide with a fleet of 20K+ riders. Our tech platform powers 1M+ shipments daily. Role Overview As a Data Analyst at LoadShare, you will be an integral part of our Data team, responsible for designing, developing, and maintaining analytics reports and dashboards that measure, monitor, and alert the business regarding operations and product metrics. You ll work closely with internal teams to troubleshoot data issues, build ETL pipelines, and contribute to process improvements and innovation. Key Responsibilities Analyze, design, and develop analytics reports and dashboards to track key business metrics, providing insights into operational performance. Use your strong SQL skills to troubleshoot data issues in complex workflows and work with product and tech teams to resolve them. Build and maintain ETL pipelines to support data movement and integration across systems. Create ad-hoc reports as requested, ensuring timely and accurate delivery. Present findings and actionable recommendations to both business and non-technical stakeholders using effective storytelling techniques. Collaborate with product, business, and tech teams to understand the business domain and deliver insightful reports and dashboards that support business growth and optimization. Continuously improve data models, ensuring data is accessible and accurate for decision-making across the organization. Work on in-house technology projects to bring innovation and process improvements. Job Requirements Technical Skills: Strong proficiency in SQL and experience with writing and tuning SQL scripts. Around 1-2 years of experience coding in Python (experience in web scraping is a plus). Experience in data modeling, data warehousing, and building ETL pipelines. Familiarity with Business Intelligence (BI) tools such as Zoho Analytics, Metabase, Redash, Tableau, or Power BI. Collaboration: Ability to partner effectively with product, business, and tech teams to understand the business needs and develop reports and dashboards that support business growth. Communication: Strong communication skills, both written and verbal, with the ability to present complex data and insights to both technical and non-technical stakeholders. Problem-Solving: Proactive in identifying data issues, troubleshooting, and providing solutions that improve data integrity and usability. Key Skills SQL, Python, ETL Pipelines Data Modeling, Data Warehousing BI Tools: Zoho Analytics, Metabase, Redash, Tableau, Power BI Data Analysis & Reporting Communication & Collaboration
Sr. Data Engineer
Trellissoft Engineering Services Pvt Ltd
Job Title: Data Engineer Location: Bengaluru, Karnataka Experience: 5 to 8 Years Work Modality: Full-time (Work from office) Job Description: We are looking for an experienced Data Engineer to join our team and take responsibility for designing, developing, and maintaining scalable ETL/ELT pipelines. This is a full-time position based in Bengaluru, Karnataka, and you will be collaborating with cross-functional teams to define data requirements and ensure data accuracy, consistency, and integrity. Your role will also involve optimizing data workflows, automating processes, and ensuring high availability and reliability of data pipelines. Key Responsibilities: ETL/ELT Pipeline Development: Design, develop, and maintain scalable ETL/ELT pipelines to support data transformation and integration processes. Data Warehouse & Data Lake Optimization: Build and optimize data warehouses, data lakes, and real-time streaming solutions to support large-scale data operations. Collaboration & Data Requirements: Collaborate with cross-functional teams, such as product, data science, and analytics teams, to define data requirements and ensure data accuracy and consistency. Database Structure & Schema Management: Develop and maintain database structures and schemas to ensure efficient data storage and retrieval. Data Workflow Optimization: Optimize data workflows for performance, reliability, and scalability, ensuring the highest level of efficiency. Data Security & Compliance: Implement data security, governance, and compliance best practices to ensure that data is handled securely and meets industry standards. Pipeline Monitoring & Troubleshooting: Monitor, troubleshoot, and improve data pipelines to ensure uptime, reliability, and smooth data processing. Process Automation: Automate data-related processes to improve efficiency and reduce manual intervention, increasing the overall speed of data flow. Required Qualifications: Experience: 5+ years of experience in data engineering or 3-4 years of experience as a Data Engineer. Technical Skills: Strong proficiency in SQL and database management systems such as PostgreSQL, MySQL, SQL Server, etc. Experience with ETL tools such as Pentaho, Talend, Cdata, and SSIS. Exposure to Python, Java, or Scala for data processing is a plus. Experience with big data technologies such as Apache Spark, Hadoop, or Kafka. Familiarity with cloud services (AWS, Azure) and data storage solutions such as S3, Redshift, Snowflake, or BigQuery. Strong knowledge of data modeling, warehousing concepts, and data architecture best practices. Soft Skills: Excellent communication skills with the ability to collaborate effectively across teams. Strong problem-solving skills and the ability to work with large, complex datasets. What We Offer: Competitive Salary: Attractive salary based on experience and expertise. Collaborative Work Environment: Work in a dynamic and fast-paced environment with a team that fosters innovation and collaboration. Growth Opportunities: Opportunities to enhance your skills and career growth in the data engineering field. Comprehensive Benefits: Benefits package designed to support work-life balance and overall employee well-being.
Data Engineer: Data Warehouse
International Business Machines Corporation
Job Title: Application Developer - ETL and Data Management Introduction: In this role, you ll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we provide deep technical and industry expertise to a wide range of public and private sector clients globally. Our delivery centers leverage locally-based skills to help clients drive innovation and the adoption of new technologies. A career in IBM Consulting is built on long-term relationships and close collaboration with clients across the world. You will work with leaders across industries to improve the hybrid cloud and AI journeys for some of the most innovative and valuable companies worldwide. Your ability to make a meaningful impact for clients is enabled by our strategic partner ecosystem and robust technology platforms, including Software and Red Hat. Curiosity and a constant quest for knowledge are key to success in IBM Consulting. In your role, you ll be encouraged to challenge the norm, explore new ideas, and come up with creative solutions that result in groundbreaking impact for a wide network of clients. Our culture is built on evolution and empathy, focusing on long-term career growth and development in an environment that values your unique skills and experience. Your Role and Responsibilities: ETL Workflow Development: Develop and implement ETL workflows by creating ETL jobs, and data models in datamarts using technologies such as Snowflake, DBT, Unix, and SQL. Batch Processing Redesign: Redesign Control M Batch processing for ETL job builds to run efficiently in a production environment. System Evaluation and Improvement: Study the existing system to evaluate its effectiveness and design new systems to improve workflow efficiency. Business Program Analysis & Support: Perform requirements identification, business program analysis, testing, and system enhancements while providing production support. Agile Environment: Work effectively in an Agile environment and gain familiarity with tools such as JIRA and SharePoint. Client Interaction: Good written and verbal communication skills are essential as you will interact directly with client counterparts to understand requirements and provide solutions. Required Technical and Professional Expertise: Experience: A minimum of 3 years of experience in developing ETL applications, implementing workflows, and creating data models using Snowflake, DBT, Unix, and SQL technologies. Agile Environment: Strong understanding of working in an Agile environment and proficiency in tools like JIRA and SharePoint. Problem-Solving Skills: Ability to manage change and proven time management skills. Strong interpersonal skills to contribute effectively to team efforts. Continuous Learning: Stay up-to-date with technical knowledge by attending educational workshops and reviewing relevant publications. Preferred Technical and Professional Expertise: ETL Development: Experience in developing triggers, functions, and stored procedures to support ETL workflows. Impact Analysis: Assist with impact analysis of changing upstream processes on the Data Warehouse and reporting systems. ETL & Reporting Support: Participate in the design, testing, support, and debugging of new and existing ETL and reporting processes. Data Profiling & Troubleshooting: Perform data profiling and analysis using a variety of tools, troubleshoot and support production processes, and maintain documentation. Innovation: Be part of a team that drives global change and leverages cutting-edge technologies to solve complex problems. Growth: Gain access to continuous learning and career development opportunities to further your expertise in data management and cloud technologies. Collaboration: Work with a diverse team in a collaborative environment that values new ideas and creative solutions. Global Impact: Your work will contribute to improving business operations and technological advancements for clients around the world. If you're passionate about driving innovative solutions, working with a variety of clients, and continuously evolving your skills, IBM Consulting is the perfect place for you to advance your career.
Staff Data Engineer
Intuit
Intuit is a global leader in financial technology, dedicated to helping individuals and businesses thrive. Our suite of products, including TurboTax, Credit Karma, QuickBooks, and Mailchimp, serves approximately 100 million customers worldwide. At Intuit, we believe in providing everyone with the tools and resources they need to achieve financial success. We are constantly innovating to make financial empowerment a reality for all. Job Overview Join the Intuit Data Platform (IDP) team as a Staff Engineer and help us transform the way we handle big data! The IDP team is responsible for the Intuit Analytics Platform, which powers real-time data ingestion, cataloging, analytics, and machine learning across the entire organization. As Intuit s customer base grows, so does the volume of data we process. Our engineering excellence ensures that we can scale and leverage this data to drive machine learning and product innovations. We re in the process of building the next-generation real-time and batch ingestion engine, capable of indexing, cataloging, and organizing data and metadata. We are passionate about using open-source technologies to solve challenges and contributing back to the community. If you're excited about building a platform that will directly impact data scientists and analysts and have a desire to shape the future of data at Intuit, then come join us! Key Responsibilities Architect & Design: Build fault-tolerant and scalable big-data platforms using open-source technologies to handle massive datasets. Data Solutions: Create architecture solutions that address complex use cases like data normalization, lineage, governance, ontology, and discoverability. Cross-Team Collaboration: Work with analysts and data scientists to understand data requirements for building operational propensity models and gaining deep customer insights. Hands-On Coding: Lead development efforts within the Hadoop ecosystem using technologies such as Java MapReduce, Spark, Scala, HBase, and Hive to build and optimize data pipelines for both real-time and batch applications. Database Management: Work with NoSQL, SQL, and in-memory databases to design high-performance data systems. Code Reviews: Ensure code quality, consistency, and adherence to best practices through regular code reviews. Architectural Alignment: Ensure alignment between enterprise architecture and business requirements. Prove Feasibility: Conduct proof-of-concept (POC) experiments for new technologies or approaches and drive them to production. Collaboration with Data Cataloging Team: Work closely with data catalog teams and architects to index and catalog all data sources at Intuit. Agile Leadership: Lead fast-paced development teams using agile methodologies and promote best practices in software development, testing, and incident response. Design & Model: Build dimension models suited for customer business use cases and ensure seamless integration of business and technical requirements. Qualifications Experience: 12+ years of relevant experience, with at least 5+ years specializing in the big data domain. Big Data Architecture: Proven experience in architecting end-to-end ecosystems for big data and analytics platforms. Expert Knowledge: Deep expertise in building fault-tolerant, scalable big data solutions, especially using the Hadoop ecosystem (Hive, HBase, Spark, Kafka, MapReduce, etc.). Programming Expertise: Mastery of Java and Scala, with a focus on building high-throughput data services. Machine Learning: Knowledge of machine learning principles and AI applications in big data. Big-Data Technologies: Familiarity with tools such as HDFS, Storm, Zookeeper, Cassandra, Redshift, GraphDB, and others. Understanding both real-time and batch processing in the Hadoop ecosystem. Communication: Strong communication skills, with an ability to explain complex technical topics to both technical and non-technical audiences. Programming Skills: Intermediate experience in Python or R for data processing. Education: BE/BTech/MS in Computer Science or a related field (or equivalent experience). Collaboration: Demonstrated ability to work cross-functionally and lead change through influence and example. At Intuit, you ll be part of a talented, passionate team working on innovative solutions that shape the future of data analytics and machine learning. As a Staff Engineer, you ll have the chance to work with cutting-edge technologies, build scalable systems, and help revolutionize how Intuit leverages data to drive product innovation. If you're looking for a dynamic environment where you can have a meaningful impact, come join us at Intuit! Qualification : BE/BTech/MS in Computer Science (or equivalent)
Sr. Data Engineer
Databricks
Job Title: Data Engineer Job Summary As a Data Engineer in the IT team, you will work on various big data challenges using the Databricks platform. You will provide data engineering, data science, and cloud technology projects which require integrating with client systems, training, and other technical tasks to help business to get most value out of their data. The Impact You Will Have You will work on a variety of impactful Big Data projects which may include building reference architectures, how-to's and production grade MVPs. Work on implementing transformational big data projects, 3rd party migrations, including end-to-end design, build and deployment of industry-leading big data and AI applications. Work on architecture and design; bootstrap or implement strategic projects. What We Look For 7+ years experience with Big Data Technologies such as Apache Spark , Kafka, Cloud Native and Data Lakes. 4+ years of experience working on Big Data Architectures independently. Preferred experience working in the Databricks ecosystem. Comfortable writing code in either Python or Scala. Experience working across Cloud Platforms (GCP / AWS / Azure). Documentation and white-boarding skills. Build skills in technical areas which support the deployment and integration of Databricks-based solutions to complete customer projects. About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond Nast, Grammarly, and over 50% of the Fortune 500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics, and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark , Delta Lake, and MLflow. To learn more, follow Databricks on Twitter, LinkedIn, and Facebook. Benefits At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted