Data Engineer - Data Platforms - AWS Job in International Business Machines Corporation
Data Engineer - Data Platforms - AWS
- Cochin, Ernakulam, Kerala
- Not Disclosed
- Full-time
Job Title: Data Engineer
Location: Rajkot, Gujarat, India
Company: IBM Consulting
Introduction
A career in IBM Consulting is rooted in long-term relationships and close collaboration with clients across the globe. You'll work with visionaries across multiple industries to improve the hybrid cloud and AI journey for some of the most innovative and valuable companies in the world. Your ability to accelerate impact and make meaningful change for your clients is enabled by our strategic partner ecosystem and our robust technology platforms across the IBM portfolio, including Software and Red Hat.
At IBM, curiosity and a constant quest for knowledge serve as the foundation for success. You'll be encouraged to challenge the norm, investigate ideas outside of your role, and come up with creative solutions that make groundbreaking impact for a wide network of clients. Our culture of evolution and empathy centers on long-term career growth and development opportunities in an environment that embraces your unique skills and experiences.
In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients. Our delivery centers provide locally based skills and technical expertise to drive innovation and adoption of new technologies.
Your Role and Responsibilities
As a Data Engineer, you will develop, maintain, evaluate, and test big data solutions. You will develop data solutions using the Spark framework with Python or Scala on Hadoop and the AWS Cloud Data Platform.
Key Responsibilities:
Data Pipeline Development:
- Build data pipelines to ingest, process, and transform data from files, streams, and databases.
- Process data using Spark, Python, PySpark, Scala, and Hive, along with HBase or other NoSQL databases, on Cloud Data Platforms (AWS) or HDFS.
Software Code Development:
- Develop efficient software code for multiple use cases built on the platform, leveraging the Spark framework with Python or Scala and Big Data technologies.
Streaming Pipelines:
- Develop streaming pipelines to process real-time data.
Cloud Data Solutions:
- Work with Hadoop/AWS ecosystem components to implement scalable solutions that keep pace with increasing data volumes, using big data and cloud technologies such as Apache Spark and Kafka.
Big Data/Cloud Technologies:
- Build scalable solutions on AWS, including tools like AWS EMR, AWS Glue, Databricks, AWS Redshift, DynamoDB, and other cloud technologies.
Required Education
- Bachelor's Degree in Computer Science, Engineering, or a related field.
Required Technical and Professional Expertise
- 5 to 7+ years of experience in Data Management (data warehouse, data lake, data platform, lakehouse) and Data Engineering.
- 4+ years of experience in Big Data technologies, with extensive experience in Spark, Python, or Scala.
- Minimum 3 years of experience on Cloud Data Platforms (preferably AWS).
- Exposure to streaming solutions and message brokers such as Kafka.
- Proficiency in SQL.
- Experience with AWS EMR, AWS Glue, Databricks, AWS Redshift, DynamoDB.
Preferred Technical and Professional Experience
- Certification in AWS or Databricks, or Cloudera Spark Certified Developer certification.
About IBM Consulting
IBM Consulting is IBM's consulting and global professional services business, with market-leading capabilities in business and technology transformation. With deep expertise in many industries, we offer strategy, experience, technology, and operations services to many of the most innovative and valuable companies in the world. Our people are focused on accelerating our clients' businesses through the power of collaboration. We believe in the power of technology responsibly used to help people, partners, and the planet.

