Big Data Engineer - Kafka Job in Srijan Technologies
Big Data Engineer - Kafka
- New Delhi, NCT
- Not Disclosed
- Full-time
Company:-
Srijan Technologies is a 18-year-old technology services firm.
For a large part of its life, Srijan has specialised in building content management systems with expertise in PHP-based open-source CMS, specifically Drupal. In recent years Srijan has diversified into
Srijan is approx 400 people strong (including some Senior Consultants) Srijans development offices in India are located in New Delhi, Gurugram, Goa, Bangalore and Mumbai, Delhi, Gurgaon and Goa are the largest offices. In addition, a few developers & delivery leads are located in several countries globally -- USA (New York, Charlotte), Singapore, Philippines (Manila), Australia (Sydney, Brisbane, Melbourne), Germany (Berlin), Japan (Tokyo). In each of these countries, Srijan has a functional legal subsidiary.
Srijan works largely with enterprises or mid-large sized global firms and focuses on recurring business from these accounts, thereby bringing much-needed predictability of revenue for high-growth companies. It works with several top brands at the moment.
The firm is beginning to invest in startups and in joint research projects with top institutes. For instance, it recently partnered with IIT-Delhi to invest in an 18-month project for building a solution for Honey traceability using Blockchain.
The leadership team at Srijan has set itself an audacious goal of reaching $25 million in revenue (while maintaining healthy EBIDTA margins) in FY 2020 -- doubling our revenues. This requires significant technology and delivery leadership bandwidth to be created in the firm to ensure our high-quality standards are not compromised.
Each year Srijan donates 7% of its profits to Srijan Foundation Trust a registered non-profit which runs several projects including non-formal schools (directly or via partner organizations) and Indic civilizational projects such as #SrijanTalks.
Please visit company website for more information
Job Responsibilities:
- Develop scalable and reliable data solutions to move data across systems from multiple sources in real time as well as batch modes (Kafka)
- Build Producer and Consumer applications on Kafka, and appropriate Kafka configurations
- Designing, writing, and operationalizing new Kafka Connectors using the framework
- Accelerate adoption of the Kafka ecosystem by creating a framework for leveraging technologies such as Kafka Connect, KStreams/KSQL, Schema Registry, and other streaming-oriented technology
- Implement Stream processing using Kafka Streams / KSQL / Spark Jobs along with Kafka
- Bring forward ideas to experiment and work in teams to transform ideas to reality
- Architect data structures that meet the reporting timelines
- Work directly with engineering teams for design and build their development requirements
- Maintain high standards of software quality by establishing good practices and habits within the development team while delivering solutions on time and on budget.
- Proven communication skills, both written and oral
- Demonstrated ability to quickly learn new tools and paradigms to deploy cutting edge solutions
- Create large scale deployments using newly conceptualized methodologies
Skills:
- Proven hands-on experience with Kafka is a must
- Proven hands-on experience with Hadoop stack (HDFS, Map Reduce, Spark)
- Core development experience in one or more of these languages: Java, Python / PySpark, Scala etc.
- Good experience in in developing Producers and Consumers for Kafka as well as custom Connectors for Kafka
- 1+ plus years of developing applications using Kafka (Architecture), Kafka Producer and Consumer APIs, Real-time Data pipelines/Streaming
- 1 plus years of experience performing Configuration and fine-tuning of Kafka for optimal production performance
- Experience in using Kafka APIs to build producer and consumer applications, along with expertise in implementing KStreams components. Have developed KStreams pipelines, as well as deployed KStreams clusters
- Strong knowledge of the Kafka Connect framework, with experience using several connector types: HTTP REST proxy, JMS, File, SFTP, JDBC, Splunk, Salesforce, and how to support wire-format translations. Knowledge of connectors available from Confluent and the community
- Experience with developing SQL queries and best practices of using KSQL vs KStreams will be an added advantage
- Expertise with Hadoop ecosystem, primarily Spark, Kafka, Nifi etc.
- Experience with integration of data from multiple data sources
- Experience with stream-processing systems: Storm, Spark-Streaming, etc. will be ad advantage
- Experience with relational SQL and NoSQL databases, one or more of DBs like Postgres, Cassandra, HBase, Cassandra, MongoDB etc.
- Experience with AWS cloud services like S3, EC2, EMR, RDS, Redshift will be an added advantage
- Excellent in Data structures & algorithms and good in analytical skills
- Strong communication skills
- Ability to work with and collaborate across the team
- A good "can do" attitude

