Pyspark Jobs in Mumbai
2 Jobs Found
Data Collection Transformation Senior Associate
Msci
Job Title: Data Collection Transformation Senior Associate Location: Mumbai Experience: Relevant experience in data acquisition, ESG data management, process automation, and data quality. Company: MSCI About MSCI: MSCI is a global leader in decision-support tools and services for the investment community. With over 50 years of expertise in research, data, and technology, we help clients understand key drivers of risk and return, enabling them to build more effective portfolios with enhanced transparency. At MSCI, we foster a culture of innovation, high performance, and inclusion, empowering our people to grow their careers through continuous learning and a wide range of internal mobility opportunities. Your Team: The ESG Data Collection team plays a critical role in acquiring, validating, and maintaining high-quality ESG data that powers MSCI s ESG products. As part of this team, you will work at the forefront of MSCI s ESG transformation agenda, driving projects that enhance data quality, scalability, and automation to meet the evolving ESG landscape and its growing importance in global financial markets. Your Key Responsibilities: Collaborate with internal ESG Research and Technology teams to design and operationalize data collection processes aligned with evolving ESG and Climate frameworks. Work on electronification of ESG policies and principles by converting them into structured, operational data definitions. Develop data collection templates and translate them into implementable data models. Conduct hands-on research and analysis of company disclosures to support scalable data collection solutions. Analyze collected and third-party datasets to detect patterns and trends, enabling development of automated anomaly detection frameworks. Design and implement contextual/thematic QA checks to strengthen data quality controls, leveraging historical data correction patterns. Collaborate with technology teams to build NLP-driven data extraction models (leveraging both traditional methods and LLMs) to automate identification and extraction of relevant ESG facts from disclosures. Help establish and optimize new data collection processes while ensuring seamless integration with existing workflows. Deliver high-quality data aligned with MSCI methodology, service-level agreements, and regulatory requirements. Contribute to creating methodology and SOP documentation, embedding data and content expertise into internal processes. Drive process automation by developing tools and systems for automated data quality diagnostics, reducing manual intervention. Build dashboards and reports to visualize data quality metrics, identify outliers, and provide data-driven recommendations to stakeholders. Partner with internal stakeholders, including downstream teams, Research, and Product teams, to understand data requirements and ensure seamless delivery. What We re Looking For: Strong analytical mindset with a keen attention to detail. Hands-on experience with Python/SQL for data analysis and process automation; exposure to Machine Learning/RPA is a plus. Experience working with visualization tools such as Power BI. Advanced Excel skills, with the ability to manipulate and analyze complex datasets. Self-starter with strong problem-solving abilities, capable of working in unstructured environments. Strong collaboration and communication skills, with comfort working across hierarchies, functions, and geographies. Previous experience in Financial Services, Technology, or Business Analysis, ideally with exposure to ESG data. Basic understanding of financial markets and asset classes; ESG knowledge would be a significant advantage. Preferred Qualifications: Bachelor s or Master s degree in Finance, Economics, Environmental Science, Data Science, Business, or a related field. Certifications in ESG, Sustainable Finance, or Data Analytics are a plus. What We Offer: Transparent compensation and comprehensive benefits tailored to your location. Flexible work options and access to cutting-edge technology. A culture of learning and development, with access to LinkedIn Learning Pro and Learning@MSCI. Clear career progression paths with opportunities for internal mobility and leadership development. A global network of talented colleagues, supported by inclusive Employee Resource Groups like Women in Tech, Climate Action Network, and more. Qualification : Bachelors or Masters degree in Finance, Economics, Environmental Science, Data Science, Business, or a related field.
Data Scientist 1
Mondelez
Company Overview Mondel z International is a leading global snacking company with iconic brands like Oreo, Cadbury, and Toblerone. The company s mission is to lead the future of snacking, creating delicious moments of joy for people around the world. As part of Mondel z's dynamic Digital Solutions team, this role will offer an exciting opportunity to drive innovation through AI, data, and analytics. Role Overview As a Data Scientist within the Mondelez Digital Solutions team, you will play a critical role in developing cutting-edge AI and GenAI solutions that will empower the US Sales & Planning teams. This role focuses on leveraging Analytical AI algorithms and Large Language Models (LLMs) to create actionable insights and transform commercial reporting processes. You'll work closely with multiple teams to implement these solutions and continuously improve the reporting and decision-making capabilities of the organization. How You Will Contribute 1. AI and GenAI Model Development Design, develop, and deploy Analytical AI models (e.g., linear regression, decision trees, boosted trees models) to gather insights related to customers, brands, categories, and products. Implement anomaly detection algorithms (e.g., isolation forest) to identify anomalies in commercial data and alert teams about significant changes. Utilize Large Language Models (LLMs) to summarize the current business state and explain trends in the data, supporting the commercial reporting team. 2. Data Preparation and Utilization Oversee the extraction, preparation, and utilization of large datasets for model training and validation, ensuring the data is clean, accurate, and relevant for AI model development. Continuously update and refine the AI/GenAI solution with new data sources and features to enhance business insights. 3. Collaboration and Implementation Work closely with cross-functional teams to ensure smooth deployment of AI and GenAI capabilities into the company s commercial reporting dashboards. Collaborate with stakeholders to ensure the tools meet business requirements and deliver high-quality insights. 4. Reporting and Documentation Regularly report on the achievements and challenges of the AI tools to senior management and relevant stakeholders. Maintain detailed documentation of AI models, GenAI solutions, logic, and performance metrics, ensuring the business can leverage insights effectively. 5. Continuous Improvement and Innovation Stay up to date with the latest trends in AI and GenAI and integrate new advancements into the reporting system to improve its efficiency and effectiveness. What You Will Bring 1. Required Skills & Experience Bachelor's degree in Information Systems/Technology, Computer Science, Analytics, or related fields. Strong analytical and critical thinking skills with the ability to translate complex data into actionable business strategies. Proficiency in handling large datasets using Python and SQL. Strong knowledge of data visualization tools such as Tableau and Power BI. Experience with cloud platforms (e.g., Google Cloud, Azure, Databricks) for deploying AI solutions. Experience with data preprocessing, cleaning, and ETL processes. Familiarity with Natural Language Processing (NLP) techniques. Knowledge of recent advancements and trends in GenAI and a passion for continuous learning in this field. 2. Additional Skills (Good to Have) Working knowledge of syndicated data (e.g., Nielsen/IRI or other retail sales data sources). Proficiency in using data containerization and orchestration tools for maintaining complex data workflows. Experience with programming languages like PySpark and R. Experience in leveraging Large Language Models (LLMs) to build GenAI solutions or proofs-of-concept. Experience with frameworks like Langchain for building language applications. Relocation and Support Within-country relocation support is available. For candidates voluntarily moving internationally, minimal support is offered through the Volunteer International Transfer Policy. Qualification : Bachelor's degree in Information Systems/Technology, Computer Science, Analytics, or a related field.
1 - 20 of 0 jobs
* No exact matches found. Showing closest results insteadNo results found
Modify search criteria or create an alert to get relevant jobs as soon as they’re posted