Principal Associate - Full Stack Engineering Job in Capital One
Principal Associate - Full Stack Engineering
- Bengaluru, Bangalore Urban, Karnataka
- Not Disclosed
- Full-time
Principal Associate Full Stack Engineering (GenAI Observability)
Location: Bangalore
Company: Capital One India
About Us
At Capital One India, we re tackling some of the most complex problems in financial services using machine learning, advanced analytics, and cloud-first engineering. Our mission is to build cutting-edge, patentable solutions that transform customer experiences, enhance operational efficiency, and ensure robust risk and compliance standards.
We re a team of makers, breakers, doers, and disruptors obsessed with turning data into real-world impact at scale.
About the Team Machine Learning Experiences (MLX)
The MLX team is pioneering the future of model governance, ML observability, and Generative AI infrastructure at Capital One. We re enabling teams to seamlessly deploy ML and GenAI models at scale, with full visibility into performance, health, compliance, and ethical usage.
This is the platform powering the next generation of AI-driven financial products across the company.
About the Role
We re looking for a Principal Associate Full Stack Engineer to lead the development of observability platforms for Generative AI systems. You ll be part of a cross-functional team focused on governance automation, LLM monitoring, and intelligent diagnostics using telemetry data, metadata, and advanced analytics.
You ll design systems to collect, analyze, and visualize performance data from our large-scale GenAI infrastructure, helping data scientists and engineers make faster, safer decisions.
What You ll Do
- Lead architecture and development of observability tools and dashboards for monitoring GenAI models and platform health.
- Design and build core APIs and SDKs to instrument large language models (LLMs) and foundational models (training, fine-tuning, prompting stages).
- Integrate Generative AI to enable observability features like anomaly detection, predictive analytics, and copilot-assisted troubleshooting.
- Partner with platform, MLOps, and governance teams to ingest and analyze telemetry, metadata, and runtime metrics at scale.
- Drive development of tools to ensure compliance with AI ethics, data governance, and industry regulations.
- Collaborate with product, design, and research to turn complex requirements into scalable, cloud-native software solutions.
- Lead proof-of-concept initiatives to test and showcase how GenAI can improve platform observability and decision-making.
- Contribute to the open-source community and stay at the forefront of GenAI and ML infrastructure evolution.
Basic Qualifications
- Bachelor s or Master s degree in Computer Science, Engineering, or related field
- 4+ years of experience building distributed, data-intensive systems using microservices architecture
- 4+ years of experience in backend development with Python, Go, or Java
- 4+ years of expertise with observability stacks (Prometheus, Grafana, ELK) and adapting them for AI systems
- Strong knowledge of OpenTelemetry, and experience building custom SDKs and APIs
- 5+ years of hands-on experience with Generative AI models, especially applied to observability, governance, or compliance
- 2+ years of experience with cloud platforms such as AWS, Azure, or GCP
Preferred Qualifications
- 4+ years building and optimizing ML systems in production environments
- 3+ years of experience with MLOps tools like MLflow, Kubeflow, or commercial platforms
- Experience with GenAI frameworks and libraries like LangChain, Haystack, and vector databases (FAISS, Chroma, OpenSearch)
- Familiarity with emerging observability tools for LLMs such as Langfuse, Phoenix, Helicone, or OpenInference
- Contributor to open-source GenAI or ML infrastructure projects
- Author or co-author of published work in AI/ML observability, governance, or performance monitoring
- Experience with PyTorch, TensorFlow, Spark, or Dask
- Knowledge of NVIDIA GPU telemetry, CUDA programming, and performance optimization for AI workloads
- Understanding of AI ethics, data governance, and regulatory frameworks for machine learning systems
Why Join Capital One India
- Work at the intersection of technology, AI, and compliance helping shape the future of responsible AI
- Join a team driving enterprise-wide adoption of Generative AI
- Collaborate with world-class engineers, data scientists, and product leaders
- Enjoy a high-performance culture that encourages innovation, learning, and mentorship
- Access to cutting-edge tools, open-source contributions, and cloud-native infrastructure
Qualification : Bachelors or Masters degree in Computer Science, Engineering, or related field