Shubham Mallick
Data & Application Architect | Technical Lead
Professional Summary
14 years of experience as a Senior Data/Application Architect specializing in large-scale distributed data systems, data governance (lineage, quality, compliance), and cloud-native platforms. Delivered enterprise solutions processing 50TB+ daily with 70% latency reduction and 99.9% SLA. Expertise in orchestration (Airflow/Kubernetes), observability (Splunk/Datadog/Prometheus), CI/CD, multi-cloud, and AI/ML integration across organizations.
Professional Experience
Atlassian
Sep 2025 - Present
Lead Engineering
- Leading data platform initiatives with Databricks/Spark for enterprise analytics & AI/ML workloads for the Go-To-Market product
- Establishing data governance standards, including quality frameworks and pipeline observability, across teams
- Building AI-ready data infrastructure on Databricks enabling GenAI model deployment and ML feature engineering
Salesforce
Jul 2022 - Aug 2025
Staff Engineering, Performance Engineering, Sales AI
- Architected cloud-native data platform with event-driven ticketing system, achieving 70% latency reduction using Spark/EKS/Kafka orchestration in the AWS ecosystem
- Implemented platform observability with Grafana/Splunk; established SLAs & data quality for monitoring CI/CD and application performance
- Contributed IaC automation (Terraform/EKS/Jenkins), reducing deployment time by 70%; enabled developer self-service
- Drove GenAI/AI integration with Agentforce into data infrastructure, achieving 25-40% performance improvement
Apple Inc.
Jun 2020 - Jul 2022
Technical Lead, Module Owner
- Led 12-member engineering team, architecting enterprise data platform processing 100GB+ daily with 99.9% SLA
- Built real-time data pipelines (Kafka) handling 1M+ msgs/hour with comprehensive data quality and lineage tracking
- Led cloud migration (on-prem to AWS) with Data Mesh principles, reducing costs by 50% and improving scalability by 300%
- Established data governance standards for data discovery, quality, and compliance across supply-chain domain
- Built data APIs with semantic models and metadata management serving 10+ internal analytics teams
Qualcomm
Nov 2017 - May 2020
Senior ML, Distributed System Engineer
- Architected large-scale distributed platform processing 20TB+ daily with Spark/Hive/MapR-DB in the Hadoop ecosystem
- Built AI/ML data pipelines with 92% model accuracy; optimized data infrastructure, reducing resource usage by 25%
- Implemented data quality & monitoring frameworks ensuring data reliability across enterprise analytics workloads
- Won Innovation Maestro and HaQkathon; presented innovative solutions to CxO leadership
- Designed orchestration workflows (Airflow) & data transformation frameworks, improving pipeline efficiency by 35%
- Partnered with engineering & analytics teams, enabling developer self-service access to 15+ data products
IBM
Dec 2016 - Nov 2017
Advisory Analyst, Azure Cloud SME
- Led 6-member team architecting multi-cloud data platform on Azure, delivered ahead of schedule
- Migrated 500GB+ of data with 99.9% integrity using Azure Data Factory and Service Bus; established data quality SLAs
- Implemented IaC automation and compliance frameworks meeting regulatory requirements across 3 geographic regions
PwC
Oct 2014 - Dec 2016
Senior Developer / Consultant
- Architected BI platform with OLTP/OLAP optimization, querying 1B+ rows in 2-5 seconds; delivered dashboards for business analytics
- Built scalable ETL pipelines processing 50M+ rows daily
- Implemented logical data modeling & schema versioning, improving data consistency by 40% across analytics workloads
- Collaborated with business stakeholders across Sales and Finance, enabling data-driven decision-making
- Won STAR Performer for data platform excellence & multiple client appreciation awards
Cognizant
Apr 2012 - Oct 2014
Application Developer
- Managed 24x7 production and application support for MetLife backend processing
- Automated processes across 5+ applications, reducing manual effort by 60%